When it comes to typesetting like this (usage of drawings/clips/animated text) you don't want to subject your users to rendering it. That image to ass program does a 1:1 translation from pixel to ass (last I saw of its discussion) -- definitely not practical especially if you're working frame by frame. Overlays are probably the way to go here.
|