fal
https://github.com/fal
fal / video-understanding
Ask a natural-language question about a video and get a detailed text answer — scene description, action recognition, on-screen text (OCR), and visual Q&A.