"First end-to-end automated story video workflow of its kind"
Videos like these [1][2] are a dime a dozen and have been out for years. Surprised you think this is completely novel.
[1] https://www.youtube.com/watch?v=tzE7TYwAYq4 [2] https://www.youtube.com/watch?v=az7KfOQkMu0
Patrick here. Yeah, there's a lot of AI video out there. There are also plenty of better, more story-driven ones nowadays: https://runwayml.com/gen48
None of these are end-to-end automated though. Even for a video without a story, like the Harry Potter Balenciaga style ones, there's a lot of manual cherry-picking and manual editing going on. Here's a process example for that type of content that looks quite automatable: https://www.youtube.com/watch?v=TGD8zKvRxc4 - but 1. no one _has_ automated this, and 2. it's much more difficult than it looks because of the manual cherry-picking part, and the story, character/environment consistency, etc.
I am really looking for another instance of "type text, get story video". I do think it's a bold claim that we're first, but I haven't seen a counterexample yet.
Curious, why Mandarin only? Is there something about the language that makes it more feasible or more attractive to the business? FWIW I would consider using something like this if it could teach Japanese.
We plan to expand to all languages (the only real limit is whether a language has enough training data for LLMs / TTS models, so all popular languages should work).
Mandarin is just first for us because:
1. Thomas was already learning it
2. We can talk to users in English
3. We have some native-speaking friends who helped early
4. It has excellent support in AI products (second only to English)