Yes, this is kind of what I'm doing! I feel like the answer to this question of source text limits really depends on what source you're using. If you're using a film, vs. a tv series, vs. MVs, how you approach your source corpus could be very different!
no subject