Longer Audios #168
Replies: 6 comments 11 replies
-
LLMs are known for their limited output number of tokes size. I'll work on it next; No clue what the solution will be since there isn't any known best practice for it but it will be fun to develop a robust solution for it. stay tuned |
Beta Was this translation helpful? Give feedback.
-
As of a couple of days ago you can choose 100 llm models to generate the
transcript including gpt4-o.
however, in my experience, llms are trained to give the shortest answer
possible
i'm currently working on a new multi-prompt transcript generation approach
leveraging a content chunking with contextual linking approach
In that way, we generate detailed transcripts per input chunck and then
stitch responses together while maintaining cohesion.
Easier said than done, but I am almost there.
Would love your feedback once done.
Best Regards,
…--
Thársis
souzatharsis.com <http://www.souzatharsis.com/>
<http://linkedin.com/in/tharsissouza>
On Sun, Nov 10, 2024 at 1:05 PM Guilherme Negri ***@***.***> wrote:
Thanks for the answer. Today, the LLM that generates the podcast script is
Gemini, right? What I do to get larger responses in LLMS is to use the
OpenAI API, I make the request using the gpt4-o model and create a thread,
which is a native option. This way, in addition to having the initial
context, it can remember previous requests for each request, working
similarly to the chatgpt app. So I ask it to send me the responses in
separate blocks, so I can receive more tokens. I believe this could be an
option to be tested.
—
Reply to this email directly, view it on GitHub
<#168 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADTMY3LQY7CJUZ2A67TUVS3Z7575LAVCNFSM6AAAAABROLFFTOVHI2DSMVQWIX3LMV43URDJONRXK43TNFXW4Q3PNVWWK3TUHMYTCMRQGUYDANQ>
.
You are receiving this because you commented.Message ID:
***@***.***>
|
Beta Was this translation helpful? Give feedback.
-
I've now managed to implement long-form podcast generation. Please see sample below. Sample 1: open-notebook.ai and its sub-pages
Sample 2: Some tech docs on surfsense.net shared by the author here
Sample 3: Transformer pdf paper |
Beta Was this translation helpful? Give feedback.
-
Sample 4: Gemini Prompt Engineering Guide Transcript: https://drive.google.com/file/d/1ed2Xl9zPLyXPQNqJEcxGedgp9cMyN97T/view?usp=drive_link |
Beta Was this translation helpful? Give feedback.
-
This has been implemented and released - I'd love feedback! [0.3.6] - 2024-11-13Added
|
Beta Was this translation helpful? Give feedback.
-
I want at least an hour. |
Beta Was this translation helpful? Give feedback.
-
What is the relationship between the size of the content sent to podcastify and the "word_count" setting to get longer audios? I tested several settings but the maximum I could get was 4 minutes.
Beta Was this translation helpful? Give feedback.
All reactions