Topic Extraction
- The extraction graph creates an endpoint which accepts audio files and transcribes them using OpenAI's Whisper model
- The transcription is fed into an LLM for Topic Extraction.
- The transcription is fed into a summarization model to summarize the entire transcript.
Code Reference
graph.yaml
- contains the extraction graph.setup_graph.py
- Sets up the extraction graph in Indexify Serverupload_and_retrieve.py
- Uploads audio into the extraction graph, waits for extraction and finally retrieves from the endpoint.
Download & Start Indexify Server
Download & Join Indexify Extractors
Terminal 2
virtualenv ve
source ve/bin/activate
pip install indexify-extractor-sdk
indexify-extractor download tensorlake/whisper-asr
indexify-extractor download tensorlake/summarization
indexify-extractor download tensorlake/openai
indexify-extractor join-server
Setup the Graph
![](https://docs.getindexify.ai/example_code/audio/topic_extraction/carbon.png)
Upload Data and Retrieve
The next step is to upload an audio file and retrieve the transcript