How to Summarize the Audio Using Gemini Pro Multimodal
Post date: August 22, 2024
Post author: Vasukumar P
Post categories: artificial-intelligence, audio, llm, python, streamlit

Local RAG with Llama3-instruct (Ollama)
Post date: August 6, 2024
Post author: Vasukumar P
Post categories: langchain, llama-3, llm, ollama, text-generation

How to Build a Text, Image, and Audio-Capable Multimodal LLM (LLaVA + Whisper)
Post date: June 18, 2024
Post author: Vasukumar P
Post categories: artificial-intelligence, large-language-models, open source, openai-whisper, python