Independent Science + Technology

Author: Vasukumar P

How to Summarize the Audio Using Gemini Pro Multimodal

Post date August 22, 2024
Post author By Vasukumar P
Post categories In artificial-intelligence, audio, llm, python, streamlit

Local RAG with Llama3-instruct (Ollama)

Post date August 6, 2024
Post author By Vasukumar P
Post categories In langchain, llama-3, llm, ollama, text-generation

How to Build a Text, Image, and Audio-Capable Multimodal LLM (LLaVA + Whisper)

Post date June 18, 2024
Post author By Vasukumar P
Post categories In artificial-intelligence, large-language-models, open source, openai-whisper, python

Nothing left to load.