InTDS ArchivebyJulian YipBuild Autonomous AI Agents with Function CallingTransform your chatbot into an agent that can interact with external APIsApr 2, 202410Apr 2, 202410
InTDS ArchivebyJulian YipPrompt Like a Data Scientist: Auto Prompt Optimization and Testing with DSPyApplying machine learning methodology to prompt buildingMay 5, 202410May 5, 202410
InTDS ArchivebyCameron R. Wolfe, Ph.D.The Basics of AI-Powered (Vector) SearchHow the modern AI boom has completely revolutionized search applications…Mar 18, 20242Mar 18, 20242
InTDS ArchivebyMarkus StollVisualize your RAG Data — Evaluate your Retrieval-Augmented Generation System with RagasHow to use UMAP dimensionality reduction for Embeddings to show multiple evaluation Questions and their relationships to source documents…Mar 3, 20248Mar 3, 20248
InTDS ArchivebyLeonie MonigattiAdvanced Retrieval-Augmented Generation: From Theory to LlamaIndex ImplementationHow to address limitations of naive RAG pipelines by implementing targeted advanced RAG techniques in PythonFeb 19, 202413Feb 19, 202413
InTDS ArchivebyHet TrivediDeploying LLMs Into Production Using TensorRT LLMA guide on accelerating inference performanceFeb 22, 20245Feb 22, 20245
InTDS ArchivebyHamza GharbiBuilding a Chat App with LangChain, LLMs, and Streamlit for Complex SQL Database InteractionBuild and deploy a chat application for complex database interaction with LangChain agents.Feb 9, 202411Feb 9, 202411
InTDS ArchivebyYanli LiuHow to Chat with Any Open Source LLM for Free with Your iPhoneBuilding an Open Source “ChatGPT” App on iPhone Using Ollama and Google Colab Free T4 GPUFeb 5, 20243Feb 5, 20243
InTDS ArchivebyIulia BrezeanuHow to Find the Best Multilingual Embedding Model for Your RAGOptimize the Embedding Space for Improving RAGJan 27, 20245Jan 27, 20245
Lukas HauzenbergerMultilabel Classification using Mistral-7B on a single GPU with quantization and LoRALLMs have impressed with there abilities to solve a wide variety of tasks, not only for natural language but also in a multimodal setting…Jan 16, 202411Jan 16, 202411
InTDS ArchivebyLuís RoqueMistral AI vs. Meta: Comparing Top Open-source LLMsA comparison between Mistral 7B vs Llama 2 7B and Mixtral 8x7B vs Llama 2 70BJan 23, 20247Jan 23, 20247
InTDS ArchivebyPye Sone KyawRunning Local LLMs and VLMs on the Raspberry PiGet models like Phi-2, Mistral, and LLaVA running locally on a Raspberry Pi with OllamaJan 14, 202423Jan 14, 202423
InStackademicbyFabio MatricardiTiny-Vicuna-1B is the lightweight champion of the Tiny ModelsCommand and Conquer: the smallest Vicuna flavor is the Tiny Master of Instruction, answers your every call (Flawlessly!)Jan 11, 20244Jan 11, 20244
Benjamin MariePhi-2: A Small Model Easy to Fine-tune on Your GPUInstruct fine-tuning and quantization on consumer hardwareJan 2, 20244Jan 2, 20244
InTDS ArchivebySheila TeoHow I Won Singapore’s GPT-4 Prompt Engineering CompetitionA deep dive into the strategies I learned for harnessing the power of Large Language Models (LLMs)Dec 29, 2023139Dec 29, 2023139
InTDS ArchivebyGerasimos Plegas 〽️Can an LLM Replace a FinTech Manager? Comprehensive Guide to Develop a GPU-Free AI Tool for CorpoDevelop your own zero-cost LLM, to extract corporate context, locallyDec 20, 20235Dec 20, 20235
InAI AdvancesbyGavin LiUnbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW TechniqueLarge language models require huge amounts of GPU memory. Is it possible to run inference on a single GPU? If so, what is the minimum GPU…Nov 18, 202338Nov 18, 202338
InTowards AIbyIvan Reznikov, PhDLangChain Cheatsheet — All Secrets on a Single PageThe onepager summarizes the basics of LangChain. LangChain cheatsheet includes llms, prompts, memory, indexes, agents, chains and colab…Nov 15, 20231Nov 15, 20231
InTDS ArchivebySamir SaciLeveraging LLMs with LangChain for Supply Chain Analytics — A Control Tower Powered by GPTBuild an automated supply chain control tower with a LangChain SQL agent connecting an LLM with a database using Python.Nov 17, 20234Nov 17, 20234
InTDS ArchivebyLuís RoqueThe Power of Retrieval Augmented Generation: A Comparison between Base and RAG LLMs with Llama2A deep dive into tailoring pre-trained LLMs for custom use cases using a RAG approach, featuring LangChain and Hugging Face integrationNov 29, 20234Nov 29, 20234