Automate Pinecone Daily Upsert Task with Celery and Slack monitoring
3 hours ago
Itâs been a while since my last LLM post and Iâm excited to share that my prototype has been successfully productionized as Outsideâs first LLM-powered chatbot, Scout. If you are an Outside+ member, you can check it out over at https://scout.outsideonline.com/.
This journey began as my weekend curiosity project back in March 2023. I had the idea to build a Q&A chatbot using OpenAIâs LLMs and Outsideâs content as a knowledge base. Later I shared my prototype at our internal product demo day and I was thrilled by the interest it managed to spark. Scout quickly became an official project. On November 28th 2023, we launched Scout to limited Outside+ members. Fast forward to today, April 12th, 2024, over 28.3k unique users have already utilized this Outdoor Companion AI tool.
I couldnât be more grateful for this moonstruck experience and Iâve been planning to write a mini-series to share some behind-the-scenes insights into what it takes to bring LLM & RAG powered apps to life. So far Iâve planned to cover the following three parts:
- 𦦠Part 1: Automate Pinecone Daily Upserts with Celery and Slack monitoring
- 𦦠Part 2: Building an LLM Websocket API in Django with Postman Testing
- 𦦠Part 3: Monitoring LLM Apps with Datadog: synthetic tests, OpenAI, and Pinecone usage tracking
This post will dive into Part 1, setting up scheduled tasks with Celery Beat to automatically upsert embeddings into the Pinecone vector database. And weâll set up slack updates for easy monitoring. Letâs get started!
LLMs typically have training data cut off date, the current gpt-4-turbo was cut off at 2023-Dec (to my writing day -2024-April). The promise of using RAG is that we can equip LLMs with more fresh and domain specific data to reduce hallucinations and improve user experience. Thus the question: how can we keep the knowledge base fresh and up to date? The answer is â using Celery and Celery Beat to schedule a periodical task (daily or weekly) to embed newly publishedâ¦