- Design, implement, and optimize LLM-powered systems (e.g., RAG, chat agents, summarizers, knowledge graph integration).
- Build and manage data indexing and retrieval pipelines using LlamaIndex, LangChain, or similar frameworks.
- Implement and maintain vector databases (e.g., Pinecone, Neo4j, Weaviate, Chroma, or Azure Cognitive Search).
- Integrate open-source and proprietary LLMs (e.g., GPT, Claude, Llama) into the CoreStory Platform.
- Develop and refine AI-driven features including generative insights, automated summarization, and narrative analytics.
- Collaborate with DevOps and backend teams to deploy scalable AI services within cloud infrastructure.
- Continuously benchmark model performance, latency, and cost, identifying opportunities for optimization.
- Contribute to internal documentation, experimentation frameworks, and evaluation methodologies.
PythonMachine LearningFastAPI+3 more