Palantir Technologies

Retrieval Augmented Generation in Multi-Cloud Environments


Project logo

Researched and developed replacement algorithms including Round-Robin & LRU for low-latency caches on Palantir's AI Platform. Built a backend for a proxy service handling 10,000+ IPs to optimally balance worldwide network traffic across distributed cloud infrastructure. Focused on optimizing RAG systems for enterprise-scale AI applications, reducing cache miss rates and improving response times for real-time intelligence queries.


Role

  • Engineering Fellow

Duration

3 months (Dec 2024 - Feb 2025)

Team

  • 1 Designer
  • 3 Engineers
  • 4 BD Reps

Results

  • Deployed to active combat
  • Used by 2,000 government officials