Background Paths
Background Paths

Hi, I'm Saikiran Reddy Jakka

Software Engineer

MS Computer Science graduate student at Stony Brook University. Interested in working with scalable backend systems, advanced machine learning, and building innovative AI-driven applications.

Currently available for software engineering roles
Distributed SystemsMachine LearningBackend EngineeringCloud Infrastructure

About.

Background

I am a graduate researcher and software engineer specializing in distributed systems and applied machine learning. Based at Stony Brook University, my academic and professional work is driven by a focus on building systems that are theoretically sound, performant, and reliable under scale.

Currently, I serve as a Research Assistant developing supply chain intelligence platforms driven by Ergo AI. I work on bridging the gap between statistical machine learning and constraint-based symbolic reasoning, ensuring that AI-driven policy decisions remain verifiable and logically consistent.

My engineering ethos revolves around understanding systems from first principles—whether that entails implementing consensus algorithms from raw specifications or architecting cloud-native data pipelines with strict consistency bounds.

Education

Master of Science in Computer Science

Stony Brook UniversityExpected May 2026

Coursework: Distributed Systems, Machine Learning, Operating Systems, Advanced Algorithms.

Bachelor of Technology in Information Technology

Chennai Institute of TechnologyAug 2019 - May 2023

Graduated with Honors. Focused on Database Management and Core System Design.

Research Focus

Distributed Systems & Consensus

Investigating fault-tolerant architectures and highly available state machines. Deeply interested in the performance characteristics and safety proofs of Paxos, PBFT, and Raft derivatives in adversarial network conditions.

Machine Learning & Verifiable AI

Focusing on the intersection of deep learning and symbolic logic to create explainable, constrained AI pipelines. Researching robust time-series forecasting and inventory optimization for complex supply chains.

Scalable Infrastructure Design

Studying the orchestration of distributed deployments using GitOps and infrastructure-as-code principles. Focused on zero-downtime microservices and cloud-native resilience.

Technical Stack.

Machine Learning & AI

Building intelligent retrieval systems and multi-agent architectures.

PyTorch & scikit-learnRAG & Vector SearchNLP & EmbeddingsLLM Agents & Orchestration
Distributed Systems

Architecting fault-tolerant, high-throughput database and consensus systems.

Raft Consensus & ReplicationLSM-tree Storage EnginesConcurrency & ACID TransactionsRust Systems Programming
Cloud & Infrastructure

Automating resilient, observable deployments across cloud providers.

AWS, GCP & AzureDocker & KubernetesTerraform & AnsibleInfrastructure as Code
Programming Languages

Choosing the right tool for the job across systems, backend, and ML domains.

PythonRustGoTypeScript / JavaScriptC / C++

Designing real-time streaming pipelines and high-performance storage systems.

Apache Kafka & StreamingPostgreSQL & MySQLRedis & BigQueryETL & Data Pipelines
Data Engineering

Building production-grade backends and responsive frontend interfaces.

FastAPI & REST APIsReact / Next.jsTailwind CSSgRPC & Microservices
Web & API Engineering

Automating build, test, and deploy pipelines for reproducible releases.

GitHub Actions & JenkinsGitOps & Release PipelinesLinux AdministrationPrometheus & Grafana
DevOps & CI/CD

Implementing authentication, authorization, and audit systems from scratch.

RBAC & AuthenticationLDAP & KerberosAudit Logging & ComplianceNetwork Security
Security & Access Control

Machine Learning & AI

My ML work started with NLP and recommendation pipelines at HCL Technologies, where I built sentiment classifiers and feature extraction workflows using scikit-learn and PyTorch. At Stony Brook, I took it further designing Graph-Tree RAG, a hybrid retrieval pipeline combining FAISS vector search, BM25, and graph traversal that improved multi-hop accuracy by 37%. I also built a Self-Evolving Agentic AI system with semantic memory and multi-LLM orchestration, eliminating 60% of manual workflow steps through adaptive prompt strategies.

Experience.

Career Path

Building scalable infrastructure and intelligent systems across academia and enterprise.

Graduate Research And Teaching Assistant

Stony Brook UniversityPart-time
Aug 2025 to Present8 mos
Stony Brook, New York, United StatesOn-site

Software Development Intern

HCLTechInternship
Oct 2023 to Mar 20246 mos
Chennai, Tamil Nadu, IndiaRemote

Data Analyst Intern

ExcelR SolutionsContract
Sep 2022 to Nov 20223 mos
Chennai, Tamil Nadu, IndiaRemote

Get the highlights.

A deeper dive into the projects that define my technical journey.

In Progress
Machine LearningLogical ReasoningConstraint SolvingTime Series Forecasting

Ergo AI: Intelligent Supply Chain Optimization

Building a supply chain intelligence platform powered by Ergo AI, combining machine learning, symbolic reasoning, and constraint-based logic to enforce consistent, verifiable, and explainable decision-making.

Completed
LLMsAgent SystemsPythonPrompt Engineering

Budget GPT: Optimizing LLM Inference

Built a 'fast + slow' agent framework to reduce overthinking in Large Reasoning Models (LRMs). A Manager LLM predicts an optimal token budget, while a Reflector performs verbal reinforcement learning via natural language error analysis.

In Progress
RustDistributed DatabasesLSM-TreeMVCC

NexusDB: Distributed MYSQL Database Systems

Architected a distributed SQL database in Rust with an LSM-tree KV engine, SageTree, MVCC, and a custom SQL stack, supporting ACID transactions and scalable query execution handling ~150K rows/sec.

Completed
RustVerusFormal VerificationDistributed Systems

Raft Protocol Implementation in Rust with Verus

Developed a high-assurance Raft/Paxos consensus implementation in Rust, integrated with formal verification using Verus to prove critical safety and liveness guarantees like agreement and validity.

Completed
GoPaxosTwo Phase CommitDistributed Sharding

Scalable Banking System with Modified Paxos & 2PC

Implemented a fault-tolerant distributed transaction processing system integrating Modified Paxos, sharding, replication, and Two-Phase Commit to achieve strong consistency over 3 replicated shards for 3K+ clients.

Completed
Data AnalyticsPythonData VisualizationLevenshtein Distance

Education Statistics & Teaching Quality Metrics

Engineered a data-driven teaching quality metric evaluating 500+ U.S. universities by integrating datasets from RateMyProfessors, College Scorecard, and IPEDS using advanced standardization techniques.

Completed
GoDistributed AlgorithmsBlockchainPBFT

Fault-Tolerant Distributed Systems with PBFT

Engineered a fault-tolerant distributed transaction processing system in Go, implementing a modified linear PBFT protocol to ensure consensus across multiple servers in adversarial network environments.

Completed
DevOpsDockerKubernetesGit

GitOps ML Application Deployment

Built and deployed a machine learning web application using GitOps, CI/CD pipelines, and DevOps automation, improving deployment speed and security by 40%.

Completed
KubernetesContainerizationJenkinsMicrosoft Azure

Automated Web App Deployment CI/CD

Applied Git, Jenkins, and Docker to automate the deployment process, cutting manual errors by 75%. Implemented Kubernetes & Terraform for scaling infrastructure and in-built load balancing.

Completed
Microsoft Power BITableauData AnalyticsMicrosoft Excel

FMCG Analytics: Market Insights

Performed in-depth analytics to gain valuable market insights and understand consumer behavior within the Fast-Moving Consumer Goods (FMCG) industry assisting with promotional campaigns.

Get in touch.

Interested in collaboration or research opportunities? I'd love to hear from you.

Current Status

Graduate Student at Stony Brook
Research Assistant
Open to Opportunities

© 2025 Saikiran Reddy Jakka. Built with Next.js, TypeScript & Tailwind CSS