Software Engineer & AI Builder

Ke (Coco) Zhao

SDE II · Amazon · Seattle, WA

Full-stack engineer with 7+ years building production AI systems and scalable platforms. I architect multi-agent AI infrastructure, design real-time data pipelines, and bridge research with engineering — from brain imaging labs to enterprise AI at Amazon.

Experience LinkedIn GitHub Email

Years Building

40M+

Docs / Day

Papers

207

Citations

About

Engineer × Researcher × Builder

Hey, I'm Coco — a software engineer who straddles the line between production engineering and applied research. I grew up in Wenzhou, China, moved to Philadelphia at 16, and graduated from UPenn with a double major in Computer Science and Cognitive Science, Magna Cum Laude.

At Amazon, I build enterprise AI infrastructure that handles massive scale — multi-agent systems powered by LLMs, real-time data pipelines, and AI safety systems protecting data integrity for thousands of downstream users.

Before Amazon, I co-founded a behavioral-assessment tech startup (acquired in 2022), and spent four years running neuroimaging research at UPenn, Princeton, and UPenn Medicine — publishing 8 papers along the way.

Outside of work

AI Systems at Scale

Multi-agent AI with LLMs, RAG/GraphRAG, MCP — processing 40M+ documents daily at 99.99% uptime.

Full-Stack Engineer

React to Lambda to OpenSearch — end-to-end ownership across the full stack at enterprise scale.

Researcher Turned Engineer

8 peer-reviewed papers, 207 citations in computational neuroscience. I translate science into production systems.

Startup Co-Founder

Built ForeverBrainTech from 0 to 100k+ users and 10+ engineers before acquisition in 2022.

Career

Work Experience

Amazon.com Services LLC

Software Development Engineer II

Current

Seattle, WA July 2022 – Present

Architected full-stack enterprise data platform (React, Lambda/EC2, OpenSearch) processing 40M+ documents daily from 20+ sources with 99.99% uptime; serves 10k+ users with <1s p95 latency across 600k+ daily requests.
Designed multi-agent AI system using Claude (Bedrock) with MCP for agent coordination and RAG/GraphRAG for knowledge retrieval; built an incident-analysis agent that cut investigation time from 3 hours to 3 minutes — a 99% efficiency gain.
Built real-time data ingestion pipeline (SNS/SQS/Kinesis) achieving <5 min latency; developed REST APIs with OAuth, rate limiting, and access controls — exposing data to 100+ downstream teams.
Implemented AI safety guardrails: prompt injection defense, PII detection/scrubbing (>99.9% accuracy), hallucination monitoring, and automated security controls; led products through multiple security reviews.
Maintains high velocity with 150+ code reviews in 6 months; mentors 5+ engineers; bridges scientist requirements with production engineering across product, analytics, and engineering teams.

PythonJavaReact AWS LambdaOpenSearchClaude/Bedrock RAG/GraphRAGSNS/SQSDynamoDBMCP

ForeverBrainTech Ltd.

Co-founder & Software Development Lead

Acquired

Shanghai, China (Remote) May 2020 – June 2022

Architected a full-stack SaaS behavioral-assessment platform (React, Node.js, PostgreSQL) scaling to 100k+ users across 5 provinces; designed for horizontal scaling supporting 10x growth.
Built end-to-end ML data pipeline: real-time behavioral data collection → analysis engine → automated report generation; wrote 20,000+ lines of production code with data scientists and psychologists.
Scaled engineering team from 0 to 10+ engineers. Company acquired by HengNao Tech (Summer 2022).

ReactNode.jsPostgreSQL Machine LearningSaaS

CNDS @ UPenn & Computational Memory Lab @ Princeton

Research Assistant

Philadelphia, PA September 2018 – May 2022

Contributed to RT-Cloud open-source platform: redesigned cloud architecture for real-time fMRI neurofeedback experiments, integrating OpenNeuro datasets into processing pipelines (published in NeuroImage, 2022).
Built full-stack remote research platform (React, Node.js, MongoDB) serving 100k+ participants during COVID-19, enabling multi-site behavioral data collection when physical labs closed.
Developed computational models using PyTorch/PsyNeuLink; published 8 peer-reviewed papers (207 citations) in computational neuroscience.

PythonPyTorchfMRI ReactMongoDBAWS

Education

Academic Background

In Progress

NYU Tandon School of Engineering

MS, Cybersecurity

January 2026 – Present

Deepening expertise in security architecture, threat detection, and risk management to better protect systems in today's AI-powered world.

Cyber Fellow Scholarship

Completed

University of Pennsylvania

BA, Computer Science & Cognitive Science

Graduated May 2022 · Magna Cum Laude

Double major spanning AI, algorithms, data systems, perception, decision-making, and neuroeconomics. Four years of parallel research in neuroimaging labs.

Technical Skills

What I Work With

Languages

PythonJava JavaScriptTypeScript

Frontend & Full-Stack

ReactNode.js RESTful APIsWebSockets

Backend & Infrastructure

Spring BootAWS Lambda EC2Microservices Distributed Systems

AI / ML

LLMs (Claude/Bedrock) Multi-Agent Systems RAG / GraphRAG MCPPyTorch Harness Engineering

Data Engineering

SNS/SQS/KinesisOpenSearch DynamoDBSQL ETLReal-time Streaming

Cloud & DevOps

AWS (Lambda, S3, SageMaker) KubernetesCI/CD API Gateway

Security

Auth/Authorization PII Protection Prompt Injection Defense Threat Modeling

Portfolio

Featured Projects

Enterprise AI Data Platform @ Amazon

Full-stack platform (React + Lambda/EC2 + OpenSearch) aggregating data from 20+ sources, serving 10k+ users with <1s p95 latency and 99.99% uptime. Exposes 600k+ daily requests via secure REST APIs to 100+ downstream teams.

Multi-Agent AI Incident Analysis System

Designed a multi-agent AI system using Claude (Bedrock) with MCP for coordination and RAG/GraphRAG for knowledge retrieval. Reduced incident triage time from 3 hours to 3 minutes — a 99% efficiency gain during high-stakes operational events.

AI Safety Guardrails System

Designed and implemented production AI safety infrastructure: prompt injection defense, PII detection/scrubbing (>99.9% accuracy), hallucination monitoring, and automated security controls — protecting data for 10k+ users across 100+ downstream teams.

ForeverBrainTech — Behavioral Assessment SaaS

Co-founded and architected a full-stack behavioral assessment platform scaling to 100k+ users across 5 provinces. Built an end-to-end ML pipeline for automated behavioral reporting. Scaled team from 0 to 10+ engineers before acquisition.

CogBrain — Remote Research Platform

Built a full-stack web platform serving 100k+ research participants during COVID-19. Enabled concurrent multi-site studies with real-time data aggregation when physical labs closed worldwide.

RT-Cloud — Real-time fMRI Platform

Contributed to BrainIAK's open-source RT-Cloud: redesigned cloud architecture for real-time fMRI neurofeedback experiments, integrating OpenNeuro into processing pipelines. Published in NeuroImage, 2022.

Academia

Research & Publications

Four years of cognitive neuroscience research at UPenn, Princeton, and the University of Rochester — studying decision-making, sleep, and brain imaging. 8 published papers · 207 citations.

Nature and Science of Sleep

Mao, T., Dinges, D., Deng, Y., Zhao, K., et al. Impaired vigilant attention partly accounts for inhibition control deficits after total sleep deprivation and partial sleep restriction.

Frontiers in Neuroscience

Arya, N., Vaish, A., Zhao, K., & Rao, H. Neural mechanisms underlying breast cancer related fatigue: a systematic review of neuroimaging studies.

AAAI ICWSM

Giorgi, S., Zhao, K., Feng, A.H., & Martin, L.J. Author as character and narrator: Deconstructing personal narratives from the r/amitheasshole Reddit community. Proceedings of the International AAAI Conference on Web and Social Media, 17.

NeuroImage

Wallace, G., Polcyn, S., Brooks, P.P., Mennen, A.C., Zhao, K., et al. RT-Cloud: A cloud-based software framework to simplify and standardize real-time fMRI.