π©π½βπ» About Me
Welcome! I'm Shreya Shankar. I am a machine learning (ML) engineer and computer scientist in the Bay Area.
I am completing my PhD in data management systems for ML, with a human-centered focus. I am fortunate to be advised by Dr. Aditya Parameswaran at UC Berkeley. Go Bears! π»
I also consult on ML engineering and production AI strategy for enterprises. Prior to my PhD, I was the first ML engineer at a startup, did research engineering at Google Brain, and engineering at Facebook. Before all of that, I did my BS and MS in computer science at Stanford. Go Trees! π²
Research
I study ML and artificial intelligence (AI) engineers and how to build better tools for our community. I am particularly interested in data quality for production AI systems, human-centered ML workflows, and pretty much any other MLOps challenge. I'm working on the following:
Data Management Systems
- Bespoke data quality constraints for complex, end-to-end ML and LLM pipelines
- Maintaining LLM-generated sketches over streams of unstructured data
- Real-time, approximate monitoring of ML pipeline performance (e.g., accuracy) with feedback delays
Human-Computer Interaction
- Interfaces for managing the MLOps lifecycle of evaluation, monitoring, and fine-tuning
- Low-code tools to program complex LLM workflows
If you are interested in working on any of these projects or collaborating, please contact me via email. In your email, please include which project(s) you are interested in working on, or any of my papers that you found interesting. I am open to undergrads who have taken a databases course and/or a graduate level machine learning course. If you go to UC Berkeley, please mention that in your email. If you have experience working as an ML engineer, data engineer, or data scientist in industry (even as an intern), please mention that as well. I apologize if I am unable to respond to your email in a timely manner.
Mentorship
I am fortunate to work with many talented students at UC Berkeley. Below is a list of students I am currently mentoring or have mentored for a year or more.
Current Students
- Reya Vir (UC Berkeley undergraduate) - Working on a benchmark for synthesizing data quality constraints for LLM applications.
- Quentin Romero Lauro (University of Pittsburgh undergraduate, doing an REU at UC Berkeley) - Developing interfaces for iterating on retrieval-augmented generation (RAG) architectures for LLM applications.
- Rachel Lin (UC Berkeley master's student) - Developing interfaces for iterative dataset search with LLMs; co-mentored with Madelon Hulsebos.
Past Students
- Parth Asawa (former UC Berkeley undergraduate) - Worked on data quality constraints for LLM applications and declarative LLM workflows. Now pursuing a PhD at UC Berkeley.
- Yujie Wang (former UC Berkeley undergraduate) - Worked on monitoring ML performance metrics without ground-truth labels. Now at Google.
- Aditi Mahajan (former UC Berkeley undergraduate) - Worked on unit tests for end-to-end ML pipelines. Now at Google.
Contact
To reach me, you can email shreyashankar@berkeley.edu. I am also on Twitter and Github.
My (typically-outdated) CV is available for download here.