|
Liana Patel
I'm a fifth year CS PhD student at Stanford University, where I'm very fortunate to be advised by
Matei Zaharia and
Carlos Guestrin.
Prior to my PhD, I had the honor of working with Professor Boon Thau Loo at UPenn.
My research interest are in building scalable and efficient systems that support knowledge-intensive AI applications, which leverage large amounts of data.
Email /
Scholar /
Twitter /
Github
|
|
|
|
Semantic Operators: A Declarative Model for Rich, AI-based Analytics Over Text Data
Liana Patel,
Sid Jha,
Melissa Pan,
Harshit Gupta,
Parth Asawa,
Carlos Guestrin,
Matei Zaharia
VLDB, 2024
Project Page
/
Preprint
/
Github
LOTUS is a query engine for LLM-powered data processing over. LOTUS introduces the semantic operator model, a declarative programming model for AI-based data transformations.
LOTUS allows programmers to write state-of-the-art AI pipelines in a few lines of code for a wide array of applications, including fact-checking, biomedical extraction, complex search and ranking, and research analysis.
|
|
|
DeepScholar-Bench: A Live Benchmark and Autoamted Evaluation for Generative Research Synthesis
Liana Patel,
Negara Arabzadeh,
Harshit Gupta,
Ankita Sundar,
Ion Stoica,
Carlos Guestrin,
Matei Zaharia
arXiv, 2024
Project Page
/
DeepResearch Preview
/
Preprint
/
Github
DeepScholar-Bench is a live benchmark and holistic, automated evaluation framework to evaluate generative research synthesis.
We also develop DeepScholar-base, a reference pipeline implemented efficiently using the LOTUS API.
Overall, we find that DeepScholar-bench remains far from saturated.
|
|
|
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Asim Biswal*,
Liana Patel*,
Sid Jha,
Amog Kamsetty,
Shu Liu,
Joseph Gonzalez,
Carlos Guestrin,
Matei Zaharia
CIDR, 2025
Github
/
Preprint
Table-Augmented Generation (TAG) is
a unified and general-purpose paradigm for answering natural language questions over databases. TAG generalizes and outperforms prior methods, such as RAG and Text2SQL on a benchmark of complex NL questions.
|
|
|
ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured Data
Liana Patel,
Peter Kraft,
Carlos Guestrin,
Matei Zaharia
SIGMOD, 2024
Github
/
Paper
ACORN is an index for state-of-the-art search over vector embeddings and structured data. ACORN introduces predicate-agnostic index construction and search methods, which allows it to serve a wide range of queries with arbitrary predicates, while also outperforming prior methods.
|
|
|
Compass: Encrypted Semantic Search with High Accuracy
Jinhao Zhu,
Liana Patel,
Matei Zaharia,
Raluca Ada Popa,
Preprint, 2024
Paper
Compass is a semantic search system over encrypted data that offers high accuracy, comparable to state-of-the-art plaintext search algorithms, while protecting data, queries and search results from a fully compromised server.
|
|