CaltechTHESIS
A Caltech Library Service

White Elephants and Cash Cows: Economically Wrangling the Zoo of AI Models

Citation

Zellinger, Michael J. (2026) White Elephants and Cash Cows: Economically Wrangling the Zoo of AI Models. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/xj31-xm14. https://resolver.caltech.edu/CaltechTHESIS:07282025-205651638

Abstract

The capabilities of artificial intelligence are rapidly expanding, but deploying AI systems in practice still poses significant challenges. Specifically, practitioners find limited guidance on selecting the most suitable AI model for a concrete use case, balancing the economics of an AI deployment, and managing the risk of AI errors. These challenges call for a unified framework addressing pain points in a conceptually clear and statistically sound manner. In this thesis, we present several components of such a framework: 1) uncertainty-aware system optimization, 2) economic evaluation, 3) error reduction with human-in-the-loop, and 4) a proof-of-concept system for synthetic data generation. Our work presents novel technical and conceptual approaches for orchestrating natural language-based systems, advancing the economical and reliable deployment of artificial intelligence.

Item Type: Thesis (Dissertation (Ph.D.))
Subject Keywords: artificial intelligence, machine learning, natural language processing
Degree Grantor: California Institute of Technology
Division: Engineering and Applied Science
Major Option: Computing and Mathematical Sciences
Thesis Availability: Public (worldwide access)
Research Advisor(s):
  • Thomson, Matthew (advisor)
  • Bühlmann, Peter (co-advisor)
Thesis Committee:
  • Perona, Pietro (chair)
  • Wierman, Adam C.
  • Thomson, Matthew
  • Bühlmann, Peter
Defense Date: 17 July 2025
Record Number: CaltechTHESIS:07282025-205651638
Persistent URL: https://resolver.caltech.edu/CaltechTHESIS:07282025-205651638
DOI: 10.7907/xj31-xm14
Related URLs:
URL URL Type Description
https://openreview.net/forum?id=YCBVcGSZeR Publisher Article adapted for Ch. 2: Rational Tuning of LLM Cascades via Probabilistic Modeling
https://arxiv.org/abs/2507.03834 arXiv Article adapted for Ch. 3: Economic Evaluation of LLMs
https://arxiv.org/abs/2507.14406v1 arXiv Article adapted for Ch. 4: Fail Fast, or Ask: Mitigating the Deficiencies of Reasoning LLMs with Human-in-the-Loop Systems Engineering
https://doi.org/10.1007/s00357-025-09501-w DOI Article adapted for Ch. 5: Natural Language-Based Synthetic Data Generation for Cluster Analysis
ORCID:
Author ORCID
Zellinger, Michael J. 0009-0001-7499-148X
Default Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 17566
Collection: CaltechTHESIS
Deposited By: Michael Zellinger
Deposited On: 11 Aug 2025 20:14
Last Modified: 18 Aug 2025 20:37

Thesis Files

[img] PDF - Final Version
See Usage Policy.

12MB

Repository Staff Only: item control page