CaltechTHESIS
A Caltech Library Service

Visual and Spatial Representation Learning with Applications in Ecology

Citation

Cole, Elijah Henry John (2023) Visual and Spatial Representation Learning with Applications in Ecology. Dissertation (Ph.D.), California Institute of Technology. doi:10.7907/xp2k-es46. https://resolver.caltech.edu/CaltechTHESIS:06072023-210232983

Abstract

Machine learning has the potential to empower scientists, physicians, and other human experts working to solve problems of societal importance. To realize this goal, we need algorithms that can distill useful knowledge from real-world data. However, most machine learning research focuses on benchmarks that seldom reflect real-world challenges, such as learning from limited, noisy, or weak supervision. This thesis develops new benchmarks, algorithms, and problem settings that link fundamental machine learning research to impactful applications in ecology. In Part I, we provide context and motivation for our work. How and why should machine learning researchers work with domain experts on real-world problems? What is the appeal of ecology specifically Part II focuses on visual representation learning with an emphasis on label efficiency. We discuss the strengths and limitations of self-supervised learning, the relationship between concept specificity and representation learning, and multi-label learning with minimal labeled data. Part III covers our work in the emerging field on spatial representation learning. In particular, we consider the problem of modeling the spatial distribution of plant and animal species. We review this important ecological problem from a machine learning perspective before showing how deep learning can transform the way these models are applied (using spatial models to assist image classifiers) and developed (jointly learning spatial distributions and representations). Finally, Part IV concludes and highlights opportunities for future work.

Item Type: Thesis (Dissertation (Ph.D.))
Subject Keywords: computer vision; machine learning; deep learning; conservation technology
Degree Grantor: California Institute of Technology
Division: Engineering and Applied Science
Major Option: Computing and Mathematical Sciences
Awards: Amori Doctoral Prize in CMS, 2023.
Thesis Availability: Public (worldwide access)
Research Advisor(s):
  • Perona, Pietro
Group: Resnick Sustainability Institute
Thesis Committee:
  • Perona, Pietro
  • Yue, Yisong (chair)
  • Bouman, Katherine L.
  • Belongie, Serge J.
Defense Date: 10 May 2023
Funders:
Funding Agency Grant Number
National Science Foundation DGE1745301
Resnick Sustainability Institute UNSPECIFIED
U.S. Fish and Wildlife Service F22AP01490-00
Nissan Corporation UNSPECIFIED
Record Number: CaltechTHESIS:06072023-210232983
Persistent URL: https://resolver.caltech.edu/CaltechTHESIS:06072023-210232983
DOI: 10.7907/xp2k-es46
Related URLs:
URL URL Type Description
https://arxiv.org/abs/2103.16483 arXiv Article for Chapter 2
https://arxiv.org/abs/2105.05837 arXiv Article for Chapter 3
https://arxiv.org/abs/2207.10225 arXiv Article for Chapter 4
https://arxiv.org/abs/2106.09708 arXiv Article for Chapter 5
https://arxiv.org/abs/2107.10400 arXiv Article for Chapter 6
https://arxiv.org/abs/1906.05272 arXiv Article for Chapter 7
https://arxiv.org/abs/2306.02564 arXiv Article for Chapter 8
ORCID:
Author ORCID
Cole, Elijah Henry John 0000000166230966
Default Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 16093
Collection: CaltechTHESIS
Deposited By: Elijah Cole
Deposited On: 09 Jun 2023 14:58
Last Modified: 04 Feb 2025 23:36

Thesis Files

[img] PDF - Final Version
See Usage Policy.

15MB

Repository Staff Only: item control page