Understanding, Formally Characterizing, and Robustly Handling Real-World Distribution Shift (CMU PhD Thesis, 2024)
kilthub.cmu.edu/articles/thesis/Understanding_Formally_Ch...
A 2024 CMU ML PhD thesis providing theoretical grounding for distribution shift robustness; relevant to AI safety researchers interested in formal guarantees, out-of-distribution generalization, and reliable deployment of ML systems.
Metadata
Importance: 55/100 · book chapter · primary source
Summary
Elan Rosenfeld's CMU PhD thesis develops theoretical and empirical foundations for handling distribution shift in ML systems, covering adversarial robustness certification, latent variable models of distribution shift using causal structure, and empirical analysis of real-world data variation. It introduces the concept of 'environment/intervention complexity' as a core measure for domain generalization and causal representation learning.
Key Points
- Proposes scalable methods for certifying deep neural network robustness to adversarial attacks on test samples, training data, or any model-influencing input.
- Develops latent variable models of distribution shift grounded in causality, enabling formal analysis of multi-distribution robust learning methods.
- Introduces 'environment/intervention complexity' as a statistical measure quantifying identifiability conditions for domain generalization.
- Empirically investigates real-world heavy tails and distribution shift to understand practical failures of modern ML systems.
- Argues benchmarks fundamentally cannot capture all real-world variation, motivating formal characterizations of shift structure.
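To make the first key point concrete: one widely used family of scalable certification methods is randomized smoothing, where a base classifier is queried on Gaussian-perturbed copies of the input and a certified L2 radius follows from the top class's estimated probability. The sketch below is illustrative only, assuming a hypothetical 1-D toy classifier; it is not the thesis's specific method, and the plug-in radius shown omits the confidence-interval correction a rigorous certificate requires.

```python
# Illustrative randomized-smoothing-style certification sketch.
# Assumption: toy_classifier and all parameters here are hypothetical,
# chosen only to show the shape of the technique.
import random
from statistics import NormalDist


def toy_classifier(x: float) -> int:
    # Hypothetical two-class base classifier: class 0 if x < 0, else class 1.
    return 0 if x < 0 else 1


def smoothed_predict_and_radius(x: float, sigma: float = 0.5,
                                n: int = 10_000, seed: int = 0):
    """Monte-Carlo estimate of the smoothed classifier's prediction and a
    plug-in certified L2 radius sigma * Phi^{-1}(p_top) (no confidence bound)."""
    rng = random.Random(seed)
    counts = [0, 0]
    for _ in range(n):
        # Vote of the base classifier under Gaussian input noise.
        counts[toy_classifier(x + rng.gauss(0.0, sigma))] += 1
    top = max(range(2), key=lambda c: counts[c])
    p_top = counts[top] / n
    # No certificate unless the top class wins a strict majority of the noise.
    radius = sigma * NormalDist().inv_cdf(p_top) if p_top > 0.5 else 0.0
    return top, radius


if __name__ == "__main__":
    pred, radius = smoothed_predict_and_radius(1.0)
    print(pred, round(radius, 2))
```

The smoothed prediction is robust because any perturbation smaller than the radius cannot flip which class holds the majority under the noise distribution; scaling this idea to deep networks, training-data attacks, and other model-influencing inputs is the kind of extension the thesis addresses.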
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| AI Distributional Shift | Risk | 91.0 |
Cached Content Preview
HTTP 200 · Fetched Apr 7, 2026 · 5 KB
Item - Understanding, Formally Characterizing, and Robustly Handling Real-World Distribution Shift - Carnegie Mellon University - Figshare
DOI: https://doi.org/10.1184/R1/26312050
Thesis posted on 2024-07-23, 16:42, authored by Elan Rosenfeld. Download (12.74 MB).

Distribution shift remains a significant obstacle to successful and reliable deployment of machine learning (ML) systems. Long-term solutions to these vulnerabilities can only come with the understanding that benchmarks fundamentally cannot capture all possible variation which may occur; equally important, however, is careful experimentation with AI systems to understand their failures under shift in practice.

This thesis describes my work towards building a foundation for trustworthy and reliable machine learning. The surveyed work falls roughly into three major categories: (i) designing formal, practical characterizations of the structure of real-world distribution shift; (ii) leveraging this structure to develop provably correct and efficient learning algorithms which handle such shifts robustly; and (iii) experimenting with modern ML systems to understand the practical implications of real-world heavy tails and distribution shift, both average- and worst-case.

Part I describes work on scalably certifying the robustness of deep neural networks to adversarial attacks. The proposed approach can be used to certify robustness to attacks on test samples, training data, or more generally any input which influences the model's eventual prediction. In Part II, we focus on latent variable models of shifts, drawing on concepts from causality and other structured encodings of real-world variation. We demonstrate how these models enable formal
... (truncated, 5 KB total)
Resource ID: 56f1ba822bd9862d | Stable ID: sid_n59mef9SzE