Chiron: Privacy-preserving Machine Learning as a Service

Congzheng Song; Emmett Witchel; Reza Shokri; Tyler Hunt; Vitaly Shmatikov

arxiv: 1803.05961 · v1 · pith:BJWRDMKEnew · submitted 2018-03-15 · 💻 cs.CR

Chiron: Privacy-preserving Machine Learning as a Service

Tyler Hunt , Congzheng Song , Reza Shokri , Vitaly Shmatikov , Emmett Witchel This is my paper

classification 💻 cs.CR

keywords chirontrainingdataservicelearningmodelmachineml-as-a-service

0 comments

read the original abstract

Major cloud operators offer machine learning (ML) as a service, enabling customers who have the data but not ML expertise or infrastructure to train predictive models on this data. Existing ML-as-a-service platforms require users to reveal all training data to the service operator. We design, implement, and evaluate Chiron, a system for privacy-preserving machine learning as a service. First, Chiron conceals the training data from the service operator. Second, in keeping with how many existing ML-as-a-service platforms work, Chiron reveals neither the training algorithm nor the model structure to the user, providing only black-box access to the trained model. Chiron is implemented using SGX enclaves, but SGX alone does not achieve the dual goals of data privacy and model confidentiality. Chiron runs the standard ML training toolchain (including the popular Theano framework and C compiler) in an enclave, but the untrusted model-creation code from the service operator is further confined in a Ryoan sandbox to prevent it from leaking the training data outside the enclave. To support distributed training, Chiron executes multiple concurrent enclaves that exchange model parameters via a parameter server. We evaluate Chiron on popular deep learning models, focusing on benchmark image classification tasks such as CIFAR and ImageNet, and show that its training performance and accuracy of the resulting models are practical for common uses of ML-as-a-service.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Characterizing Trust Boundary Vulnerabilities in TEE Containers: An Empirical Study
cs.CR 2025-08 unverdicted novelty 8.0

Empirical study of TEE containers identifies six attack vectors, twelve new bugs, and three CVEs through systematic testing of isolation boundaries.
TENNOR: Trustworthy Execution for Neural Networks through Obliviousness and Retrievals
cs.CR 2026-05 unverdicted novelty 7.0

TENNOR enables efficient private training of wide neural networks in TEEs by recasting sparsification as doubly oblivious LSH retrievals and introducing MP-WTA to cut hash table memory by 50x while preserving accuracy.
Fine-Tuning Integrity for Modern Neural Networks: Structured Drift Proofs via Norm, Rank, and Sparsity Certificates
cs.CR 2026-04 unverdicted novelty 6.0

Succinct Model Difference Proofs certify that a neural-network update stays inside a policy-defined drift class using zero-knowledge proofs whose cost depends only on the drift structure.
Towards Characterizing and Limiting Information Exposure in DNN Layers
cs.CR 2019-07 unverdicted novelty 6.0

Framework quantifies per-layer sensitive information exposure in DNNs via generalization error and evaluates TEE-based protection for the most exposed layers against white-box membership inference.
When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI
cs.CR 2026-05 unverdicted novelty 5.0

A survey providing a taxonomy of TEE platforms, an agent-centric threat model, and open challenges for applying confidential computing to secure agentic AI systems.
When Agents Handle Secrets: A Survey of Confidential Computing for Agentic AI
cs.CR 2026-05 unverdicted novelty 4.0

A structured survey of confidential computing for agentic AI that catalogs TEE platforms, agent-specific threats, transferable defenses, and remaining gaps in end-to-end frameworks.
The Value of Collaboration in Convex Machine Learning with Differential Privacy
cs.CR 2019-06 conditional novelty 4.0

The fitness difference between DP and non-private convex ML models is inversely proportional to training dataset size squared and privacy budget squared.