Model Extraction

定义 Definition

model extraction（模型提取/模型抽取）通常指在机器学习与安全语境中，通过访问模型的输出（如预测结果、置信度、API 返回）来推断并复制目标模型的结构或参数，从而得到一个功能相近的“替代模型”（也常被称为 model stealing，模型窃取）。在更宽泛的技术语境中，也可指把“模型”从某个系统/流程中“提取出来”以便迁移、分析或部署。

例句 Examples

The attacker performed model extraction by querying the API repeatedly.
攻击者通过反复调用该 API 进行模型提取。

Model extraction can reveal a proprietary classifier’s behavior, even when the training data and parameters are hidden.
即使训练数据与参数被隐藏，模型提取仍可能揭示专有分类器的行为特征。

发音 Pronunciation

/ˈmɑːdəl ɪkˈstrækʃən/

词源 Etymology

model 源自拉丁语 modulus（“尺度、标准”之意），引申为“模型、范式”。extraction 来自拉丁语 extrahere（ex- “向外” + trahere “拉、拖”），字面义是“拉出来”，引申为“提取、抽取”。合起来 model extraction 就是“把模型抽取出来/复制出来”。

文献与作品 Literary Works

Stealing Machine Learning Models via Prediction APIs（Tramèr et al., 2016）
Practical Black-Box Attacks against Machine Learning（相关研究中常以“model extraction/stealing”讨论黑盒场景）
Adversarial Machine Learning（多部综述/教材与章节会专门涵盖 model extraction attacks）

Model Extraction

定义 Definition

例句 Examples

发音 Pronunciation

词源 Etymology

相关词 Related Words

文献与作品 Literary Works