首页 /研究 /Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

OTHER

Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

Shike Mei, Xiaojin Zhu

发表年份: 2015
引用次数: 368
访问权限: 开放获取

摘要

We investigate a problem at the intersection of machine learning and security: training-set attacks on machine learners. In such attacks an attacker contaminates the training data so that a specific learning algorithm would produce a model profitable to the attacker. Understanding training-set attacks is important as more intelligent agents (e.g. spam filters and robots) are equipped with learning capability and can potentially be hacked via data they receive from the environment. This paper identifies the optimal training-set attack on a broad family of machine learners. First we show that optimal training-set attack can be formulated as a bilevel optimization problem. Then we show that for machine learners with certain Karush-Kuhn-Tucker conditions we can solve the bilevel problem efficiently using gradient methods on an implicit function. As examples, we demonstrate optimal training-set attacks on Support VectorMachines, logistic regression, and linear regression with extensive experiments. Finally, we discuss potential defenses against such attacks.

关键词

Computer scienceMachine learningArtificial intelligenceSet (abstract data type)Intersection (aeronautics)Training setBilevel optimizationTraining (meteorology)Function (biology)Logistic regression

Using Machine Teaching to Identify Optimal Training-Set Attacks on Machine Learners

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control