Online Multi-task Learning with Hard Constraints

Gabor Lugosi; Gilles Stoltz (DMA; GREGH); Omiros Papaspiliopoulos

arxiv: 0902.3526 · v2 · submitted 2009-02-20 · 📊 stat.ML · cs.LG· math.ST· stat.TH

Online Multi-task Learning with Hard Constraints

Gabor Lugosi , Omiros Papaspiliopoulos , Gilles Stoltz (DMA , GREGH) This is my paper

classification 📊 stat.ML cs.LGmath.STstat.TH

keywords constraintsdiscusstasksactionsdecisionlearningmakermulti-task

0 comments

read the original abstract

We discuss multi-task online learning when a decision maker has to deal simultaneously with M tasks. The tasks are related, which is modeled by imposing that the M-tuple of actions taken by the decision maker needs to satisfy certain constraints. We give natural examples of such restrictions and then discuss a general class of tractable constraints, for which we introduce computationally efficient ways of selecting actions, essentially by reducing to an on-line shortest path problem. We briefly discuss "tracking" and "bandit" versions of the problem and extend the model in various ways, including non-additive global losses and uncountably infinite sets of tasks.

This paper has not been read by Pith yet.

Online Multi-task Learning with Hard Constraints

discussion (0)