Bayesian Optimization with Directionally Constrained Search

Yang Li; Yaqiang Yao

arxiv: 1906.09459 · v1 · pith:BSSPE3YAnew · submitted 2019-06-22 · 💻 cs.LG · stat.ML

Bayesian Optimization with Directionally Constrained Search

Yang Li , Yaqiang Yao This is my paper

Pith reviewed 2026-05-25 18:00 UTC · model grok-4.3

classification 💻 cs.LG stat.ML

keywords bayesian optimizationdirectional constraintslimited evaluationssearch efficiencyglobal optimizationacquisition functionresource budget

0 comments

The pith

Constraining search directions lets Bayesian optimization focus effort on promising regions and reach better points within a fixed evaluation budget.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to show that Bayesian optimization can be improved for real scenarios with limited evaluations by adding directional constraints that steer queries toward areas the model currently views as most promising. This is presented as a hybrid of local and global search that deliberately cuts back on exploration steps judged unlikely to help. The authors argue that the result is more efficient use of each function evaluation, leading to a better final recommendation about the location of the optimum. If the approach works as described, practitioners facing resource limits would obtain higher-quality solutions without increasing the number of expensive evaluations.

Core claim

By constraining searching directions the method dedicates model capability to the most promising area, functioning as a combination of local and global searching policies that reduces inefficient exploration in less useful regions and thereby returns a better point within a prescribed evaluation budget.

What carries the argument

Directional constraint on the search, which limits queries to directions identified as promising by early model estimates.

If this is right

The optimizer spends its limited evaluations more effectively by avoiding low-value local searches.
It produces a higher-quality recommendation of the optimum location under evaluation constraints.
Performance gains appear on both synthetic test functions and real-world applications.
The method remains applicable whenever an optimizer must operate inside a hard evaluation budget.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same directional idea could be tested in other model-based optimizers that also maintain uncertainty estimates.
In high-dimensional problems the constraint might need to be relaxed periodically to avoid permanently missing distant optima.
An ablation that varies how aggressively early estimates are used to set constraints would clarify the robustness of the promising-area identification step.

Load-bearing premise

Early model estimates must correctly flag promising areas without excluding the true global optimum, and the added constraint mechanism must not introduce new sources of failure that erase the efficiency gain.

What would settle it

Run the constrained optimizer and a standard Bayesian optimizer on the same synthetic function where the initial promising region chosen by the model does not contain the global optimum; if the constrained version returns a worse point within the same budget, the claim is falsified.

Figures

Figures reproduced from arXiv: 1906.09459 by Yang Li, Yaqiang Yao.

**Figure 2.** Figure 2: This figure illustrates the directional adherence be [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: A schematic diagram to infer the posterior distribut [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: The circular histogram for both prior and posterior d [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 6.** Figure 6: The DCBO traces under a constraint. The blank areas in [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: (a): The comparison between cBO and DCBO in terms of me [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

**Figure 8.** Figure 8: (a): The comparison between PoI and DCBO in terms of me [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗

**Figure 9.** Figure 9: (a): The comparison between the data-orientated app [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗

**Figure 10.** Figure 10: The negative log likelihood returned by Bayesian se [PITH_FULL_IMAGE:figures/full_fig_p010_10.png] view at source ↗

read the original abstract

Bayesian optimization offers a flexible framework to optimize an objective function that is expensive to be evaluated. A Bayesian optimizer iteratively queries the function values on its carefully selected points. Subsequently, it makes a sensible recommendation about where the optimum locates based on its accumulated knowledge. This procedure usually demands a long execution time. In practice, however, there often exists a computational budget or an evaluation limitation allocated to an optimizer, due to the resource scarcity. This constraint demands an optimizer to be aware of its remaining budget and able to spend it wisely, in order to return as better a point as possible. In this paper, we propose a Bayesian optimization approach in this evaluation-limited scenario. Our approach is based on constraining searching directions so as to dedicate the model capability to the most promising area. It could be viewed as a combination of local and global searching policies, which aims at reducing inefficient exploration in the local searching areas, thus making a searching policy more efficient. Experimental studies are conducted on both synthetic and real-world applications. The results demonstrate the superior performance of our newly proposed approach in searching for the optimum within a prescribed evaluation budget.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The directional constraint idea targets a real BO budget issue but the paper supplies almost no implementation details or experimental evidence, leaving the central claim unevaluable.

read the letter

The main point on this paper is a proposal to add directional constraints to Bayesian optimization so the search focuses on areas the early model thinks are promising, framed as a local-global hybrid for fixed evaluation budgets. The abstract motivates it from the practical need to spend a limited number of queries wisely rather than waste them on broad exploration. That motivation is reasonable and the high-level idea is easy to understand. Beyond that, there is little to evaluate. No equations, no pseudocode, and no description of how the constraint is derived from the GP posterior or whether it includes any relaxation or safeguard. The experiments are said to show superior performance on synthetic and real tasks, but the abstract gives no baselines, no run counts, no statistical tests, and no ablation, so the efficiency claim cannot be checked. The stress-test concern about early posteriors misidentifying promising areas and potentially excluding the global optimum looks like it lands directly on the work; nothing in the provided text indicates the authors addressed it. This version would mainly interest practitioners looking for quick heuristic tweaks in applied optimization settings, but it does not supply enough technical content for a reading group or for citation. I would not send it to peer review until the methods and results sections are substantially expanded with reproducible details.

Referee Report

2 major / 1 minor

Summary. The paper claims to propose a Bayesian optimization approach for evaluation-limited scenarios that constrains search directions to dedicate model capacity to promising areas, framing it as a hybrid of local and global search policies that reduces inefficient exploration. It asserts that experimental studies on synthetic and real-world tasks demonstrate superior performance within a prescribed evaluation budget.

Significance. If the directional constraint mechanism can be shown to avoid prematurely excluding the global optimum while delivering measurable efficiency gains, the work could contribute a practical variant of BO for resource-constrained settings. The manuscript supplies no equations, pseudocode, or experimental evidence, so the significance cannot be assessed from the provided text.

major comments (2)

[Abstract] Abstract: the central efficiency claim rests on constraining search directions derived from early posterior estimates, yet the text supplies no description of how directions are selected, how the constraint is enforced, or any relaxation schedule; without this, it is impossible to evaluate whether the hybrid policy avoids the failure mode of excluding the true optimum.
[Abstract] Abstract: the assertion of 'superior performance' on synthetic and real-world tasks is made without reference to any baselines, statistical tests, ablation studies, or implementation details, rendering the experimental claim unevaluable and load-bearing for the paper's contribution.

minor comments (1)

[Abstract] The sentence 'return as better a point as possible' contains awkward phrasing that should be revised for clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the feedback. We address the two major comments on the abstract below. The full manuscript contains the requested technical details and experimental evidence; we will revise the abstract for improved clarity and evaluability.

read point-by-point responses

Referee: [Abstract] Abstract: the central efficiency claim rests on constraining search directions derived from early posterior estimates, yet the text supplies no description of how directions are selected, how the constraint is enforced, or any relaxation schedule; without this, it is impossible to evaluate whether the hybrid policy avoids the failure mode of excluding the true optimum.

Authors: The abstract is a concise summary and omits implementation specifics by design. The full manuscript details the derivation of directions from early posterior estimates, the enforcement of the directional constraint, and the relaxation schedule (to avoid excluding the global optimum) in the methodology section. We will revise the abstract to add a brief reference to these elements and point to the relevant sections. revision: yes
Referee: [Abstract] Abstract: the assertion of 'superior performance' on synthetic and real-world tasks is made without reference to any baselines, statistical tests, ablation studies, or implementation details, rendering the experimental claim unevaluable and load-bearing for the paper's contribution.

Authors: The abstract summarizes the outcome; the full manuscript reports the experimental studies with baselines, statistical tests, ablation studies, and implementation details. We will revise the abstract to name the primary baselines and indicate that full results appear in the experiments section. revision: yes

Circularity Check

0 steps flagged

No circularity detected; proposal is a high-level algorithmic idea without self-referential derivations

full rationale

The paper describes a Bayesian optimization variant that imposes directional constraints to focus search on promising regions, framed as a hybrid local-global policy. No equations, parameter fits, predictions, or uniqueness theorems appear in the provided text. The approach is introduced conceptually without any reduction of outputs to inputs by construction, self-citation load-bearing premises, or renaming of known results. The derivation chain is therefore self-contained at the level of an algorithmic proposal.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, mathematical axioms, or new postulated entities; the method is described only at the policy level.

pith-pipeline@v0.9.0 · 5716 in / 994 out tokens · 52303 ms · 2026-05-25T18:00:59.710828+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Our approach is based on constraining searching directions so as to dedicate the model capability to the most promising area... H(x)ρ(t)[max{0,f(x+)−f(˜x)}]^{1−ρ(t)} ≡ H(x)^ρ EI(x)^{1−ρ(t)}
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean absolute_floor_iff_bare_distinguishability unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Vmf(g|θ,κ)=Cd(κ)exp(κθ^T g) ... posterior p(gt+1|xt+1) via VMF update rules

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · 1 internal anchor

[1]

On bayesian methods for seeking the extremu m,

J. Moˇ ckus, “On bayesian methods for seeking the extremu m,” in Optimization T echniques IFIP T echnical Conference. Springer, 1975, pp. 400–404

work page 1975
[2]

A taxonomy of global optimization methods b ased on response surfaces,

D. R. Jones, “A taxonomy of global optimization methods b ased on response surfaces,” Journal of global optimization , vol. 21, no. 4, pp. 345–383, 2001

work page 2001
[3]

Gaussian p ro- cess optimization in the bandit setting: No regret and experimental design,

N. Srinivas, A. Krause, S. Kakade, and M. Seeger, “Gaussian p ro- cess optimization in the bandit setting: No regret and experimental design,” in International Conference on International Conference on Machine Learning, 2010, pp. 1015–1022

work page 2010
[4]

A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise,

H. J. Kushner, “A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise,” Journal of Basic Engineering, vol. 86, no. 1, pp. 97–106, 1964

work page 1964
[5]

Information-Theoretic Regret Bounds for Gaussian Proces s Op- timization in the Bandit Setting,

N. Srinivas, A. Krause, S. M. Kakade, and M. W. Seeger, “Information-Theoretic Regret Bounds for Gaussian Proces s Op- timization in the Bandit Setting,” IEEE T ransactions on Information Theory, vol. 58, no. 5, pp. 3250–3265, May 2012

work page 2012
[6]

Convergence rates of efﬁcient global optimi zation algorithms,

A. D. Bull, “Convergence rates of efﬁcient global optimi zation algorithms,” Journal of Machine Learning Research , vol. 12, no. Oct, pp. 2879–2904, 2011

work page 2011
[7]

Entropy search for informati on- efﬁcient global optimization,

P . Hennig and C. J. Schuler, “Entropy search for informati on- efﬁcient global optimization,” Journal of Machine Learning Research , vol. 13, no. Jun, pp. 1809–1837, 2012

work page 2012
[8]

Predictive Entropy Search for Bayesian Optimization with Unknown Constraints,

J. M. Hern´ andez-Lobato, M. A. Gelbart, M. W. Hoffman, R. P . Adams, and Z. Ghahramani, “Predictive Entropy Search for Bayesian Optimization with Unknown Constraints,” in Interna- tional Conference on Machine Learning , vol. 37, 2015, pp. 1699–1707

work page 2015
[9]

Predictive Entropy Search for Multi-objective Baye sian Optimization,

D. Hern´ andez-Lobato, J. M. Hern´ andez-Lobato, A. Shah, and R. P . Adams, “Predictive Entropy Search for Multi-objective Baye sian Optimization,” in International Conference on Machine Learning , vol. 48, 2016, pp. 1492–1501

work page 2016
[10]

C. M. Bishop, Pattern Recognition and Machine Learning . Springer, 2006, vol. 4, no. 4

work page 2006
[11]

C. E. Rasmussen and C. K. I. Williams, Gaussian processes for machine learning, ser. Adaptive computation and machine learning. Cambridge, Mass: MIT Press, 2006

work page 2006
[12]

A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning

E. Brochu, V . M. Cora, and N. de Freitas, “A Tutorial on Ba yesian Optimization of Expensive Cost Functions, with Applicatio n to Active User Modeling and Hierarchical Reinforcement Learn ing,” arXiv:1012.2599 [cs], Dec. 2010

work page internal anchor Pith review Pith/arXiv arXiv 2010
[13]

Efﬁcient Globa l Optimization of Expensive Black-Box Functions,

D. R. Jones, M. Schonlau, and W. J. Welch, “Efﬁcient Globa l Optimization of Expensive Black-Box Functions,” Journal of Global Optimization, vol. 13, no. 4, pp. 455–492, Dec. 1998

work page 1998
[14]

Bayesian Optimization with Inequality C on- straints

J. R. Gardner, M. J. Kusner, Z. E. Xu, K. Q. Weinberger, an d J. P . Cunningham, “Bayesian Optimization with Inequality C on- straints.” in International Conference on Machine Learning , 2014, pp. 937–945

work page 2014
[15]

Bayesian optimization under mixed constraints with a slack-variabl e aug- mented Lagrangian,

V . Picheny , R. B. Gramacy , S. M. Wild, and S. L. Digabel, “Bayesian optimization under mixed constraints with a slack-variabl e aug- mented Lagrangian,” in Advances in Neural Information Processing Systems, 2016, pp. 1435–1443

work page 2016
[16]

Optimization under unknown constraints,

J. Bernardo, M. J. Bayarri, J. O. Berger, A. P . Dawid, D. H eckerman, A. F. M. Smith, and M. West, “Optimization under unknown constraints,” Bayesian Statistics, vol. 9, no. 9, p. 229, 2011

work page 2011
[17]

A Stepwise uncertainty reduction approach to con- strained global optimization,

V . Picheny , “A Stepwise uncertainty reduction approach to con- strained global optimization,” in Artiﬁcial Intelligence and Statistics , Apr. 2014, pp. 787–795

work page 2014
[18]

Lookahead Bayesian Optimizatio n with Inequality Constraints,

R. Lam and K. Willcox, “Lookahead Bayesian Optimizatio n with Inequality Constraints,” in Advances in Neural Information Process- ing Systems , I. Guyon, U. v . Luxburg, S. Bengio, H. M. Wallach, R. Fergus, S. V . N. Vishwanathan, and R. Garnett, Eds., 2017, p p. 1888–1898

work page 2017
[19]

Bayesian optimiz ation with a ﬁnite budget: An approximate dynamic programming approach,

R. Lam, K. Willcox, and D. H. Wolpert, “Bayesian optimiz ation with a ﬁnite budget: An approximate dynamic programming approach,” in Advances in Neural Information Processing Systems , 2016, pp. 883–891

work page 2016
[20]

S. R. Jammalamadaka and A. Sengupta, T opics in circular statistics, ser. Series on multivariate analysis. River Edge, N.J: World Scientiﬁc, 2001, no. v . 5

work page 2001
[21]

Clusterin g on the Unit Hypersphere Using V on Mises-Fisher Distributions,

A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra, “Clusterin g on the Unit Hypersphere Using V on Mises-Fisher Distributions,” J. Mach. Learn. Res., vol. 6, pp. 1345–1382, Dec. 2005

work page 2005
[22]

Learning the linear dynamical system with ASOS,

J. Martens, “Learning the linear dynamical system with ASOS,” in International Conference on Machine Learning , 2010, pp. 743–750. 13

work page 2010
[23]

Modeling hum an motion using binary latent variables,

G. W. Taylor, G. E. Hinton, and S. T. Roweis, “Modeling hum an motion using binary latent variables,” in Advances in neural infor- mation processing systems , 2007, pp. 1345–1352

work page 2007

[1] [1]

On bayesian methods for seeking the extremu m,

J. Moˇ ckus, “On bayesian methods for seeking the extremu m,” in Optimization T echniques IFIP T echnical Conference. Springer, 1975, pp. 400–404

work page 1975

[2] [2]

A taxonomy of global optimization methods b ased on response surfaces,

D. R. Jones, “A taxonomy of global optimization methods b ased on response surfaces,” Journal of global optimization , vol. 21, no. 4, pp. 345–383, 2001

work page 2001

[3] [3]

Gaussian p ro- cess optimization in the bandit setting: No regret and experimental design,

N. Srinivas, A. Krause, S. Kakade, and M. Seeger, “Gaussian p ro- cess optimization in the bandit setting: No regret and experimental design,” in International Conference on International Conference on Machine Learning, 2010, pp. 1015–1022

work page 2010

[4] [4]

A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise,

H. J. Kushner, “A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise,” Journal of Basic Engineering, vol. 86, no. 1, pp. 97–106, 1964

work page 1964

[5] [5]

Information-Theoretic Regret Bounds for Gaussian Proces s Op- timization in the Bandit Setting,

N. Srinivas, A. Krause, S. M. Kakade, and M. W. Seeger, “Information-Theoretic Regret Bounds for Gaussian Proces s Op- timization in the Bandit Setting,” IEEE T ransactions on Information Theory, vol. 58, no. 5, pp. 3250–3265, May 2012

work page 2012

[6] [6]

Convergence rates of efﬁcient global optimi zation algorithms,

A. D. Bull, “Convergence rates of efﬁcient global optimi zation algorithms,” Journal of Machine Learning Research , vol. 12, no. Oct, pp. 2879–2904, 2011

work page 2011

[7] [7]

Entropy search for informati on- efﬁcient global optimization,

P . Hennig and C. J. Schuler, “Entropy search for informati on- efﬁcient global optimization,” Journal of Machine Learning Research , vol. 13, no. Jun, pp. 1809–1837, 2012

work page 2012

[8] [8]

Predictive Entropy Search for Bayesian Optimization with Unknown Constraints,

J. M. Hern´ andez-Lobato, M. A. Gelbart, M. W. Hoffman, R. P . Adams, and Z. Ghahramani, “Predictive Entropy Search for Bayesian Optimization with Unknown Constraints,” in Interna- tional Conference on Machine Learning , vol. 37, 2015, pp. 1699–1707

work page 2015

[9] [9]

Predictive Entropy Search for Multi-objective Baye sian Optimization,

D. Hern´ andez-Lobato, J. M. Hern´ andez-Lobato, A. Shah, and R. P . Adams, “Predictive Entropy Search for Multi-objective Baye sian Optimization,” in International Conference on Machine Learning , vol. 48, 2016, pp. 1492–1501

work page 2016

[10] [10]

C. M. Bishop, Pattern Recognition and Machine Learning . Springer, 2006, vol. 4, no. 4

work page 2006

[11] [11]

C. E. Rasmussen and C. K. I. Williams, Gaussian processes for machine learning, ser. Adaptive computation and machine learning. Cambridge, Mass: MIT Press, 2006

work page 2006

[12] [12]

A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning

E. Brochu, V . M. Cora, and N. de Freitas, “A Tutorial on Ba yesian Optimization of Expensive Cost Functions, with Applicatio n to Active User Modeling and Hierarchical Reinforcement Learn ing,” arXiv:1012.2599 [cs], Dec. 2010

work page internal anchor Pith review Pith/arXiv arXiv 2010

[13] [13]

Efﬁcient Globa l Optimization of Expensive Black-Box Functions,

D. R. Jones, M. Schonlau, and W. J. Welch, “Efﬁcient Globa l Optimization of Expensive Black-Box Functions,” Journal of Global Optimization, vol. 13, no. 4, pp. 455–492, Dec. 1998

work page 1998

[14] [14]

Bayesian Optimization with Inequality C on- straints

J. R. Gardner, M. J. Kusner, Z. E. Xu, K. Q. Weinberger, an d J. P . Cunningham, “Bayesian Optimization with Inequality C on- straints.” in International Conference on Machine Learning , 2014, pp. 937–945

work page 2014

[15] [15]

Bayesian optimization under mixed constraints with a slack-variabl e aug- mented Lagrangian,

V . Picheny , R. B. Gramacy , S. M. Wild, and S. L. Digabel, “Bayesian optimization under mixed constraints with a slack-variabl e aug- mented Lagrangian,” in Advances in Neural Information Processing Systems, 2016, pp. 1435–1443

work page 2016

[16] [16]

Optimization under unknown constraints,

J. Bernardo, M. J. Bayarri, J. O. Berger, A. P . Dawid, D. H eckerman, A. F. M. Smith, and M. West, “Optimization under unknown constraints,” Bayesian Statistics, vol. 9, no. 9, p. 229, 2011

work page 2011

[17] [17]

A Stepwise uncertainty reduction approach to con- strained global optimization,

V . Picheny , “A Stepwise uncertainty reduction approach to con- strained global optimization,” in Artiﬁcial Intelligence and Statistics , Apr. 2014, pp. 787–795

work page 2014

[18] [18]

Lookahead Bayesian Optimizatio n with Inequality Constraints,

R. Lam and K. Willcox, “Lookahead Bayesian Optimizatio n with Inequality Constraints,” in Advances in Neural Information Process- ing Systems , I. Guyon, U. v . Luxburg, S. Bengio, H. M. Wallach, R. Fergus, S. V . N. Vishwanathan, and R. Garnett, Eds., 2017, p p. 1888–1898

work page 2017

[19] [19]

Bayesian optimiz ation with a ﬁnite budget: An approximate dynamic programming approach,

R. Lam, K. Willcox, and D. H. Wolpert, “Bayesian optimiz ation with a ﬁnite budget: An approximate dynamic programming approach,” in Advances in Neural Information Processing Systems , 2016, pp. 883–891

work page 2016

[20] [20]

S. R. Jammalamadaka and A. Sengupta, T opics in circular statistics, ser. Series on multivariate analysis. River Edge, N.J: World Scientiﬁc, 2001, no. v . 5

work page 2001

[21] [21]

Clusterin g on the Unit Hypersphere Using V on Mises-Fisher Distributions,

A. Banerjee, I. S. Dhillon, J. Ghosh, and S. Sra, “Clusterin g on the Unit Hypersphere Using V on Mises-Fisher Distributions,” J. Mach. Learn. Res., vol. 6, pp. 1345–1382, Dec. 2005

work page 2005

[22] [22]

Learning the linear dynamical system with ASOS,

J. Martens, “Learning the linear dynamical system with ASOS,” in International Conference on Machine Learning , 2010, pp. 743–750. 13

work page 2010

[23] [23]

Modeling hum an motion using binary latent variables,

G. W. Taylor, G. E. Hinton, and S. T. Roweis, “Modeling hum an motion using binary latent variables,” in Advances in neural infor- mation processing systems , 2007, pp. 1345–1352

work page 2007