Resume for John Langford

Phone: 914-948-1871
E-mail: jl@hunch.net
Web: http://hunch.net/~jl/

Research Interests

I want to solve machine learning. In a solved form, machine learning is automatic, robust, scalable, and easily used for many problems. In the last several years, we have made great progress in this direction.

Education

M.S./Ph.D. - Computer Science, Carnegie Mellon, June 2000/2002. Advisors: Avrim Blum and Sebastian Thrun. My thesis was on tight sample complexity bounds, used for evaluation and learning algorithm design.
B.S./B.S. - Physics/Computer Science, California Institute of Technology, June 1997.

Employment History

Microsoft Research, New York, NY, May 2012 - present, Principal Research Scientist

Yahoo! Research, New York, NY, June 2006 - April 2012, Senior Research Scientist

TTI-Chicago, Chicago, IL, September 2003 - May 2006, Research Assistant Professor

IBM, TJ Watson, Yorktown, NY, September 2002 - August 2003, Herman Goldstine Fellow

University of Pennsylvania, Philadelphia, PA, June 2002 - August 2002, Postdoc with Michael Kearns

Activities

Blog: Machine Learning (Theory)
Tutorial: Real World Interactive Learning at ICML 2017.
Class: Machine Learning the Future at Cornell Tech in 2017.
Tutorial: Learning to Search at HLT-NAACL 2015 and ICML 2015.
Tutorial: Learning to Interact at NIPS 2013.
Class: Large Scale Machine Learning
Tutorial: Scaling up Machine Learning at KDD 2011.
Tutorial: Learning through Exploration (video) at ICML 2010 and KDD 2010.
Workshop: Organizer Cores, Clusters, and Clouds at NIPS 2010.
Tutorial: Active Learning (video) at ICML 2009.
Tutorial: Reductions in Machine Learning (video) at ICML 2009.
Workshop: Organizer Principles of Learning Problem Design at NIPS 2007
Tutorial: Learning Reductions IJCAI2005 and MLSS 2005
School: Organizer Machine Learning Summer School Chicago 2005
Workshop: Organizer (Ab)Use of Bounds NIPS 2004
Workshop: Organizer Machine Learning Reductions TTI-Chicago, 2003
Tutorial: Practical Prediction Theory for Classification ICML2003 and MLSS 2005

Service

President-Elect of ICML in 2017
General Chair: ICML 2016
Program Chair: ICML 2012
New York ML Symposium co-organizer: 2008 2009 2010 2011 2012 2014 2015 2016 2017 2018
Area chair/Senior PC: ICML 2004, NIPS 2006, ICML 2007, ICML 2009, ICML 2010, KDD 2010, NIPS 2010, ICML 2011, and others since.
Program committee: ICML 2003, 2005 & 2008, SODA 2008, AAAI 2005, AAAI 2007, ALT 2004, UAI 2007, AISTATS 2005, COLT 2008, COLT 2009, and others
Reviewing: NIPS 2001, 2002, 2003, 2005 & 2007, AAAI 2002, MLJ, JMLR, JAIR, JCSS, TCS, JACM, and others

Mentoring

I have worked with many students over time. In all cases, I try to learn something from them as well.

Jacob Abernethy (Professor, University of Michigan)
Alekh Agarwal (Research Scientist, MSR-NYC)
Luis von Ahn (*) (Professor Carnegie Mellon)
Ashwinkumar Badanidiyuru (Research Scientist, Google)
Nina Balcan (*) (Professor Carnegie Mellon)
Arindam Banerjee (*) (Professor UMinnesota)
Alberto Bietti (INRIA)
Kai-Wei Chang (Professor, UCLA)
Carl Burch (*) (Google)
Anna Choromanska (*) (Professor, NYU)
Christoph Dann (CMU)
Hal Daume (*) (Professor U Maryland & Researcher at MSR-NYC)
Varsha Dani (Research Professor U New Mexico)
Nick Hopper (*) (Professor UMinnesota)
Daniel Hsu (*) (Professor, Columbia)
Nan Jiang (Postdoc at MSR-NYC)
Nikos Karampatziakis (*) (Microsoft)
Matti Kääriäinen (*) (Nokia?)
Sham Kakade (*) (University of Washington)
Adam Kalai (*) (Microsoft Research)
Akshay Krishnamurthy (Professor, University of Massachusetts, Amherst & Researcher at MSR-NYC)
Nicolas Lambert (Professor Stanford)
Lihong Li (Google)
Haipeng Luo (USC Professor)
Dipendra Misra (Cornell Tech)
Joseph O'Sullivan (ex-Google :-)
Pradeep Ravikumar (Professor UTAustin)
Ruslan Salakhutdinov (Professor Carnegie Mellon)
Matthias Seeger (*) (Amazon)
Vin de Silva (*) (Professor Pomona)
Alex Strehl (*) (Facebook)
Adith Swaminathan
Jennifer Wortman Vaughan (Microsoft Research)
Vandi Verma (*) (JPL)
Yevgeniy Vorobeychik (Vanderbilt University)
Eric Wiewiora (UCSD)
Bianca Zadrozny (*) (IBM)
Martin Zinkevich (Google Research)

(*) = on work which became part of their thesis

Select Publications

Ron Bekkerman, Misha Bilenko, and John Langford (Editors) Scaling Up Machine Learning, Cambridge University Press, 2011.
Miroslav Dudík, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang: Efficient Optimal Learning for Contextual Bandits, UAI 2011
Alina Beygelzimer, Sham Kakade, and John Langford Cover Trees for Nearest Neighbor, ICML-2006 [Pat Goldberg Best Paper Award in Computer Science, Electrical Engineering, and Mathematics]
Luis von Ahn, Manuel Blum, Nick Hopper and John Langford CAPTCHA: Using Hard AI Problems for Security Eurocrypt 2003
Josh Tenenbaum, Vin de Silva and John Langford. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290, pages 2319-2323, 2000.

Papers, forthcoming

Alekh Agarwal, Sarah Bird, Markus Cozowicz, Luong Hoang, John Langford, Stephen Lee, Jiaji Li, Dan Melamed, Gal Oshri, Oswaldo Ribas, Siddhartha Sen, Alex Slivkins, A Multiworld Testing Decision Service.

All Publications, appended

Research Programs

Much of my work can be organized into coherent programs.

Creating and using tight sample complexity bounds for evaluation and machine learning algorithm design. This was my thesis work, and a small program of work by others has grown out of it.
Learning Reductions. We transform complex learning problems into simple ones and then use good algorithms for the simple ones to solve the complex problems. Here we designed the method of analysis as well as the algorithms, and again a small program of work by others is growing out of it.
Scalable learning. The goal here is to scale up learning algorithms in all ways. We have addressed scaling in the number of parameters, the number of examples, and the number of labels.
Active Learning. We proved that active learning was possible in the same noisy settings as supervised learning and eventually refined the algorithm to efficiently use any supervised algorithm as an oracle.
Contextual Bandits. In many real-life situations, we only get feedback about choices taken rather than choices not taken, as is assumed in normal supervised learning. This requires a ground-up rethink of what machine learning means. We have developed evaluators (equivalent to test sets in supervised learning), optimized the use of exploration information, developed exponentially faster algorithms, and created a system for learning.
Vowpal Wabbit is an open source machine learning system which encodes some of the above. VW has many unique features which make it a useful tool for advanced prediction, large scale, and interactive settings.

Patents

References

Avrim Blum (Ph.D. advisor) avrim+@cs.cmu.edu
Michael Kearns mkearns@cis.upenn.edu
Robert Schapire schapire@cs.princeton.edu
Sanjoy Dasgupta dasgupta@cs.ucsd.edu
Preston McAfee preston@mcafee.cc
Prabhakar Raghavan pragh@acm.org

Citizenship	U.S.
Awards	Siebel Scholar, Herman Goldstine Postdoctoral Fellowship, Lebanon High School Hall of Fame.

Publications, All

Alekh Agarwal, Alina Beygelzimer, Miro Dudík, John Langford, Hanna Wallach, A reductions approach to fair classification, ICML 2018.
Haipeng Luo, Chen-Yu Wei, Alekh Agarwal, John Langford, Efficient Contextual Bandits in Non-stationary Worlds, COLT 2018.
Furong Huang, Jordan Ash, John Langford, Robert Schapire, Learning Deep ResNet Blocks Sequentially using Boosting Theory, ICML 2018.
Adith Swaminathan, Akshay Krishnamurthy, Alekh Agarwal, Miro Dudík, John Langford, Damien Jose, Imed Zitouni: Off-policy evaluation for slate recommendation. NIPS 2017.
Dipendra Misra, John Langford, Yoav Artzi, Mapping Instructions and Visual Observations to Actions with Reinforcement Learning, EMNLP 2017.
Akshay Krishnamurthy, Alekh Agarwal, Tzu-Kuo Huang, Hal Daumé III, John Langford, Active Learning for Cost-Sensitive Classification, ICML 2017.
Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford, Robert Schapire, Contextual Decision Processes with low Bellman rank are PAC-Learnable. ICML 2017
Hal Daumé III, Nikos Karampatziakis, John Langford, Paul Mineiro: Logarithmic Time One-Against-Some. ICML 2017.
Kai-Wei Chang, He He, Hal Daumé III, John Langford, Stephane Ross, A Credit Assignment Compiler for Joint Prediction, NIPS 2016.
John Langford, Mark Guzdial, The solution to AI, what real researchers do, and expectations for CS classrooms. Commun. ACM 59(6): 10-11 (2016).
Alina Beygelzimer, Hal Daumé III, John Langford, Paul Mineiro: Learning Reductions That Really Work. Proceedings of the IEEE 104(1): 136-147 (2016)
Haipeng Luo, Alekh Agarwal, Nicolò Cesa-Bianchi, John Langford: Efficient Second Order Online Learning via Sketching. NIPS 2016.
Akshay Krishnamurthy, Alekh Agarwal, John Langford: Contextual-MDPs for PAC-Reinforcement Learning with Rich Observations. NIPS 2016.
Alina Beygelzimer, Daniel J. Hsu, John Langford, Chicheng Zhang: Search Improves Label for Active Learning. NIPS 2016.
John Langford, Mark Guzdial, The arbitrariness of reviews, and advice for school administrators. Commun. ACM 58(4): 12-13 (2015).
Nicolas S. Lambert, John Langford, Jennifer Wortman Vaughan, Yiling Chen, Daniel M. Reeves, Yoav Shoham, David M. Pennock An axiomatic characterization of wagering mechanisms. J. Economic Theory 156: 389-416 (2015).
Kai-Wei Chang, Akshay Krishnamurthy, Alekh Agarwal, Hal Daumé III, John Langford: Learning to Search Better than Your Teacher. ICML 2015: 2058-2066.
Hal Daumé III, John Langford, Kai-Wei Chang, He He, Sudha Rao: Hands-on Learning to Search for Structured Prediction. HLT-NAACL 2015: 1.
Anna Choromanska, John Langford: Logarithmic Time Online Multiclass prediction. NIPS 2015: 55-63
Tzu-Kuo Huang, Alekh Agarwal, Daniel J. Hsu, John Langford, Robert E. Schapire: Efficient and Parsimonious Agnostic Active Learning. NIPS 2015: 2755-2763.
John Langford, Mark Guzdial Finding a research job, and teaching CS in high school. Commun. ACM 57(10): 10-11 (2014).
Alekh Agarwal, Olivier Chapelle, Miroslav Dudík, John Langford: A Reliable Effective Terascale Linear Learning System, Journal of Machine Learning Research 15(1): 1111-1133 (2014).
Ashwinkumar Badanidiyuru, John Langford, Aleksandrs Slivkins: Resourceful Contextual Bandits. COLT 2014: 1109-1134
Alekh Agarwal, Daniel J. Hsu, Satyen Kale, John Langford, Lihong Li, Robert E. Schapire Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits. ICML 2014: 1638-1646
Alekh Agarwal, Alina Beygelzimer, Daniel J. Hsu, John Langford, Matus J. Telgarsky Scalable Non-linear Learning with Adaptive Polynomial Expansions. NIPS 2014: 2051-2059
Stephane Ross, Paul Mineiro, and John Langford, Normalized Online Learning, UAI 2013.
Miroslav Dudík, Dumitru Erhan, John Langford, Lihong Li: Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits. UAI 2012: 247-254
John Langford: Parallel machine learning on big data. ACM Crossroads 19(1): 60-62 (2012)
Lihong Li, Wei Chu, John Langford, Taesup Moon, Xuanhui Wang: Unbiased offline evaluation of contextual-bandit Algorithms with Generalized Linear Models, JMLR 2012
John Langford, Ruben Ortega: Machine learning and algorithms; agile development. Commun. ACM 55(8): 10-11 (2012)
John Langford, Lihong Li, R. Preston McAfee, Kishore Papineni: Cloud control: voluntary admission control for intranet traffic management. Inf. Syst. E-Business Management 10(3): 295-308 (2012)
John Langford, Joelle Pineau: Proceedings of the 29th International Conference on Machine Learning (ICML-12). CoRR abs/1207.4676 (2012)
Alekh Agarwal, Miroslav Dudík, Satyen Kale, John Langford, Robert E. Schapire: Contextual Bandit Learning with Predictable Rewards, AISTATS 2012.
Alina Beygelzimer, John Langford, David Pennock: Learning Performance of Prediction Markets with Kelly Bettors, AAMAS 2012
Ron Bekkerman, Misha Bilenko, and John Langford (Editors) Scaling Up Machine Learning, Cambridge University Press, 2011.
Daniel Hsu, Nikos Karampatziakis, John Langford, Alexander J. Smola: Parallel Online Learning, in Scaling Up Machine Learning, Cambridge University Press, 2011.
Miroslav Dudík, John Langford, Lihong Li: Doubly Robust Policy Evaluation and Learning, ICML 2011
Miroslav Dudík, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, Tong Zhang: Efficient Optimal Learning for Contextual Bandits, UAI 2011
Nikos Karampatziakis, John Langford: Online Importance Weight Aware Updates, UAI 2011
Lihong Li, Wei Chu, John Langford, Xuanhui Wang: Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms, WSDM 2011 [Best Paper Award]
Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, Robert E. Schapire: Contextual Bandit Algorithms with Supervised Learning Guarantees, Journal of Machine Learning Research, Proceedings Track 15 (AISTATS 2011): 19-26 (2011) [Notable Paper Award]
Alina Beygelzimer, Daniel Hsu, John Langford, Tong Zhang: Agnostic Active Learning Without Constraints, NIPS 2010
Alexander L. Strehl, John Langford, Lihong Li, Sham Kakade: Learning from Logged Implicit Exploration Data, NIPS 2010
Lihong Li, Wei Chu, John Langford, Robert E. Schapire: A contextual-bandit approach to personalized news article recommendation, WWW 2010
John Langford: Efficient Exploration in Reinforcement Learning. Encyclopedia of Machine Learning 2010: 309-311
John Langford Open Problem: Robust Efficient Conditional Probability Estimation, COLT 2010.
John Langford, Lihong Li, Yevgeniy Vorobeychik, Jennifer Wortman: Maintaining Equilibria During Exploration in Sponsored Search Auctions, Algorithmica 58(4): 990-1021 (2010). Conference version in WINE 2007.
Alina Beygelzimer, John Langford, Pradeep D. Ravikumar: Error-Correcting Tournaments, ALT 2009
Kilian Q. Weinberger, Anirban Dasgupta, John Langford, Alexander J. Smola, Josh Attenberg: Feature hashing for large scale multitask learning, ICML 2009
Alina Beygelzimer, Sanjoy Dasgupta, John Langford: Importance weighted active learning, ICML 2009
John Langford, Ruslan Salakhutdinov, Tong Zhang: Learning nonlinear dynamic models, ICML 2009
Alina Beygelzimer, John Langford: The offset tree for learning with partial labels, KDD 2009
Martin Zinkevich, Alexander J. Smola, John Langford: Slow Learners are Fast, NIPS 2009
Daniel Hsu, Sham Kakade, John Langford, Tong Zhang: Multi-Label Prediction via Compressed Sensing, NIPS 2009
Alina Beygelzimer, John Langford, Yury Lifshits, Gregory B. Sorkin, Alexander L. Strehl: Conditional Probability Tree Estimation Analysis and Algorithms, UAI 2009
Nicholas J. Hopper, John Langford, and Luis von Ahn: Provably Secure Steganography, Crypto 2002. Journal version: IEEE Transactions on Computers 58(5): 662-676 (2009)
Maria-Florina Balcan, Alina Beygelzimer, John Langford: Agnostic active learning, ICML 2006. Journal version: JCSS 75(1): 78-89 (2009)
Qinfeng Shi, James Petterson, Gideon Dror, John Langford, Alexander J. Smola, S. V. N. Vishwanathan: Hash Kernels for Structured Data, Journal of Machine Learning Research 10: 2615-2637 (2009)
Qinfeng Shi, James Petterson, Gideon Dror, John Langford, Alexander J. Smola, Alexander L. Strehl, Vishy Vishwanathan: Hash Kernels, Journal of Machine Learning Research - Proceedings Track 5 (AISTATS 2009): 496-503 (2009)
John Langford, Lihong Li, Tong Zhang: Sparse Online Learning via Truncated Gradient, NIPS 2008. Journal version: Journal of Machine Learning Research 10: 777-801 (2009)
Hal Daumé III, John Langford, Daniel Marcu: Search-based structured prediction, Machine Learning 75(3): 297-325 (2009)
Nicolas S. Lambert, John Langford, Jennifer Wortman, Yiling Chen, Daniel M. Reeves, Yoav Shoham, David M. Pennock: Self-financed wagering mechanisms for forecasting, EC 2008. [Winner of an Outstanding Paper Award]
John Langford, Alexander L. Strehl, Jennifer Wortman: Exploration scavenging, ICML 2008
Sharad Goel, John Langford, Alexander L. Strehl: Predictive Indexing for Fast Search, NIPS 2008
Maria-Florina Balcan, Nikhil Bansal, Alina Beygelzimer, Don Coppersmith, John Langford, Gregory B. Sorkin: Robust reductions from ranking to classification, COLT 2007. Journal version: Machine Learning 72(1-2): 139-153 (2008)
John Langford, Tong Zhang: The Epoch-Greedy Algorithm for Contextual Multi-armed Bandits. NIPS 2007
Alexander L. Strehl, Lihong Li, Eric Wiewiora, John Langford, Michael L. Littman: PAC model-free reinforcement learning, ICML 2006
Alina Beygelzimer, Sham Kakade, and John Langford Cover Trees for Nearest Neighbor, ICML-2006 [Pat Goldberg Best Paper Award in Computer Science, Electrical Engineering, and Mathematics]
Naoki Abe, Bianca Zadrozny, John Langford: Outlier detection by active learning, KDD 2006
John Langford, Roberto Oliveira, Bianca Zadrozny: Predicting Conditional Quantiles via Reduction to Classification, UAI 2006
Jacob Abernethy, John Langford, Manfred Warmuth Continuous Experts and the Binning Algorithm, COLT 2006
John Langford and Bianca Zadrozny Relating Reinforcement Learning Performance to Classification Performance ICML 2005
Matti Kääriäinen, John Langford: A comparison of tight generalization error bounds. ICML 2005
Alina Beygelzimer, Varsha Dani, Tom Hayes, John Langford, and Bianca Zadrozny Error limiting reductions between classification tasks ICML 2005
Alina Beygelzimer, John Langford, and Bianca Zadrozny Weighted One Against All AAAI 2005
John Langford and Alina Beygelzimer Sensitive Error Correcting Output Codes COLT 2005
Luis von Ahn, Nick Hopper, and John Langford Covert Two-Party Computation STOC 2005
John Langford and Bianca Zadrozny Estimating Class Membership Probabilities Using Classifier Learners AISTATS 2005
John Langford Tutorial on Practical Prediction Theory for Classification JMLR 2005
Arindam Banerjee and John Langford An Objective Evaluation Criterion for Clustering KDD 2004
Naoki Abe, Bianca Zadrozny, and John Langford An Iterative Method for Multi-class Cost-sensitive Learning KDD 2004
Peter Grünwald and John Langford Suboptimal Behavior of Bayes and MDL in Classification under Misspecification COLT 2004. Journal version: Machine Learning 66(2-3): 119-149 (2007)
Luis von Ahn, Manuel Blum, Nick Hopper and John Langford CAPTCHA: Using Hard AI Problems for Security Eurocrypt 2003
Luis von Ahn, Manuel Blum and John Langford: Telling humans and computers apart automatically. Commun. ACM 47(2): 56-60 (2004)
Sham Kakade, Michael Kearns, John Langford, and Luis Ortiz, Correlated Equilibria in Graphical Games, ACM EC 2003.
Bianca Zadrozny, John Langford, and Naoki Abe Cost Sensitive Learning by Cost-Proportionate Example Weighting ICDM 2003
Avrim Blum and John Langford PAC-MDL Bounds COLT 2003
Sham Kakade, Michael Kearns, and John Langford Exploration in Metric State Spaces ICML2003
John Langford and John Shawe-Taylor PAC-Bayes and Margins. NIPS-2002
Sham Kakade, John Langford Approximately Optimal Approximate Reinforcement Learning ICML-2002
John Langford: Combining Training Set and Test Set Bounds. ICML 2002: 331-338
John Langford, Martin Zinkevich, Sham Kakade Competitive Analysis of the Explore/Exploit Tradeoff ICML-2002
John Langford Generic Quantum Block Compression (at xxx.lanl.gov and Phys. Rev. A) May 2002
Sebastian Thrun, John Langford, and Vandi Verma, Risk Sensitive Particle Filters, NIPS2001.
John Langford and Rich Caruana, (Not) Bounding the True Error NIPS2001
John Langford, Matthias Seeger, and Nimrod Megiddo. An Improved Predictive Accuracy Bound for Averaging Classifiers ICML2001
Josh Tenenbaum, Vin de Silva and John Langford. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290, pages 2319-2323, 2000.
John Langford and David McAllester. Computable Shell Decomposition Bounds. COLT2000 and JMLR 5: 529-547 (2004)
Joseph O'Sullivan, John Langford, Rich Caruana and Avrim Blum. FeatureBoost: A Meta-Learning Algorithm that Improves Model Robustness. ICML2000
John Langford and Avrim Blum 1999. Microchoice Bounds and Self Bounding learning algorithms. COLT99 also, Machine Learning Journal 51(2): 165-179 (2003)
Avrim Blum, Adam Kalai, and John Langford 1999. Beating the Holdout: Bounds for KFold and Progressive Cross-Validation. COLT99
S. Thrun, John Langford, and Dieter Fox 1999. Monte Carlo Hidden Markov Models: Learning Non-Parametric Models of Partially Observable Stochastic Processes. ICML99
Avrim Blum and John Langford: Probabilistic Planning in the Graphplan Framework, ECP 1999.
Avrim Blum, Carl Burch, and John Langford: On Learning Monotone Boolean Functions, FOCS 1998.