<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="ru">
	<id>https://www.wikicshse.ru/index.php?action=history&amp;feed=atom&amp;title=Statistical_learning_theory_2022</id>
	<title>Statistical learning theory 2022 - История изменений</title>
	<link rel="self" type="application/atom+xml" href="https://www.wikicshse.ru/index.php?action=history&amp;feed=atom&amp;title=Statistical_learning_theory_2022"/>
	<link rel="alternate" type="text/html" href="https://www.wikicshse.ru/index.php?title=Statistical_learning_theory_2022&amp;action=history"/>
	<updated>2026-06-06T13:28:27Z</updated>
	<subtitle>История изменений этой страницы в вики</subtitle>
	<generator>MediaWiki 1.45.3</generator>
	<entry>
		<id>https://www.wikicshse.ru/index.php?title=Statistical_learning_theory_2022&amp;diff=714&amp;oldid=prev</id>
		<title>imported&gt;Bauwens: Migrated current public revision from wiki.cs.hse.ru</title>
		<link rel="alternate" type="text/html" href="https://www.wikicshse.ru/index.php?title=Statistical_learning_theory_2022&amp;diff=714&amp;oldid=prev"/>
		<updated>2023-09-04T16:22:11Z</updated>

		<summary type="html">&lt;p&gt;Migrated current public revision from wiki.cs.hse.ru&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Новая страница&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
== General Information ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Lectures: Friday 16h20 -- 17h40, [https://www.hse.ru/en/org/persons/160550073 Bruno Bauwens], [https://www.hse.ru/staff/mkaledin Maxim Kaledin], room M202 and on [https://us02web.zoom.us/j/82300259484?pwd=NWxXekxBeE5yMm9UTmwvLzNNNGlnUT09 zoom]&lt;br /&gt;
&lt;br /&gt;
Seminars: Saturday 14h40 -- 16h00, [https://www.hse.ru/org/persons/225526439 Artur Goldman], room M202 and on zoom (the link will be in telegram)&lt;br /&gt;
&lt;br /&gt;
To discuss the materials, join the [https://t.me/+G0VKOE2-nnkwNDE0 telegram group] The course is similar to [http://wiki.cs.hse.ru/Statistical_learning_theory_2021 last year].&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Problems exam ==&lt;br /&gt;
&lt;br /&gt;
December 21, 13h-16h, computer room G403 ([https://us02web.zoom.us/j/82300259484?pwd=NWxXekxBeE5yMm9UTmwvLzNNNGlnUT09 zoomlink] for students abroad)&amp;lt;br&amp;gt;&lt;br /&gt;
-- You may use handwritten notes, lecture materials from this wiki (either printed or through your PC), Mohri&amp;#039;s book  &amp;lt;br&amp;gt;&lt;br /&gt;
-- You may not search on the internet or interact with other humans (e.g. by phone, forums, etc) &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Course materials ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Video !! Summary !! Slides !! Lecture notes !! Problem list !! Solutions&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|| &amp;#039;&amp;#039;Part 1. Online learning&amp;#039;&amp;#039; &lt;br /&gt;
|-&lt;br /&gt;
| 02 Sept&lt;br /&gt;
|| Philosophy. The online mistake bound model. The halving and weighted majority algorithms [https://drive.google.com/drive/folders/1NXiLbhmO2Ml7jFmnLtjqhOgCoHg7yn9T?usp=sharing movies]&lt;br /&gt;
|| [https://www.dropbox.com/s/ryvpnfqfrwyurjc/01slides.pdf?dl=0 sl01]&lt;br /&gt;
|| [https://www.dropbox.com/s/oncvg4mxulbt56d/00book_intro.pdf?dl=0 ch00] [https://www.dropbox.com/s/i9pc4kf0zsdeksb/01book_onlineMistakeBound.pdf?dl=0 ch01]&lt;br /&gt;
|| [https://www.dropbox.com/s/ztk3n9s5c0vuzd9/01sem.pdf?dl=0 list 1] &amp;lt;span style=&amp;quot;color:red&amp;quot;&amp;gt;update 05.09&amp;lt;/span&amp;gt;&lt;br /&gt;
|| [https://www.dropbox.com/s/r528uroi60gow08/01sol.pdf?dl=0 solutions 1]&lt;br /&gt;
|-&lt;br /&gt;
| [https://drive.google.com/file/d/16OoCqhh16BKQzyF-HM8RozigyJ3BBVxA/view?usp=sharing 09 Sept]&lt;br /&gt;
|| The perceptron algorithm. The standard optimal algorithm.&lt;br /&gt;
|| [https://www.dropbox.com/s/sy959ee81mov5cr/02slides.pdf?dl=0 sl02] &lt;br /&gt;
|| [https://www.dropbox.com/s/p3auugqwc89132b/02book_sequentialOptimalAlgorithm.pdf?dl=0 ch02] [https://www.dropbox.com/s/b00dcqk1rob7rdz/03book_perceptron.pdf?dl=0 ch03]&lt;br /&gt;
|| [https://www.dropbox.com/s/88jgjvxo16zfrjs/02sem.pdf?dl=0 list 2] &amp;lt;span style=&amp;quot;color:red&amp;quot;&amp;gt;update 25.09&amp;lt;/span&amp;gt;&lt;br /&gt;
|| [https://www.dropbox.com/s/pqblktfky8to5hr/02sol.pdf?dl=0 solutions 2]&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.youtube.com/watch?v=xgyPvnDkyZs 16 Sept]&lt;br /&gt;
|| Kernels and the kernel perceptron algorithm. Prediction with expert advice. Recap probability theory. &lt;br /&gt;
|| [https://www.dropbox.com/s/a60p9b76cxusgqy/03slides.pdf?dl=0 sl03]&lt;br /&gt;
|| [https://www.dropbox.com/s/3vtxvs4esnvbhlb/04book_predictionWithExperts.pdf?dl=0 ch04] [https://www.dropbox.com/s/l11afq1d0qn6za7/05book_introProbability.pdf?dl=0 ch05]&lt;br /&gt;
|| [https://www.dropbox.com/s/fnx2cl5wsgjbmel/03sem.pdf?dl=0 list 3]&lt;br /&gt;
|| [https://www.dropbox.com/s/ysg3nipuzryzqoc/03sol.pdf?dl=0 solutions 3]&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|| &amp;#039;&amp;#039;Part 2. Distribution independent risk bounds&amp;#039;&amp;#039; &lt;br /&gt;
|-&lt;br /&gt;
| [https://www.youtube.com/watch?v=IatjLyN3dRk 23 Sept]&lt;br /&gt;
|| Sample complexity in the realizable setting, simple examples and bounds using VC-dimension&lt;br /&gt;
|| [https://www.dropbox.com/s/pi0f3wab1xna6d7/04slides.pdf?dl=0 sl04]&lt;br /&gt;
|| [https://www.dropbox.com/s/nh4puyv7nst4ems/06book_sampleComplexity.pdf?dl=0 ch06] &lt;br /&gt;
|| [https://www.dropbox.com/s/u5dtpm69bu52fpn/04sem.pdf?dl=0 list 4]&lt;br /&gt;
|| [https://www.dropbox.com/s/yurv5s42w3kw5vv/04sol.pdf?dl=0 solutions 4]&lt;br /&gt;
|- &lt;br /&gt;
| [https://www.youtube.com/watch?v=8J5B9CCy-ws 30 Sept]&lt;br /&gt;
|| Growth functions, VC-dimension and the characterization of sample comlexity with VC-dimensions&lt;br /&gt;
|| [https://www.dropbox.com/s/rpnh6288rdb3j8m/05slides.pdf?dl=0 sl05]&lt;br /&gt;
|| [https://www.dropbox.com/s/eurz2vkvt1wa5zm/07book_growthFunctions.pdf?dl=0 ch07] [https://www.dropbox.com/s/m7xe7k39qzmzapv/08book_VCdimension.pdf?dl=0 ch08]&lt;br /&gt;
|| [https://www.dropbox.com/s/u1vpi28gwf0zig4/05sem.pdf?dl=0 list 5]&lt;br /&gt;
|| [https://www.dropbox.com/s/3sq7yzv7v4l9tbb/05sol.pdf?dl=0 solutions 5]&lt;br /&gt;
|-&lt;br /&gt;
| [https://drive.google.com/file/d/17zynIg_CZ6cCNBig5QXmBx7VFS8peyuU/view?usp=sharing 07 Oct]&lt;br /&gt;
|| Risk decomposition and the fundamental theorem of statistical learning theory&lt;br /&gt;
|| [https://www.dropbox.com/s/0p8r5wgjy1hlku2/06slides.pdf?dl=0 sl06]&lt;br /&gt;
|| [https://www.dropbox.com/s/8c87619ewkyod4f/09book_riskBounds.pdf?dl=0 ch09]&lt;br /&gt;
|| [https://www.dropbox.com/s/eyfczsuwz60moj7/06sem.pdf?dl=0 list 6]&lt;br /&gt;
|| [https://www.dropbox.com/s/1te4fzlwj72v6ph/06sol.pdf?dl=0 solutions 6]&lt;br /&gt;
|-&lt;br /&gt;
| [https://www.youtube.com/watch?v=yMsUH1brAs8 14 Oct]&lt;br /&gt;
|| Bounded differences inequality, Rademacher complexity, symmetrization, contraction lemma, [https://www.dropbox.com/s/8uravgo5shas55g/07quiz.pdf?dl=0 quiz]&lt;br /&gt;
|| [https://www.dropbox.com/s/kfithyq0dgcq6h8/07slides.pdf?dl=0 sl07]&lt;br /&gt;
|| [https://www.dropbox.com/s/fg4seoqjbeb7a5g/10book_measureConcentration.pdf?dl=0 ch10] [https://www.dropbox.com/s/hfrvhebbsskbk6g/11book_RademacherComplexity.pdf?dl=0 ch11]&lt;br /&gt;
|| [https://www.dropbox.com/s/qofutar8qy5y53i/07sem.pdf?dl=0 list 7] &amp;lt;span style=&amp;quot;color:red&amp;quot;&amp;gt;update 15.10&amp;lt;/span&amp;gt;&lt;br /&gt;
|| [https://www.dropbox.com/s/w1ud2okt120vm5j/07sol.pdf?dl=0 solutions 7]&lt;br /&gt;
|-&lt;br /&gt;
| &lt;br /&gt;
|| &amp;#039;&amp;#039;Part 3. Margin risk bounds with applications&amp;#039;&amp;#039; &lt;br /&gt;
|-&lt;br /&gt;
| [https://drive.google.com/file/d/1L-BeDxhoHcoDrdlVTlfoMFwnWXKV46cr/view?usp=sharing 21 Oct]&lt;br /&gt;
|| Simple regression, support vector machines, margin risk bounds, and neural nets &lt;br /&gt;
|| [https://www.dropbox.com/s/oo1qny9busp3axn/08slides.pdf?dl=0 sl08]&lt;br /&gt;
|| [https://www.dropbox.com/s/573a2vtjfx8qqo8/12book_regression.pdf?dl=0 ch12] [https://www.dropbox.com/s/jaym44fmif2uw05/13book_SVM.pdf?dl=0 ch13]&lt;br /&gt;
|| [https://www.dropbox.com/s/bzo6msrxcfa8tpp/08sem.pdf?dl=0 list 8]&lt;br /&gt;
|| [https://www.dropbox.com/s/pe7yctcr93yaw95/08sol.pdf?dl=0 solutions 8]&lt;br /&gt;
|-&lt;br /&gt;
| [https://youtu.be/9FhFxLHR4eE 04 Nov]&lt;br /&gt;
|| Kernels: RKHS, representer theorem, risk bounds&lt;br /&gt;
|| [https://www.dropbox.com/s/jst60ww8ev4ypie/09slides.pdf?dl=0 sl09]&lt;br /&gt;
|| [https://www.dropbox.com/s/ply602zthd7r3jv/14book_kernels.pdf?dl=0 ch14]&lt;br /&gt;
|| [https://www.dropbox.com/s/54xdufimavhd646/09sem.pdf?dl=0 list 9]&lt;br /&gt;
|| [https://www.dropbox.com/s/i3rx26ya6kvm5p2/09sol.pdf?dl=0 solutions 9]&lt;br /&gt;
|- &lt;br /&gt;
| [https://youtu.be/1oUXZy6Sqlk 11 Nov]&lt;br /&gt;
|| AdaBoost and the margin hypothesis&lt;br /&gt;
|| [https://www.dropbox.com/s/umum3kd9439dt42/10slides.pdf?dl=0 sl10]&lt;br /&gt;
|| [https://www.dropbox.com/s/e7m1cs7e8ulibsf/15book_AdaBoost.pdf?dl=0 ch15]&lt;br /&gt;
|| [https://www.dropbox.com/s/nu4a55qbfqlp3bl/10sem.pdf?dl=0 list 10]&lt;br /&gt;
|| [https://www.dropbox.com/s/t64mjapdzcm1313/10sol.pdf?dl=0 solutions 10]&lt;br /&gt;
|- &lt;br /&gt;
| [https://youtu.be/GL574ljefJ8 18 Nov]&lt;br /&gt;
|| Implicit regularization of stochastic gradient descent in neural nets&lt;br /&gt;
|| &lt;br /&gt;
|| [https://www.dropbox.com/s/b4xac5uki7l1ysq/16book_implicitRegularization.pdf?dl=0 ch16]&lt;br /&gt;
|| no seminar&lt;br /&gt;
|| &lt;br /&gt;
|-&lt;br /&gt;
|&lt;br /&gt;
|| &amp;#039;&amp;#039;Part 4. Other topics&amp;#039;&amp;#039; &lt;br /&gt;
|-&lt;br /&gt;
| [https://youtu.be/kOXi_m9dBzE 25 Nov]&lt;br /&gt;
|| Regression  I: fixed design with sub-Gaussian noise&lt;br /&gt;
|| &lt;br /&gt;
|| [https://disk.yandex.ru/i/tI7NiGsvQP0Jww notes12]&lt;br /&gt;
|| [https://disk.yandex.ru/d/9fFxVlMw4kPfEQ list 12]&lt;br /&gt;
|| [https://disk.yandex.ru/i/5vBE2VC7zNC3Rg solutions 12]&lt;br /&gt;
|-&lt;br /&gt;
| [https://youtu.be/GEYT_IxXEX0 02 Dec]&lt;br /&gt;
|| Multiarmed bandids I&lt;br /&gt;
|| &lt;br /&gt;
|| [https://disk.yandex.ru/i/lvqXofEbaFkfAA notes13]&lt;br /&gt;
|| [https://disk.yandex.ru/i/ZXXJbBiJUPNiOw list 13]&lt;br /&gt;
||&lt;br /&gt;
|-&lt;br /&gt;
| [https://youtu.be/Uybf6mCp2Es 09 Dec]&lt;br /&gt;
|| Multiarmed bandids II (optional)&lt;br /&gt;
||&lt;br /&gt;
|| [https://disk.yandex.ru/i/Nqy9-wmZ-g5o8g notes14]&lt;br /&gt;
|| [https://disk.yandex.ru/d/0Qupo2CNSjS_pQ notebook], [https://disk.yandex.ru/d/suY6d58SFf09Bg notebook(solved)]&lt;br /&gt;
||&lt;br /&gt;
|-&lt;br /&gt;
| 16 Dec&lt;br /&gt;
|| &amp;#039;&amp;#039;Colloquium&amp;#039;&amp;#039;&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
The lectures in October and November are based on the book:&lt;br /&gt;
Foundations of machine learning 2nd ed, Mehryar Mohri, Afshin Rostamizadeh, and Ameet Talwalker, 2018. This book can be downloaded from [https://libgen.is Library Genesis] (the link changes sometimes and sometimes vpn is needed).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- A gentle introduction to the materials of the first 3 lectures and an overview of probability theory, can be found in chapters 1-6 and 11-12 of the following book:&lt;br /&gt;
Sanjeev Kulkarni and Gilbert Harman: An Elementary Introduction to Statistical Learning Theory, 2012.--&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Grading formula ==&lt;br /&gt;
&lt;br /&gt;
Final grade = 0.35 * [score of homeworks] + 0.35 * [score of colloquium] + 0.3 * [score on the exam] + bonus from quizzes.&lt;br /&gt;
&lt;br /&gt;
All homework questions have the same weight. Each solved extra homework task increases the score of the final exam by 1 point. &lt;br /&gt;
&lt;br /&gt;
There is no rounding except on the final grade. Grades fractional grades above 5/10 are rounded up, those below 5/10 are rounded down. &lt;br /&gt;
&lt;br /&gt;
Autogrades: if you only need 4/10 to pass with maximal final score, it will be given automatically. This may happen because of extra questions and bonuses from quizzes. &lt;br /&gt;
&lt;br /&gt;
For students who want to pass with 4/10 with minimal effort: each year on the exam, I ask to calculate the VC-dimension or Rademacher complexity of some class. It should be easy to have 4/10 for the final exam. If you understand all lecture notes, you pass the colloquium with maximal score. Together this is enough. If only a few students fail and the grades are at least 3.8/10 then failed students may resubmit a few homework tasks to pull up the grade. (This happened in the last 3 years.)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Colloquium ==&lt;br /&gt;
&lt;br /&gt;
[https://www.dropbox.com/s/7djya6nc8ietd32/colloqQuest.pdf?dl=0 Rules and questions.] Update 12/12 added question 24 and corrected typos. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;!-- [https://docs.google.com/spreadsheets/d/13ox_EN6YJBEC93A6YgbbzawXE2RfynyQTUf90SXE4GQ/edit?usp=sharing Choose the day: 16 or 17 Dec.] --&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Homeworks ==&lt;br /&gt;
&lt;br /&gt;
Email to brbauwens-at-gmail.com. Start the subject line with SLT-HW.&lt;br /&gt;
&lt;br /&gt;
Deadline before the start of the lecture, every other lecture.&lt;br /&gt;
&lt;br /&gt;
Sat. 17 Sept 18h10: problems 1.7, 1.8, 2.9, and 2.11 &amp;lt;br&amp;gt;&lt;br /&gt;
Sat. 01 Oct 18h10: see lists 3 and 4, and 2.10  &amp;lt;br&amp;gt;&lt;br /&gt;
Fri. 14 Oct 16h20: see problem lists 5 and 6 &amp;lt;br&amp;gt; &lt;br /&gt;
Sat. 05 Nov 20h00: see problem lists 7 and 8 &amp;lt;br&amp;gt;&lt;br /&gt;
Sat. 19 Nov 20h00: see problem lists 9 and 10 &amp;lt;br&amp;gt;&lt;br /&gt;
Sun. 04 Dec 23h59: see problem list 12 send it to maxkaledin@gmail.com with subject line SLT-HW-Reg &amp;lt;YourName&amp;gt;_&amp;lt;YourSurname&amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Office hours ==&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! Person !! Monday !! Tuesday !! Wednesday !! Thursday !! Friday !! &lt;br /&gt;
|-&lt;br /&gt;
|  Bruno Bauwens || 15-20h ||  || || || 18-20h ||  &lt;br /&gt;
|-&lt;br /&gt;
|  Maxim Kaledin || Write || in  || Telegram  || time is || flexible ||  &lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
It is always good to send an email in advance. Questions and feedback are welcome. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;!--&lt;br /&gt;
== Russian texts  ==&lt;br /&gt;
&lt;br /&gt;
The following links might help students who have trouble with English.  A [http://www.machinelearning.ru/wiki/images/d/d9/Voron-2011-tnop.pdf  lecture] on VC-dimensions was given by K. Vorontsov.&lt;br /&gt;
A [http://machinelearning.ru/wiki/index.php?title=%D0%A2%D0%B5%D0%BE%D1%80%D0%B8%D1%8F_%D1%81%D1%82%D0%B0%D1%82%D0%B8%D1%81%D1%82%D0%B8%D1%87%D0%B5%D1%81%D0%BA%D0%BE%D0%B3%D0%BE_%D0%BE%D0%B1%D1%83%D1%87%D0%B5%D0%BD%D0%B8%D1%8F_(%D0%BA%D1%83%D1%80%D1%81_%D0%BB%D0%B5%D0%BA%D1%86%D0%B8%D0%B9%2C_%D0%9D._%D0%9A._%D0%96%D0%B8%D0%B2%D0%BE%D1%82%D0%BE%D0%B2%D1%81%D0%BA%D0%B8%D0%B9) course] on Statistical Learning Theory by Nikita Zhivotovsky is given at MIPT. Some short description about PAC learning on p136 in the [http://gen.lib.rus.ec/search.php?req=%D0%9D%D0%B0%D1%83%D0%BA%D0%B0+%D0%B8+%D0%B8%D1%81%D0%BA%D1%83%D1%81%D1%81%D1%82%D0%B2%D0%BE+%D0%BF%D0%BE%D1%81%D1%82%D1%80%D0%BE%D0%B5%D0%BD%D0%B8%D1%8F+%D0%B0%D0%BB%D0%B3%D0%BE%D1%80%D0%B8%D1%82%D0%BC%D0%BE%D0%B2%2C+%D0%BA%D0%BE%D1%82%D0%BE%D1%80%D1%8B%D0%B5+%D0%B8%D0%B7%D0%B2%D0%BB%D0%B5%D0%BA%D0%B0%D1%8E%D1%82+%D0%B7%D0%BD%D0%B0%D0%BD%D0%B8%D1%8F+%D0%B8%D0%B7+%D0%B4%D0%B0%D0%BD%D0%BD%D1%8B%D1%85&amp;amp;lg_topic=libgen&amp;amp;open=0&amp;amp;view=simple&amp;amp;res=25&amp;amp;phrase=0&amp;amp;column=def book] &lt;br /&gt;
``Наука и искусство построения алгоритмов, которые извлекают знания из данных&amp;#039;&amp;#039;, Петер Флах. On [http://www.machinelearning.ru machinelearning.ru] &lt;br /&gt;
you can find brief and clear definitions.&lt;br /&gt;
--&amp;gt;&lt;/div&gt;</summary>
		<author><name>imported&gt;Bauwens</name></author>
	</entry>
</feed>