QuickSyllabus

データ科学と機械学習の数理 / Fundamental Mathematics for Machine Learning and Data Sciences

単位数: 2. 担当教員: 大関真之. 開講年度: 2024.

メディア授業科目/Media Class Subjects

○

主要授業科目/Essential Subjects

○

授業の目的・概要及び達成方法等

Google Classroomのクラスコードは工学部Webページにて確認すること。
学部シラバス・時間割(https://www.eng.tohoku.ac.jp/edu/syllabus-ug.html)

目的: 現代的なデータ科学と機械学習において現れる基本的な数理構造を解き明かし理解すること。
概要: 大規模なデータを学習しさまざまなタスクを実現する今日の人工知能技術のベースとなるのは、非常に単純な勾配法であり最適化問題を以下にして解くかである。その源泉は物理学で登場したような運動方程式によるものであり、それらといかに密接に関連しているのかを学ぶことにより、将来再び起こるブレークスルーを俯瞰できるようにする。
達成目標等: 現代の機械学習等に現れる数理最適化および学習技術について、流行り廃りから脱却し、その本質部分を単純にして理解できるようにすること。

オンライン授業で実施され詳細や連絡は下記Google Classroomを通じて行われる。
[TB26411] (大関真之教授): Google Classroomを利用

授業の目的・概要及び達成方法等(Ｅ）

The class code for Google Classroom can be found on the Web site of
the School of Engineering:
https://www.eng.tohoku.ac.jp/edu/syllabus-ug.html (JP Only)

Object and Summary: To unravel and understand the fundamental mathematical structures that emerge in modern data science and machine learning.
The basis of today's artificial intelligence technologies for learning large data sets and performing a variety of tasks is how to solve optimization problems using very simple gradient methods. The source is the equations of motion as they appear in physics, and by learning how they are closely related to these equations, we will be able to look at breakthroughs that will occur again in the future.
Goal.: To be able to understand mathematical optimization and learning techniques as they appear in modern machine learning and other areas, breaking free from fads and simplifying the essential parts of these techniques.

The course will be conducted online and details and communication will be via Google Classroom.
[TB26411] (Masayuki Ohzeki): using Google Classroom

他の授業科目との関連及び履修上の注意

履修要望科目：全学教育科目の解析学Ａ、Ｂ、線形代数学Ａ、B

他の授業科目との関連及び履修上の注意(Ｅ）

Linear Algebra A, B and Analysis A, B are relevant as other subjects

授業計画

1. データと確率モデル
2. 事後確率
3. 行列とベクトル
4. 最急降下法と代理関数
5. 近接勾配法
6. ニュートン法
7. ラグランジュ未定乗数
8. 拡張ラグランジュ法
9. カーネル法
10. ベイズ的最適化
11. 行列分解とクラスタリング
12. 非負値行列分解
13. Pythonを用いた実装-基本編
14. Pythonを用いた実装-ニューラルネットワーク編
15. Pythonを用いた実装-ベイズ最適化編

授業計画(Ｅ）

1. data and probability Model
2. posterior probabilities
3. Matrices and Vectors
4. steepest descent method and surrogate model
5. proximity gradient method
6. Newton's method
7. Lagrangian multiplier
8. Extended Lagrangian Method
9. Kernel method
10. Bayesian optimization
11. matrix factorization and clustering
12. nonnegative matrix factorization
13. Implementation in Python - Basic
14. Implementation in Python - Neural Networks
15. Implementation in Python - Bayesian Optimization

授業時間外学習

授業時間は限られているので、自主学習が重要になる。
毎回の授業に対して、予習（２時間）と復習（２時間）は最低限必要である。
手を動かして理解を深めること。

授業時間外学習（Ｅ）

Class time is limited, so the independent study is important.
Preparation (2 hours) and review (2 hours) for each class are required as a minimum.
Students should deepen their understanding by working with their hands.

成績評価方法及び基準

出題された各課題のレポートの評価により成績評価を行う。

成績評価方法及び基準(Ｅ）

Grading will be based on the evaluation of reports for each assignment.

教科書および参考書

画像処理の統計モデリング, 片岡駿、大関真之、安田宗樹、田中和之, 共立出版 (2018) ISBN/ISSN: 9784320111233 資料種別:教科書
機械学習のための連続最適化, 金森敬文、鈴木大慈、竹内一郎、佐藤一誠, 講談社サイエンティフィク (2016) ISBN/ISSN: 9784061529205 資料種別:参考書
確率的最適化, 鈴木大慈, 講談社サイエンティフィク (2015) ISBN/ISSN: 9784061529076 資料種別:参考書

オフィスアワー

電子メールにてアポイントをとった上で来室すること.

オフィスアワー（Ｅ）

Students should visit my office after taking an appointment by e-mail.