Review: Reinforcement Learning
Tabular Value Based Methods Backup Monte-Carlo backup: zero bias, high variance. [\begin{align} V(S_t) \leftarrow V(S_t) + \alpha(G_t - V(S_t)) \end{align}] Temporal difference backup: hig...
Tabular Value Based Methods Backup Monte-Carlo backup: zero bias, high variance. [\begin{align} V(S_t) \leftarrow V(S_t) + \alpha(G_t - V(S_t)) \end{align}] Temporal difference backup: hig...
《C++ Primer》读书笔记。 基本作用 const 对象创建后,值不能再改变。 初始化: int i = 42; const int ci = i; // 正确:i 的值被拷贝给了 ci int j = ci; // 正确: ci 的值被拷贝给了 j 默认情况下,const 对象仅在文件内有效。 需要文件间共享时,使用 extern 关键字。 ...
Tasks Case: Discover side effects for hypertension medications. The text mining pipeline: Filter the data: Retrieve relevant messages. Process the data: Clean, anonymize. Create training dat...
Basics Hash function: A hash function maps hash-keys of some data type to integer bucket numbers. A good hash function distributes the possible hash-key values approximately evenly among buckets. A...
Building Blocks Brief History Early booming (50’s - early 60’s) Rosenblatt (1958) Perceptron: Network of threshold nodes for pattern classification. Perceptron convergence theorem: Everything ...