Prob/Stats Any good math-theory-focused books on neural networks and data science?

AI Thread Summary
The discussion centers on finding books that provide a mathematically rigorous approach to data science, particularly in the context of neural networks and optimization theorems. Recommendations include "Algorithms for Optimization" by Kochenderfer, which features practical examples in Julia, and "The Hundred-Page Machine Learning Book" by Burkov, available as a try-and-buy online option. Additionally, "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Geron is noted for its mathematical discussions, though it may not meet the desired level of rigor. The conversation acknowledges that many rigorous insights are currently found in academic papers rather than textbooks. Several relevant research papers on function approximation, gradient descent effectiveness, and error estimation are also shared, indicating ongoing advancements in the field.
s00mb
Messages
33
Reaction score
10
Hi. I'm looking for books on data science, preferably leaning towards neural networks, that focus on mathematical rigor. For example, theorems on optimization, minimum number of layers to accomplish a task efficiently, etc. Most books I've seen seem to hand wave this stuff. Anyone know any juicy books on the topic?
 
Physics news on Phys.org
There's a rather recent book by Kochenderfer called Algorithms for Optimization with many examples written in Julia, a hot programming language from MIT that folks are using for numerical work in diverse fields including ML and Data Science.

https://www.amazon.com/dp/0262039427/?tag=pfamazon01-20

There's also the 100 page ML book by Burkov:

https://www.amazon.com/dp/199957950X/?tag=pfamazon01-20

which is available online as a try and buy book.

Lastly, Geron's book Hands-on ML with Scikit-Learn, Keras and Tensorflow:

https://www.amazon.com/dp/1492032646/?tag=pfamazon01-20

All are good books that discuss the math behind the ML although not at the rigor you're looking for.
 
I looked through "Data mining" by Witten, Frank, Hall and, Pal which covers most of these, but it isn't rigorous like the good old math analysis books. No wonder for such a rapidly developing (practical) area.
 
By looking around, it seems like Dr. Hassani's books are great for studying "mathematical methods for the physicist/engineer." One is for the beginner physicist [Mathematical Methods: For Students of Physics and Related Fields] and the other is [Mathematical Physics: A Modern Introduction to Its Foundations] for the advanced undergraduate / grad student. I'm a sophomore undergrad and I have taken up the standard calculus sequence (~3sems) and ODEs. I want to self study ahead in mathematics...
Back
Top