Skip to content

Latest commit

 

History

History
49 lines (24 loc) · 921 Bytes

stats.md

File metadata and controls

49 lines (24 loc) · 921 Bytes

Describing data 2: multivariate data [??]


Overview.

Python tools.

Applications.

Code.


Describing multivariate data: scatterplots, multivariate regression

Data science

Two paths, stats and cs. ...

  • Stats. Start with a model, use data to estimate its parameters (numbers)...

  • CS. Start with data, look for patterns.

Complementary...

Claudia's hospital example

http://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/

Pokemon: https://pixelastic.github.io/pokemonorbigdata/

http://statweb.stanford.edu/~tibs/stat315a/glossary.pdf

Simpson's paradox

Multivariate regression

https://matloff.wordpress.com/2016/03/07/after-150-years-the-asa-says-no-to-p-values/

References

http://sebastianraschka.com/faq/index.html

Strang's linear algebra course at MIT: http://ocw.mit.edu/courses/mathematics/18-06-linear-algebra-spring-2010/