Bookmarks tagged [bigdata]
https://github.com/onurakpolat/awesome-bigdata#readme
A curated list of awesome big data frameworks, ressources and other awesomeness. - onurakpolat/awesome-bigdata
- tags: awesome-list, bigdata
source code
https://github.com/awesomedata/awesome-public-datasets#readme
A topic-centric list of HQ open datasets. PR ☛☛☛. Contribute to awesomedata/awesome-public-datasets development by creating an account on GitHub.
- tags: awesome-list, bigdata, datasets
source code
https://github.com/youngwookim/awesome-hadoop#readme
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources - youngwookim/awesome-hadoop
- tags: awesome-list, bigdata, hadoop
source code
https://github.com/igorbarinov/awesome-data-engineering#readme
A curated list of data engineering tools for software developers - igorbarinov/awesome-data-engineering
https://github.com/manuzhang/awesome-streaming#readme
a curated list of awesome streaming frameworks, applications, etc - manuzhang/awesome-streaming
- tags: awesome-list, bigdata, streaming
source code
https://github.com/awesome-spark/awesome-spark#readme
A curated list of awesome Apache Spark packages and resources. - awesome-spark/awesome-spark
- tags: awesome-list, bigdata, apache-spark
source code
https://www.splunk.com/pdfs/ebooks/the-essential-guide-to-machine-data.pdf
Whatever you call it, machine data is one of the most underused and undervalued assets of any organization. And, unfortunately, it’s usually kept for some minimum amount of time before being tossed ou...
http://dataminingguide.books.yourtion.com
https://github.com/linyiqun/DataMiningAlgorithm
https://github.com/Flowerowl/Big-Data-Resources
https://code.csdn.net/CODE_Translation/spark_matei_phd
https://aiyanbo.gitbooks.io/spark-programming-guide-zh-cn/content/