• 设为首页
  • 点击收藏
  • 手机版
    手机扫一扫访问
    迪恩网络手机版
  • 关注官方公众号
    微信扫一扫关注
    公众号

PacktCode/Practical-Machine-Learning

原作者: [db:作者] 来自: 网络 收藏 邀请

开源软件名称(OpenSource Name):

PacktCode/Practical-Machine-Learning

开源软件地址(OpenSource Url):

https://github.com/PacktCode/Practical-Machine-Learning

开源编程语言(OpenSource Language):

Java 27.6%

开源软件介绍(OpenSource Introduction):

Practical-Machine-Learning

This book is best for professional data scientists or wanting-to-be data scientists who are looking at learning the fundamentals of Machine Learning Techniques and the most efficient ways of applying and implementing these machine learning techniques on large datasets using the most relevant machine learning frameworks and tools on or off Hadoop platform, given the problem definition, the hands-on way. The readers are expected to have basic programming skills in java and knowledge of any scripting languages will be a bonus.

This book focuses on exploring all the Machine Learning techniques and some specific behavioral differences or implementation intricacies with the parallel or distributed processing approach. Additionally, for each technique along with a deep dive on internals of each algorithm, example implementations using top and evolving machine learning frameworks and tools like R, SPSS, Apache Mahout, Python, Julia and Spark is explained. This book helps readers master Machine Learning techniques and gain ability to identify and apply appropriate techniques in the given problem context. In the context of large datasets, multi-core cluster based learning, distributed learning, parallel computation tools and libraries and more. The readers will be exposed to a list of machine learning frameworks and for each of the frameworks detailed implementation aspects like function libraries, syntax, installation or set-up and integration with Hadoop (wherever applicable) will be covered.

Until recent past, the machine learning community has assumed sequential algorithms on data that fits in memory. This assumption is no longer realistic for many recent scenarios and has brought in some interesting perspectives to Advanced Machine Learning. Despite this growing interest, there haven’t been many publications on how these solutions integrate with our data management systems. The success of data-driven solutions for complex problems with the dropping infrastructure or storage costs has brought focus on large scale machine learning. Below is a list of topics that will be covered in this book:

  1. Learn and master platforms, algorithms, and applications for machine learning techniques classified under supervised, unsupervised, semi-supervised, reinforcement and deep learning.
  2. Analyze and prepare large data sets and design your own machine learning system
  3. Take a deep dive into each of the machine learning algorithm and learn how to implement in more than one ways (Explore alternative implementation platforms and learn how to rationalize which one to choose), given the problem context.
  4. For each of the identified platforms, learn how to set-up environment, load large scale data and explore the syntax and understand the implementation nuances.
  5. How does Machine Learning link with Hadoop? Understand Hadoop as a platform for distributed and parallel processing paradigm.
  6. For each of the Machine Learning Technique, take a deep dive into the internals of the concept and implement using one or more of the identified tools or libraries that includes Mahout, R, Python, SPSS and Spark. For each of the libraries or framework: a. Learn to set-up the environment b. Develop machine learning programs for real world examples, c. Deploy and execute these programs on large data sets in Hadoop (wherever applicable) to identify precise patterns and predict the outcomes.

This book covers all important machine learning techniques that include:

  1. Chapter 5: Decision Tree based learning methods - Decision trees using C4.5, C5.0 and Random Forests
  2. Chapter 6: Association rule based learning methods - Apriori and FP-growth
  3. Chapter 7: Instance based learning methods - K-Nearest Neighbors
  4. Chapter 7: Kernel based learning methods - Supprt Vector Machines
  5. Chapter 8: Clustering based learning methods - K means clustering
  6. Chapter 9: Bayesian learning methods - Naive Bayes
  7. Chapter 10: Regression learning methods - Linear and Logistic regression
  8. Chapter 11: Deep learning methods
  9. Chapter 12: Reinforcement learning methods - Q-learning
  10. Chapter 13: Ensemble methods - Bosstong (Ada, Gradient), Random forests

For each of the learning methods the implementation source code is provided in the following programing languauges

  1. Apache Mahout
  2. R
  3. Spark - MLib
  4. Python (sckit-learn)
  5. Julia (Java & Scala based)

The project structure is maintained per programming language wise, further by chapter and then specific algorithm.




鲜花

握手

雷人

路过

鸡蛋
该文章已有0人参与评论

请发表评论

全部评论

专题导读
上一篇:
ChicagoBoothML/MachineLearning_Fall2015: BUS 41204: Machine Learning发布时间:2022-08-18
下一篇:
ml-tooling/best-of-ml-python: 发布时间:2022-08-18
热门推荐
阅读排行榜

扫描微信二维码

查看手机版网站

随时了解更新最新资讯

139-2527-9053

在线客服(服务时间 9:00~18:00)

在线QQ客服
地址:深圳市南山区西丽大学城创智工业园
电邮:jeky_zhao#qq.com
移动电话:139-2527-9053

Powered by 互联科技 X3.4© 2001-2213 极客世界.|Sitemap