Book Review: Tree-based Methods for Statistical Learning in R

Feb 23, 2023 by iHash

Here’s a new title that is a “must have” for any data scientist who uses the R language. It’s a wonderful learning resource for tree-based techniques in statistical learning, and it has become my go-to text when I need to do a deep dive into various ML topic areas for my work. The methods discussed are the cornerstone of making predictions from tabular data sets: decision trees, ensemble methods like random forests, and of course the industry’s darling, gradient boosting machines (GBMs). Algorithms like XGBoost are king of the hill for problems involving tabular data. A number of timely and somewhat high-profile benchmarks show that this class of algorithm beats deep learning algorithms in many problem domains.

This book, “Tree-based Methods for Statistical Learning in R,” is by Brandon M. Greenwell, a data scientist with 84.51°, where he works on a diverse team to enable, empower, and enculturate statistical and machine learning best practices where applicable to help others solve real business problems. Greenwell’s book covers important topics such as decision trees and tree-based ensembles, including random forests and gradient boosting machines. Chapter 7 on random forests and Chapter 8 on GBMs are brimming with information that provides a strong foundation for doing real-world machine learning (along with a moderate amount of math throughout), coupled with plenty of code examples.

The book is primarily aimed at researchers and practitioners who want to go beyond a fundamental understanding of tree-based methods. It could also serve as a useful supplementary text for a graduate level course on statistical/machine learning. Some parts of the book necessarily involve more math and notation than others. For example, Chapter 3 on conditional inference trees involves a bit of linear algebra and matrix notation, but the math-oriented sections can often be skipped without sacrificing too much in the way of understanding the core concepts. The code examples should also help drive the main concepts home by connecting the math to simple coding logic.

The book does assume some familiarity with the basics of machine learning, as well as the R programming language. Useful references and resources are provided in the introductory material in Chapter 1. While Greenwell tries to provide sufficient detail and background where possible, some topics receive only a cursory treatment. Whenever possible he makes an effort to point the more ambitious reader in the right direction in terms of useful references.

The author developed an R package, “treemisc,” expressly to support the examples in the book; it is available on CRAN and in a GitHub repo set up by the author. The R code from the book is also available. I found the code in the book to be straightforward and easy to understand. There are also plenty of insightful data visualizations. NOTE: this is not a Tidyverse book; it opts instead for traditional R coding practices.

For background material, I thought Chapter 2 was superb in its coverage of classification and regression trees (CART), originally proposed by Leo Breiman and his coauthors in their seminal 1984 book on the subject. I found Chapters 7 and 8 to be the most useful. Chapter 7 does a great job of outlining and drilling down into random forests, while Chapter 8 does the same for GBMs. At the end of Chapter 8 you’ll find a brief discussion of the most popular boosting algorithms: XGBoost, LightGBM, and CatBoost. Section 8.9.4 has a very nice code example for using XGBoost. Chapter 5 on ensemble algorithms includes a useful treatment of bagging (bootstrap aggregating) and boosting. Finally, Chapter 6 covers ML interpretability, a hot topic in the industry right now.
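For readers who have not met bagging before, the idea can be sketched in a few lines. This is a minimal illustration, not code from the book or from the treemisc package: it is written in plain Python rather than R so that it needs no packages, it uses a one-split “stump” on a single numeric feature as the base learner, and the function names (fit_stump, bagged_stumps, predict_bagged) are hypothetical.

```python
import random
from collections import Counter


def fit_stump(xs, ys):
    """Fit a one-split classifier on a single numeric feature.

    Tries every observed value as a threshold and keeps the split with
    the fewest training errors. Returns (threshold, left_label, right_label).
    """
    best = None
    for t in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= t]
        right = [y for x, y in zip(xs, ys) if x > t]
        # Each side predicts its majority class; an empty right side
        # falls back to the left-side label.
        left_label = Counter(left).most_common(1)[0][0]
        right_label = Counter(right).most_common(1)[0][0] if right else left_label
        errors = sum(y != left_label for y in left) + sum(y != right_label for y in right)
        if best is None or errors < best[0]:
            best = (errors, t, left_label, right_label)
    return best[1:]


def predict_stump(stump, x):
    """Predict the class of x with a fitted stump."""
    t, left_label, right_label = stump
    return left_label if x <= t else right_label


def bagged_stumps(xs, ys, n_models=25, seed=0):
    """Bootstrap aggregating: fit one stump per bootstrap resample."""
    rng = random.Random(seed)
    n = len(xs)
    models = []
    for _ in range(n_models):
        # Draw n row indices with replacement (the bootstrap sample).
        idx = [rng.randrange(n) for _ in range(n)]
        models.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return models


def predict_bagged(models, x):
    """Aggregate the individual predictions by majority vote."""
    votes = Counter(predict_stump(m, x) for m in models)
    return votes.most_common(1)[0][0]
```

Each model sees a different resample of the training rows, and the ensemble prediction is the majority vote. A random forest builds on the same scheme with deeper trees and, additionally, a random subset of features considered at each split.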

So Many Packages, So Little Time

Another area in which this book excels is making the reader aware of all the great tree-based R packages that are out there. I learned about a bunch of packages I never knew existed. For example, Chapter 3 identifies implementations of CTree, one of the more important developments in recursive partitioning in the past two decades. I learned that it is only available in R (see the party and partykit packages), a good reason to have R programming in your data science arsenal.

Contributed by Daniel D. Gutierrez, Editor-in-Chief and Resident Data Scientist for insideBIGDATA. In addition to being a tech journalist, Daniel is a data science consultant, author, and educator, and he sits on a number of advisory boards for various start-up companies.

Filed Under: BigData

Copyright iHash.eu © 2023