Practitioner’s Guide to Data Science: Streamlining Data Science Solutions using Python, Scikit-Learn, and Azure ML Service Platform

Nasir Ali Mirza

Language: English

Publisher: BPB Publications

Published: Jan 17, 2022

Description:

Covers Data Science concepts, processes, and the real-world hands-on use cases.

KEY FEATURES

● Covers the journey from a basic programmer to an effective Data Science developer.

● Applied use of Data Science native processes like CRISP-DM and Microsoft TDSP.

● Implementation of MLOps using Microsoft Azure DevOps.

DESCRIPTION

"How is the Data Science project to be implemented?" has never been more conceptually sounding, thanks to the work presented in this book. This book provides an in-depth look at the current state of the world's data and how Data Science plays a pivotal role in everything we do.

This book explains and implements the entire Data Science lifecycle using well-known data science processes like CRISP-DM and Microsoft TDSP. The book explains the significance of these processes in connection with the high failure rate of Data Science projects.

The book helps build a solid foundation in Data Science concepts and related frameworks. It teaches how to implement real-world use cases using data from the HMDA dataset. It explains Azure ML Service architecture, its capabilities, and implementation to the DS team, who will then be prepared to implement MLOps. The book also explains how to use Azure DevOps to make the process repeatable while we're at it.

WHAT YOU WILL LEARN

● Organize Data Science projects using CRISP-DM and Microsoft TDSP.

● Learn to acquire and explore data using Python visualizations.

● Get well versed with the implementation of data pre-processing and Feature Engineering.

● Understand algorithm selection, model development, and model evaluation.

● Hands-on with Azure ML Service, its architecture, and capabilities.

● Learn to use Azure ML SDK and MLOps for implementing real-world use cases.

WHO THIS BOOK IS FOR

This book is intended for programmers who wish to pursue AI/ML development and build a solid conceptual foundation and familiarity with related processes and frameworks. Additionally, this book is an excellent resource for Software Architects and Managers involved in the design and delivery of Data Science-based solutions.

AUTHOR BIO

Nasir Ali Mirza is a Data Architect and Data Science Professional with over 20 years of experience in data technologies. He has designed and implemented large-scale data movement pipelines and data transformations for very large global organizations in the private and public sectors like Lehman Brothers, Caudwell Communications, Bell South, Museum of Science, Delaware State, Wells Fargo, Kennametal, and GEICO utilizing big data and analytics platforms.

He is currently working as a Data Architect at Applied Information Sciences designing and implementing modern data analytics solutions. Before joining AIS, he served in the Database and BI practice at Microsoft Global Services. In this role, he architected data solutions for customers in the banking, insurance, and telecom industries.

With a desire to share his expertise with others, he has been a presenter at Microsoft TechReady and a contributor to other data communities. He is a Microsoft Certified Professional who has completed the Microsoft Professional Program for Data Science and achieved numerous other Microsoft certifications. He takes a keen interest in community education programs and spends time in natural landscapes. He lives with his mother, wife, and three lovely daughters at what is known in the literature as 'Paradise on Earth' - Kashmir.