Search Results: learning-jupyter

Learning Jupyter

Author: Dan Toomey

Publisher: Packt Publishing Ltd

ISBN: 1785889451

Category: Computers

Page: 238

View: 9514

Learn how to write code, mathematics, graphics, and output, all in a single document, as well as in a web browser using Project Jupyter About This Book Learn to write, execute, and comment your live code and formulae all under one roof using this unique guide This one-stop solution on Project Jupyter will teach you everything you need to know to perform scientific computation with ease This easy-to-follow, highly practical guide lets you forget your worries in scientific application development by leveraging big data tools such as Apache Spark, Python, R etc Who This Book Is For This book caters to all developers, students, or educators who want to execute code, see output, and comment all in the same document, in the browser. Data science professionals will also find this book very useful to perform technical and scientific computing in a graphical, agile manner. What You Will Learn Install and run the Jupyter Notebook system on your machine Implement programming languages such as R, Python, Julia, and JavaScript with Jupyter Notebook Use interactive widgets to manipulate and visualize data in real time Start sharing your Notebook with colleagues Invite your colleagues to work with you in the same Notebook Organize your Notebook using Jupyter namespaces Access big data in Jupyter In Detail Jupyter Notebook is a web-based environment that enables interactive computing in notebook documents. It allows you to create and share documents that contain live code, equations, visualizations, and explanatory text. The Jupyter Notebook system is extensively used in domains such as data cleaning and transformation, numerical simulation, statistical modeling, machine learning, and much more. This book starts with a detailed overview of the Jupyter Notebook system and its installation in different environments. Next we'll help you will learn to integrate Jupyter system with different programming languages such as R, Python, JavaScript, and Julia and explore the various versions and packages that are compatible with the Notebook system. Moving ahead, you master interactive widgets, namespaces, and working with Jupyter in a multiuser mode. Towards the end, you will use Jupyter with a big data set and will apply all the functionalities learned throughout the book. Style and approach This comprehensive practical guide will teach you how to work with the Jupyter Notebook system. It demonstrates the integration of various programming languages with Jupyter Notebook through hands-on examples in every chapter.

Learning Jupyter 5

Explore interactive computing using Python, Java, JavaScript, R, Julia, and JupyterLab, 2nd Edition

Author: Dan Toomey

Publisher: Packt Publishing Ltd

ISBN: 1789137446

Category: Computers

Page: 282

View: 5348

Create and share livecode, equations, visualizations, and explanatory text, in both a single document and a web browser with Jupyter Key Features Learn how to use Jupyter 5.x features such as cell tagging and attractive table styles Leverage big data tools and datasets with different Python packages Explore multiple-user Jupyter Notebook servers Book Description The Jupyter Notebook allows you to create and share documents that contain live code, equations, visualizations, and explanatory text. The Jupyter Notebook system is extensively used in domains such as data cleaning and transformation, numerical simulation, statistical modeling, and machine learning. Learning Jupyter 5 will help you get to grips with interactive computing using real-world examples. The book starts with a detailed overview of the Jupyter Notebook system and its installation in different environments. Next, you will learn to integrate the Jupyter system with different programming languages such as R, Python, Java, JavaScript, and Julia, and explore various versions and packages that are compatible with the Notebook system. Moving ahead, you will master interactive widgets and namespaces and work with Jupyter in a multi-user mode. By the end of this book, you will have used Jupyter with a big dataset and be able to apply all the functionalities you’ve explored throughout the book. You will also have learned all about the Jupyter Notebook and be able to start performing data transformation, numerical simulation, and data visualization. What you will learn Install and run the Jupyter Notebook system on your machine Implement programming languages such as R, Python, Julia, and JavaScript with the Jupyter Notebook Use interactive widgets to manipulate and visualize data in real time Start sharing your Notebook with colleagues Invite your colleagues to work with you on the same Notebook Organize your Notebook using Jupyter namespaces Access big data in Jupyter for dealing with large datasets using Spark Who this book is for Learning Jupyter 5 is for developers, data scientists, machine learning users, and anyone working on data analysis or data science projects across different teams. Data science professionals will also find this book useful for performing technical and scientific computing collaboratively.

Python kurz & gut

Author: Mark Lutz

Publisher: O'Reilly Germany

ISBN: 3955617718

Category: Computers

Page: 280

View: 2657

Die objektorientierte Sprache Python eignet sich hervorragend zum Schreiben von Skripten, Programmen und Prototypen. Sie ist frei verfügbar, leicht zu lernen und zwischen allen wichtigen Plattformen portabel, einschließlich Linux, Unix, Windows und Mac OS. Damit Sie im Programmieralltag immer den Überblick behalten, sind die verschiedenen Sprachmerkmale und Elemente in Python – kurz & gut übersichtlich zusammengestellt. Für Auflage 5 wurde die Referenz komplett überarbeitet, erweitert und auf den neuesten Stand gebracht, so dass sie die beiden aktuellen Versionen 2.7 und 3.4 berücksichtigt. Python – kurz & gut behandelt unter anderem: Eingebaute Typen wie Zahlen, Listen, Dictionarys u.v.a.; nweisungen und Syntax für Entwicklung und Ausführung von Objekten; Die objektorientierten Entwicklungstools in Python; Eingebaute Funktionen, Ausnahmen und Attribute; pezielle Methoden zur Operatorenüberladung; Weithin benutzte Standardbibliotheksmodule und Erweiterungen; Kommandozeilenoptionen und Entwicklungswerkzeuge. Mark Lutz stieg 1992 in die Python-Szene ein und ist seitdem als aktiver Pythonista bekannt. Er gibt Kurse, hat zahlreiche Bücher geschrieben und mehrere Python-Systeme programmiert.

Data Science mit Python

Das Handbuch für den Einsatz von IPython, Jupyter, NumPy, Pandas, Matplotlib und Scikit-Learn

Author: Jake VanderPlas

Publisher: MITP-Verlags GmbH & Co. KG

ISBN: 3958456979

Category: Computers

Page: 552

View: 5822

Die wichtigsten Tools für die Datenanalyse und-bearbeitung im praktischen Einsatz Python effizient für datenintensive Berechnungen einsetzen mit IPython und Jupyter Laden, Speichern und Bearbeiten von Daten und numerischen Arrays mit NumPy und Pandas Visualisierung von Daten mit Matplotlib Python ist für viele die erste Wahl für Data Science, weil eine Vielzahl von Ressourcen und Bibliotheken zum Speichern, Bearbeiten und Auswerten von Daten verfügbar ist. In diesem Buch erläutert der Autor den Einsatz der wichtigsten Tools. Für Datenanalytiker und Wissenschaftler ist dieses umfassende Handbuch von unschätzbarem Wert für jede Art von Berechnung mit Python sowie bei der Erledigung alltäglicher Aufgaben. Dazu gehören das Bearbeiten, Umwandeln und Bereinigen von Daten, die Visualisierung verschiedener Datentypen und die Nutzung von Daten zum Erstellen von Statistiken oder Machine-Learning-Modellen. Dieses Handbuch erläutert die Verwendung der folgenden Tools: ● IPython und Jupyter für datenintensive Berechnungen ● NumPy und Pandas zum effizienten Speichern und Bearbeiten von Daten und Datenarrays in Python ● Matplotlib für vielfältige Möglichkeiten der Visualisierung von Daten ● Scikit-Learn zur effizienten und sauberen Implementierung der wichtigsten und am meisten verbreiteten Algorithmen des Machine Learnings Der Autor zeigt Ihnen, wie Sie die zum Betreiben von Data Science verfügbaren Pakete nutzen, um Daten effektiv zu speichern, zu handhaben und Einblick in diese Daten zu gewinnen. Grundlegende Kenntnisse in Python werden dabei vorausgesetzt. Leserstimme zum Buch: »Wenn Sie Data Science mit Python betreiben möchten, ist dieses Buch ein hervorragender Ausgangspunkt. Ich habe es sehr erfolgreich beim Unterrichten von Informatik- und Statistikstudenten eingesetzt. Jake geht weit über die Grundlagen der Open-Source-Tools hinaus und erläutert die grundlegenden Konzepte, Vorgehensweisen und Abstraktionen in klarer Sprache und mit verständlichen Erklärungen.« – Brian Granger, Physikprofessor, California Polytechnic State University, Mitbegründer des Jupyter-Projekts

Datenanalyse mit Python

Auswertung von Daten mit Pandas, NumPy und IPython

Author: Wes McKinney

Publisher: O'Reilly

ISBN: 3960102143

Category: Computers

Page: 542

View: 5160

Erfahren Sie alles über das Manipulieren, Bereinigen, Verarbeiten und Aufbereiten von Datensätzen mit Python: Aktualisiert auf Python 3.6, zeigt Ihnen dieses konsequent praxisbezogene Buch anhand konkreter Fallbeispiele, wie Sie eine Vielzahl von typischen Datenanalyse-Problemen effektiv lösen. Gleichzeitig lernen Sie die neuesten Versionen von pandas, NumPy, IPython und Jupyter kennen.Geschrieben von Wes McKinney, dem Begründer des pandas-Projekts, bietet Datenanalyse mit Python einen praktischen Einstieg in die Data-Science-Tools von Python. Das Buch eignet sich sowohl für Datenanalysten, für die Python Neuland ist, als auch für Python-Programmierer, die sich in Data Science und Scientific Computing einarbeiten wollen. Daten und zugehöriges Material des Buchs sind auf GitHub verfügbar.Aus dem Inhalt:Nutzen Sie die IPython-Shell und Jupyter Notebook für das explorative ComputingLernen Sie Grundfunktionen und fortgeschrittene Features von NumPy kennenSetzen Sie die Datenanalyse-Tools der pandasBibliothek einVerwenden Sie flexible Werkzeuge zum Laden, Bereinigen, Transformieren, Zusammenführen und Umformen von DatenErstellen Sie interformative Visualisierungen mit matplotlibWenden Sie die GroupBy-Mechanismen von pandas an, um Datensätzen zurechtzuschneiden, umzugestalten und zusammenzufassenAnalysieren und manipulieren Sie verschiedenste Zeitreihen-DatenFür diese aktualisierte 2. Auflage wurde der gesamte Code an Python 3.6 und die neuesten Versionen der pandas-Bibliothek angepasst. Neu in dieser Auflage: Informationen zu fortgeschrittenen pandas-Tools sowie eine kurze Einführung in statsmodels und scikit-learn.

Deep Learning with Python, Francois Chollet, 2018

Python

Author: manning Publications, Co

Publisher: Bukupedia

ISBN: N.A

Category: Computers

Page: 386

View: 6914

If you’ve picked up this book, you’re probably aware of the extraordinary progress that deep learning has represented for the field of artificial intelligence in the recent past. In a mere five years, we’ve gone from near-unusable image recognition and speech transcription, to superhuman performance on these tasks. The consequences of this sudden progress extend to almost every industry. But in order to begin deploying deep-learning technology to every problem that it could solve, we need to make it accessible to as many people as possible, including nonexperts— people who aren’t researchers or graduate students. For deep learning to reach its full potential, we need to radically democratize it. When I released the first version of the Keras deep-learning framework in March 2015, the democratization of AI wasn’t what I had in mind. I had been doing research in machine learning for several years, and had built Keras to help me with my own experiments. But throughout 2015 and 2016, tens of thousands of new people entered the field of deep learning; many of them picked up Keras because it was—and still is—the easiest framework to get started with. As I watched scores of newcomers use Keras in unexpected, powerful ways, I came to care deeply about the accessibility and democratization of AI. I realized that the further we spread these technologies, the more useful and valuable they become. Accessibility quickly became an explicit goal in the development of Keras, and over a few short years, the Keras developer community has made fantastic achievements on this front. We’ve put deep learning into the hands of tens of thousands of people, who in turn are using it to solve important problems we didn’t even know existed until recently. The book you’re holding is another step on the way to making deep learning available to as many people as possible. Keras had always needed a companion course to Licensed to xiv PREFACE simultaneously cover fundamentals of deep learning, Keras usage patterns, and deeplearning best practices. This book is my best effort to produce such a course. I wrote it with a focus on making the concepts behind deep learning, and their implementation, as approachable as possible. Doing so didn’t require me to dumb down anything— I strongly believe that there are no difficult ideas in deep learning. I hope you’ll find this book valuable and that it will enable you to begin building intelligent applications and solve the problems that matter to you. Licensed to

Applied Deep Learning with Python

Use scikit-learn, TensorFlow, and Keras to create intelligent systems and machine learning solutions

Author: Alex Galea,Luis Capelo

Publisher: Packt Publishing Ltd

ISBN: 1789806992

Category: Computers

Page: 334

View: 9778

A hands-on guide to deep learning that’s filled with intuitive explanations and engaging practical examples Key Features Designed to iteratively develop the skills of Python users who don’t have a data science background Covers the key foundational concepts you’ll need to know when building deep learning systems Full of step-by-step exercises and activities to help build the skills that you need for the real-world Book Description Taking an approach that uses the latest developments in the Python ecosystem, you’ll first be guided through the Jupyter ecosystem, key visualization libraries and powerful data sanitization techniques before we train our first predictive model. We’ll explore a variety of approaches to classification like support vector networks, random decision forests and k-nearest neighbours to build out your understanding before we move into more complex territory. It’s okay if these terms seem overwhelming; we’ll show you how to put them to work. We’ll build upon our classification coverage by taking a quick look at ethical web scraping and interactive visualizations to help you professionally gather and present your analysis. It’s after this that we start building out our keystone deep learning application, one that aims to predict the future price of Bitcoin based on historical public data. By guiding you through a trained neural network, we’ll explore common deep learning network architectures (convolutional, recurrent, generative adversarial) and branch out into deep reinforcement learning before we dive into model optimization and evaluation. We’ll do all of this whilst working on a production-ready web application that combines Tensorflow and Keras to produce a meaningful user-friendly result, leaving you with all the skills you need to tackle and develop your own real-world deep learning projects confidently and effectively. What you will learn Discover how you can assemble and clean your very own datasets Develop a tailored machine learning classification strategy Build, train and enhance your own models to solve unique problems Work with production-ready frameworks like Tensorflow and Keras Explain how neural networks operate in clear and simple terms Understand how to deploy your predictions to the web Who this book is for If you're a Python programmer stepping into the world of data science, this is the ideal way to get started.

Jupyter for Data Science

Exploratory analysis, statistical modeling, machine learning, and data visualization with Jupyter

Author: Dan Toomey

Publisher: Packt Publishing Ltd

ISBN: 1785883291

Category: Computers

Page: 242

View: 814

Your one-stop guide to building an efficient data science pipeline using Jupyter About This Book Get the most out of your Jupyter notebook to complete the trickiest of tasks in Data Science Learn all the tasks in the data science pipeline—from data acquisition to visualization—and implement them using Jupyter Get ahead of the curve by mastering all the applications of Jupyter for data science with this unique and intuitive guide Who This Book Is For This book targets students and professionals who wish to master the use of Jupyter to perform a variety of data science tasks. Some programming experience with R or Python, and some basic understanding of Jupyter, is all you need to get started with this book. What You Will Learn Understand why Jupyter notebooks are a perfect fit for your data science tasks Perform scientific computing and data analysis tasks with Jupyter Interpret and explore different kinds of data visually with charts, histograms, and more Extend SQL's capabilities with Jupyter notebooks Combine the power of R and Python 3 with Jupyter to create dynamic notebooks Create interactive dashboards and dynamic presentations Master the best coding practices and deploy your Jupyter notebooks efficiently In Detail Jupyter Notebook is a web-based environment that enables interactive computing in notebook documents. It allows you to create documents that contain live code, equations, and visualizations. This book is a comprehensive guide to getting started with data science using the popular Jupyter notebook. If you are familiar with Jupyter notebook and want to learn how to use its capabilities to perform various data science tasks, this is the book for you! From data exploration to visualization, this book will take you through every step of the way in implementing an effective data science pipeline using Jupyter. You will also see how you can utilize Jupyter's features to share your documents and codes with your colleagues. The book also explains how Python 3, R, and Julia can be integrated with Jupyter for various data science tasks. By the end of this book, you will comfortably leverage the power of Jupyter to perform various tasks in data science successfully. Style and approach This book is a perfect blend of concepts and practical examples, written in a way that is very easy to understand and implement. It follows a logical flow where you will be able to build on your understanding of the different Jupyter features with every chapter.

Praxiseinstieg Deep Learning

Mit Python, Caffe, TensorFlow und Spark eigene Deep-Learning-Anwendungen erstellen

Author: Ramon Wartala

Publisher: O'Reilly

ISBN: 3960101570

Category: Computers

Page: 226

View: 8800

Deep Learning ist ein Teilbereich des Machine Learning und basiert auf künstlichen neuronalen Netzen. Dieser praktische Leitfaden bietet einen schnellen Einstieg in die Schlüsseltechnologie und erschließt Grundlagen und Arbeitsweisen von Deep Learning. Anhand Python-basierter Beispielanwendungen wird der Umgang mit den Frameworks Caffe/Caffe2 und TensorFlow gezeigt. Einfache, alltagstaugliche Beispiele laden zum Nachprogrammieren ein. Darüber hinaus erfahren Sie, warum moderne Grafikkarten, Big Data und Cloud Computing beim Deep Learning so wichtig sind. Wenn Sie bereits mit Python, NumPy und matplotlib arbeiten, ermöglicht Ihnen dieses Buch, praktische Erfahrungen mit Deep-Learning-Anwendungen zu machen. Deep Learning – die Hintergründe - Lernmethoden, die Deep Learning zugrunde liegen - Aktuelle Anwendungsfelder wie maschinelle Übersetzungen, Sprach- und Bilderkennung bei Google, Facebook, IBM oder Amazon Der Werkzeugkasten mit Docker - Der Docker-Container zum Buch: Alle nötigen Tools und Programme sind bereits installiert, damit Sie die Beispiele des Buchs und eigene Deep-Learning-Anwendungen leicht ausführen können. - Die Arbeitsumgebung kennenlernen: Jupyter Notebook, Beispieldatensätze, Web Scraping Der Praxiseinstieg - Einführung in Caffe/Caffe2 und TensorFlow - Deep-Learning-Anwendungen nachprogrammieren: Handschrifterkennung, Bilderkennung und -klassifizierung, Deep Dreaming - Lösungen für Big-Data-Szenarien: verteilte Anwendungen, Spark, Cloud-Systeme - Modelle in produktive Systeme überführen

Learning IPython for Interactive Computing and Data Visualization

Author: Cyrille Rossant

Publisher: Packt Publishing Ltd

ISBN: 1783986999

Category: Computers

Page: 200

View: 4174

Get started with Python for data analysis and numerical computing in the Jupyter notebook About This Book Learn the basics of Python in the Jupyter Notebook Analyze and visualize data with pandas, NumPy, matplotlib, and seaborn Perform highly-efficient numerical computations with Numba, Cython, and ipyparallel Who This Book Is For This book targets students, teachers, researchers, engineers, analysts, journalists, hobbyists, and all data enthusiasts who are interested in analyzing and visualizing real-world datasets. If you are new to programming and data analysis, this book is exactly for you. If you're already familiar with another language or analysis software, you will also appreciate this introduction to the Python data analysis platform. Finally, there are more technical topics for advanced readers. No prior experience is required; this book contains everything you need to know. What You Will Learn Install Anaconda and code in Python in the Jupyter Notebook Load and explore datasets interactively Perform complex data manipulations effectively with pandas Create engaging data visualizations with matplotlib and seaborn Simulate mathematical models with NumPy Visualize and process images interactively in the Jupyter Notebook with scikit-image Accelerate your code with Numba, Cython, and IPython.parallel Extend the Notebook interface with HTML, JavaScript, and D3 In Detail Python is a user-friendly and powerful programming language. IPython offers a convenient interface to the language and its analysis libraries, while the Jupyter Notebook is a rich environment well-adapted to data science and visualization. Together, these open source tools are widely used by beginners and experts around the world, and in a huge variety of fields and endeavors. This book is a beginner-friendly guide to the Python data analysis platform. After an introduction to the Python language, IPython, and the Jupyter Notebook, you will learn how to analyze and visualize data on real-world examples, how to create graphical user interfaces for image processing in the Notebook, and how to perform fast numerical computations for scientific simulations with NumPy, Numba, Cython, and ipyparallel. By the end of this book, you will be able to perform in-depth analyses of all sorts of data. Style and approach This is a hands-on beginner-friendly guide to analyze and visualize data on real-world examples with Python and the Jupyter Notebook.

Beginning Data Science with Python and Jupyter

Use powerful industry-standard tools within Jupyter and the Python ecosystem to unlock new, actionable insights from your data

Author: Alex Galea

Publisher: Packt Publishing Ltd

ISBN: 1789534658

Category: Computers

Page: 194

View: 816

Getting started with data science doesn't have to be an uphill battle. This step-by-step guide is ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction. Key Features Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts like SVM, KNN classifiers and Random Forests Discover how you can use web scraping to gather and parse your own bespoke datasets Book Description Get to grips with the skills you need for entry-level data science in this hands-on Python and Jupyter course. You'll learn about some of the most commonly used libraries that are part of the Anaconda distribution, and then explore machine learning models with real datasets to give you the skills and exposure you need for the real world. We'll finish up by showing you how easy it can be to scrape and gather your own data from the open web, so that you can apply your new skills in an actionable context. What you will learn Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts like SVM, KNN classifiers, and Random Forests Plan a machine learning classification strategy and train classification, models Use validation curves and dimensionality reduction to tune and enhance your models Discover how you can use web scraping to gather and parse your own bespoke datasets Scrape tabular data from web pages and transform them into Pandas DataFrames Create interactive, web-friendly visualizations to clearly communicate your findings Who this book is for This book is ideal for professionals with a variety of job descriptions across large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, with any prior work with libraries like Pandas, Matplotlib and Pandas providing you a useful head start.

Die C++-Programmiersprache

Author: Bjarne Stroustrup

Publisher: Pearson Deutschland GmbH

ISBN: 9783827316608

Category: C+

Page: 1068

View: 7149

IPython Interactive Computing and Visualization Cookbook, Second Edition

Over 100 hands-on recipes to sharpen your skills in high-performance numerical computing and data science in the Jupyter Notebook

Author: Cyrille Rossant

Publisher: Packt Publishing Ltd

ISBN: 1785881930

Category: Computers

Page: 548

View: 537

Learn to use IPython and Jupyter Notebook for your data analysis and visualization work. Key Features Leverage the Jupyter Notebook for interactive data science and visualization Become an expert in high-performance computing and visualization for data analysis and scientific modeling A comprehensive coverage of scientific computing through many hands-on, example-driven recipes with detailed, step-by-step explanations Book Description Python is one of the leading open source platforms for data science and numerical computing. IPython and the associated Jupyter Notebook offer efficient interfaces to Python for data analysis and interactive visualization, and they constitute an ideal gateway to the platform. IPython Interactive Computing and Visualization Cookbook, Second Edition contains many ready-to-use, focused recipes for high-performance scientific computing and data analysis, from the latest IPython/Jupyter features to the most advanced tricks, to help you write better and faster code. You will apply these state-of-the-art methods to various real-world examples, illustrating topics in applied mathematics, scientific modeling, and machine learning. The first part of the book covers programming techniques: code quality and reproducibility, code optimization, high-performance computing through just-in-time compilation, parallel computing, and graphics card programming. The second part tackles data science, statistics, machine learning, signal and image processing, dynamical systems, and pure and applied mathematics. What you will learn Master all features of the Jupyter Notebook Code better: write high-quality, readable, and well-tested programs; profile and optimize your code; and conduct reproducible interactive computing experiments Visualize data and create interactive plots in the Jupyter Notebook Write blazingly fast Python programs with NumPy, ctypes, Numba, Cython, OpenMP, GPU programming (CUDA), parallel IPython, Dask, and more Analyze data with Bayesian or frequentist statistics (Pandas, PyMC, and R), and learn from actual data through machine learning (scikit-learn) Gain valuable insights into signals, images, and sounds with SciPy, scikit-image, and OpenCV Simulate deterministic and stochastic dynamical systems in Python Familiarize yourself with math in Python using SymPy and Sage: algebra, analysis, logic, graphs, geometry, and probability theory Who this book is for This book is intended for anyone interested in numerical computing and data science: students, researchers, teachers, engineers, analysts, and hobbyists. A basic knowledge of Python/NumPy is recommended. Some skills in mathematics will help you understand the theory behind the computational methods.

Statistik-Workshop für Programmierer

Author: Allen B. Downey

Publisher: O'Reilly Germany

ISBN: 3868993436

Category: Computers

Page: 160

View: 7155

Wenn Sie programmieren können, beherrschen Sie bereits Techniken, um aus Daten Wissen zu extrahieren. Diese kompakte Einführung in die Statistik zeigt Ihnen, wie Sie rechnergestützt, anstatt auf mathematischem Weg Datenanalysen mit Python durchführen können. Praktischer Programmier-Workshop statt grauer Theorie: Das Buch führt Sie anhand eines durchgängigen Fallbeispiels durch eine vollständige Datenanalyse -- von der Datensammlung über die Berechnung statistischer Kennwerte und Identifikation von Mustern bis hin zum Testen statistischer Hypothesen. Gleichzeitig werden Sie mit statistischen Verteilungen, den Regeln der Wahrscheinlichkeitsrechnung, Visualisierungsmöglichkeiten und vielen anderen Arbeitstechniken und Konzepten vertraut gemacht. Statistik-Konzepte zum Ausprobieren: Entwickeln Sie über das Schreiben und Testen von Code ein Verständnis für die Grundlagen von Wahrscheinlichkeitsrechnung und Statistik: Überprüfen Sie das Verhalten statistischer Merkmale durch Zufallsexperimente, zum Beispiel indem Sie Stichproben aus unterschiedlichen Verteilungen ziehen. Nutzen Sie Simulationen, um Konzepte zu verstehen, die auf mathematischem Weg nur schwer zugänglich sind. Lernen Sie etwas über Themen, die in Einführungen üblicherweise nicht vermittelt werden, beispielsweise über die Bayessche Schätzung. Nutzen Sie Python zur Bereinigung und Aufbereitung von Rohdaten aus nahezu beliebigen Quellen. Beantworten Sie mit den Mitteln der Inferenzstatistik Fragestellungen zu realen Daten.

Applied Data Science with Python and Jupyter

Use powerful industry-standard tools to unlock new, actionable insights from your data

Author: Alex Galea

Publisher: Packt Publishing Ltd

ISBN: 1789951925

Category: Computers

Page: 192

View: 1969

Become the master player of data exploration by creating reproducible data processing pipelines, visualizations, and prediction models for your applications. Key Features Get up and running with the Jupyter ecosystem and some example datasets Learn about key machine learning concepts such as SVM, KNN classifiers, and Random Forests Discover how you can use web scraping to gather and parse your own bespoke datasets Book Description Getting started with data science doesn't have to be an uphill battle. Applied Data Science with Python and Jupyter is a step-by-step guide ideal for beginners who know a little Python and are looking for a quick, fast-paced introduction to these concepts. In this book, you'll learn every aspect of the standard data workflow process, including collecting, cleaning, investigating, visualizing, and modeling data. You'll start with the basics of Jupyter, which will be the backbone of the book. After familiarizing ourselves with its standard features, you'll look at an example of it in practice with our first analysis. In the next lesson, you dive right into predictive analytics, where multiple classification algorithms are implemented. Finally, the book ends by looking at data collection techniques. You'll see how web data can be acquired with scraping techniques and via APIs, and then briefly explore interactive visualizations. What you will learn Get up and running with the Jupyter ecosystem Identify potential areas of investigation and perform exploratory data analysis Plan a machine learning classification strategy and train classification models Use validation curves and dimensionality reduction to tune and enhance your models Scrape tabular data from web pages and transform it into Pandas DataFrames Create interactive, web-friendly visualizations to clearly communicate your findings Who this book is for Applied Data Science with Python and Jupyter is ideal for professionals with a variety of job descriptions across a large range of industries, given the rising popularity and accessibility of data science. You'll need some prior experience with Python, with any prior work with libraries such as Pandas, Matplotlib, and Pandas providing you a useful head start.

Docker for Data Science

Building Scalable and Extensible Data Infrastructure Around the Jupyter Notebook Server

Author: Joshua Cook

Publisher: Apress

ISBN: 1484230124

Category: Computers

Page: 257

View: 6924

Learn Docker "infrastructure as code" technology to define a system for performing standard but non-trivial data tasks on medium- to large-scale data sets, using Jupyter as the master controller. It is not uncommon for a real-world data set to fail to be easily managed. The set may not fit well into access memory or may require prohibitively long processing. These are significant challenges to skilled software engineers and they can render the standard Jupyter system unusable. As a solution to this problem, Docker for Data Science proposes using Docker. You will learn how to use existing pre-compiled public images created by the major open-source technologies—Python, Jupyter, Postgres—as well as using the Dockerfile to extend these images to suit your specific purposes. The Docker-Compose technology is examined and you will learn how it can be used to build a linked system with Python churning data behind the scenes and Jupyter managing these background tasks. Best practices in using existing images are explored as well as developing your own images to deploy state-of-the-art machine learning and optimization algorithms. What You'll Learn Master interactive development using the Jupyter platform Run and build Docker containers from scratch and from publicly available open-source images Write infrastructure as code using the docker-compose tool and its docker-compose.yml file type Deploy a multi-service data science application across a cloud-based system Who This Book Is For Data scientists, machine learning engineers, artificial intelligence researchers, Kagglers, and software developers

Mastering Geospatial Analysis with Python

Explore GIS processing and learn to work with GeoDjango, CARTOframes and MapboxGL-Jupyter

Author: Silas Toms,Paul Crickard,Eric van Rees

Publisher: Packt Publishing Ltd

ISBN: 1788293819

Category: Computers

Page: 440

View: 3713

Explore GIS processing and learn to work with various tools and libraries in Python. Key Features Analyze and process geospatial data using Python libraries such as; Anaconda, GeoPandas Leverage new ArcGIS API to process geospatial data for the cloud. Explore various Python geospatial web and machine learning frameworks. Book Description Python comes with a host of open source libraries and tools that help you work on professional geoprocessing tasks without investing in expensive tools. This book will introduce Python developers, both new and experienced, to a variety of new code libraries that have been developed to perform geospatial analysis, statistical analysis, and data management. This book will use examples and code snippets that will help explain how Python 3 differs from Python 2, and how these new code libraries can be used to solve age-old problems in geospatial analysis. You will begin by understanding what geoprocessing is and explore the tools and libraries that Python 3 offers. You will then learn to use Python code libraries to read and write geospatial data. You will then learn to perform geospatial queries within databases and learn PyQGIS to automate analysis within the QGIS mapping suite. Moving forward, you will explore the newly released ArcGIS API for Python and ArcGIS Online to perform geospatial analysis and create ArcGIS Online web maps. Further, you will deep dive into Python Geospatial web frameworks and learn to create a geospatial REST API. What you will learn Manage code libraries and abstract geospatial analysis techniques using Python 3. Explore popular code libraries that perform specific tasks for geospatial analysis. Utilize code libraries for data conversion, data management, web maps, and REST API creation. Learn techniques related to processing geospatial data in the cloud. Leverage features of Python 3 with geospatial databases such as PostGIS, SQL Server, and SpatiaLite. Who this book is for The audience for this book includes students, developers, and geospatial professionals who need a reference book that covers GIS data management, analysis, and automation techniques with code libraries built in Python 3.

Mastering Exploratory Analysis with pandas

Build an end-to-end data analysis workflow with Python

Author: Harish Garg

Publisher: Packt Publishing Ltd

ISBN: 178961547X

Category: Computers

Page: 140

View: 733

Explore Python frameworks like pandas, Jupyter notebooks, and Matplotlib to build data pipelines and data visualization Key Features Learn to set up data analysis pipelines with pandas and Jupyter notebooks Effective techniques for data selection, manipulation, and visualization Introduction to Matplotlib for interactive data visualization using charts and plots Book Description The pandas is a Python library that lets you manipulate, transform, and analyze data. It is a popular framework for exploratory data visualization and analyzing datasets and data pipelines based on their properties. This book will be your practical guide to exploring datasets using pandas. You will start by setting up Python, pandas, and Jupyter Notebooks. You will learn how to use Jupyter Notebooks to run Python code. We then show you how to get data into pandas and do some exploratory analysis, before learning how to manipulate and reshape data using pandas methods. You will also learn how to deal with missing data from your datasets, how to draw charts and plots using pandas and Matplotlib, and how to create some effective visualizations for your audience. Finally, you will wrapup your newly gained pandas knowledge by learning how to import data out of pandas into some popular file formats. By the end of this book, you will have a better understanding of exploratory analysis and how to build exploratory data pipelines with Python. What you will learn Learn how to read different kinds of data into pandas DataFrames for data analysis Manipulate, transform, and apply formulas to data imported into pandas DataFrames Use pandas to analyze and visualize different kinds of data to gain real-world insights Extract transformed data form pandas DataFrames and convert it into the formats your application expects Manipulate model time-series data, perform algorithmic trading, derive results on fixed and moving windows, and more Effective data visualization using Matplotlib Who this book is for If you are a budding data scientist looking to learn the popular pandas library, or a Python developer looking to step into the world of data analysis, this book is the ideal resource you need to get started. Some programming experience in Python will be helpful to get the most out of this course

Find eBook