Search Results: R Packages: Organize, Test, Document, and Share Your Code

R Packages

Organize, Test, Document, and Share Your Code

Author: Hadley Wickham

Publisher: "O'Reilly Media, Inc."

ISBN: 1491910542

Category: Computers

Page: 202

View: 6794

Turn your R code into packages that others can easily download and use. This practical book shows you how to bundle reusable R functions, sample data, and documentation together by applying author Hadley Wickham’s package development philosophy. In the process, you’ll work with devtools, roxygen, and testthat, a set of R packages that automate common development tasks. Devtools encapsulates best practices that Hadley has learned from years of working with this programming language. Ideal for developers, data scientists, and programmers with various backgrounds, this book starts you with the basics and shows you how to improve your package writing over time. You’ll learn to focus on what you want your package to do, rather than think about package structure.

- Learn about the most useful components of an R package, including vignettes and unit tests
- Automate anything you can, taking advantage of the years of development experience embodied in devtools
- Get tips on good style, such as organizing functions into files
- Streamline your development process with devtools
- Learn the best way to submit your package to the Comprehensive R Archive Network (CRAN)
- Learn from a well-respected member of the R community who created 30 R packages, including ggplot2, dplyr, and tidyr

Seamless R and C++ Integration with Rcpp

Author: Dirk Eddelbuettel

Publisher: Springer Science & Business Media

ISBN: 146146868X

Category: Computers

Page: 220

View: 8640

Rcpp is the glue that binds the power and versatility of R with the speed and efficiency of C++. With Rcpp, the transfer of data between R and C++ is nearly seamless, and high-performance statistical computing is finally accessible to most R users. Rcpp should be part of every statistician's toolbox. -- Michael Braun, MIT Sloan School of Management

"Seamless R and C++ integration with Rcpp" is simply a wonderful book. For anyone who uses C/C++ and R, it is an indispensable resource. The writing is outstanding. A huge bonus is the section on applications. This section covers the matrix packages Armadillo and Eigen and the GNU Scientific Library as well as RInside which enables you to use R inside C++. These applications are what most of us need to know to really do scientific programming with R and C++. I love this book. -- Robert McCulloch, University of Chicago Booth School of Business

Rcpp is now considered an essential package for anybody doing serious computational research using R. Dirk's book is an excellent companion and takes the reader from a gentle introduction to more advanced applications via numerous examples and efficiency-enhancing gems. The book is packed with all you might have ever wanted to know about Rcpp, its cousins (RcppArmadillo, RcppEigen, etc.), modules, package development and sugar. Overall, this book is a must-have on your shelf. -- Sanjog Misra, UCLA Anderson School of Management

The Rcpp package represents a major leap forward for scientific computations with R. With very few lines of C++ code, one has R's data structures readily at hand for further computations in C++. Hence, high-level numerical programming can be made in C++ almost as easily as in R, but often with a substantial speed gain. Dirk is a crucial person in these developments, and his book takes the reader from the first fragile steps on to using the full Rcpp machinery. A very recommended book! -- Søren Højsgaard, Department of Mathematical Sciences, Aalborg University, Denmark

"Seamless R and C++ Integration with Rcpp" provides the first comprehensive introduction to Rcpp. Rcpp has become the most widely-used language extension for R, and is deployed by over one-hundred different CRAN and BioConductor packages. Rcpp permits users to pass scalars, vectors, matrices, lists, or entire R objects back and forth between R and C++ with ease. This brings the depth of the R analysis framework together with the power, speed, and efficiency of C++.

Dirk Eddelbuettel has been a contributor to CRAN for over a decade and maintains around twenty packages. He is the Debian/Ubuntu maintainer for R and other quantitative software, edits the CRAN Task Views for Finance and High-Performance Computing, is a co-founder of the annual R/Finance conference, and an editor of the Journal of Statistical Software. He holds a Ph.D. in Mathematical Economics from EHESS (Paris), and works in Chicago as a Senior Quantitative Analyst.

Nonparametric Models for Longitudinal Data

With Implementation in R

Author: Colin O. Wu,Xin Tian

Publisher: CRC Press

ISBN: 0429939086

Category: Mathematics

Page: 552

View: 2103

Nonparametric Models for Longitudinal Data with Implementations in R presents a comprehensive summary of major advances in nonparametric models and smoothing methods with longitudinal data. It covers methods, theories, and applications that are particularly useful for biomedical studies in the era of big data and precision medicine. It also provides flexible tools to describe the temporal trends, covariate effects and correlation structures of repeated measurements in longitudinal data. This book is intended for graduate students in statistics, data scientists and statisticians in biomedical sciences and public health. As experts in this area, the authors present extensive materials that are balanced between theoretical and practical topics. The statistical applications in real-life examples lead into meaningful interpretations and inferences.

Features:

- Provides an overview of parametric and semiparametric methods
- Shows smoothing methods for unstructured nonparametric models
- Covers structured nonparametric models with time-varying coefficients
- Discusses nonparametric shared-parameter and mixed-effects models
- Presents nonparametric models for conditional distributions and functionals
- Illustrates implementations using R software packages
- Includes datasets and code on the authors’ website
- Contains asymptotic results and theoretical derivations

Both authors are mathematical statisticians at the National Institutes of Health (NIH) and have published extensively in statistical and biomedical journals. Colin O. Wu earned his Ph.D. in statistics from the University of California, Berkeley (1990), and is also Adjunct Professor at the Georgetown University School of Medicine. He served as Associate Editor for Biometrics and Statistics in Medicine, and as a reviewer for the National Science Foundation, NIH, and the U.S. Department of Veterans Affairs. Xin Tian earned her Ph.D. in statistics from Rutgers, the State University of New Jersey (2003). She has served on various NIH committees and collaborated extensively with clinical researchers.

Efficient R Programming

A Practical Guide to Smarter Programming

Author: Colin Gillespie,Robin Lovelace

Publisher: "O'Reilly Media, Inc."

ISBN: 1491950757

Category: Computers

Page: 222

View: 9928

There are many excellent R resources for visualization, data science, and package development. Hundreds of scattered vignettes, web pages, and forums explain how to use R in particular domains. But little has been written on how to simply make R work effectively, until now. This hands-on book teaches novices and experienced R users how to write efficient R code. Drawing on years of experience teaching R courses, authors Colin Gillespie and Robin Lovelace provide practical advice on a range of topics, from optimizing the set-up of RStudio to leveraging C++, that make this book a useful addition to any R user’s bookshelf. Academics, business users, and programmers from a wide range of backgrounds stand to benefit from the guidance in Efficient R Programming.

- Get advice for setting up an R programming environment
- Explore general programming concepts and R coding techniques
- Understand the ingredients of an efficient R workflow
- Learn how to efficiently read and write data in R
- Dive into data carpentry, the vital skill for cleaning raw data
- Optimize your code with profiling, standard tricks, and other methods
- Determine your hardware capabilities for handling R computation
- Maximize the benefits of collaborative R programming
- Accelerate your transition from R hacker to R programmer

My Bed is a Blackhole

Author: Hadley Wickham

Publisher: Book Guild Publishing

ISBN: 1912362740

Category: Psychology

Page: N.A

View: 1986

My Bed is a Blackhole is a non-autobiographical work of fiction written over a period of 18 months. The central character in the story is a young woman who lives in Fremantle, Western Australia, and attends university to study psychology. She is nameless until the very last line in the book, an intentional decision by the author, as one of the key themes in the book is identity and the struggles that come with it. The book covers a year in the life of the female character and how her relationships and experiences change over the course of time; specifically, the pivotal “make-or-break” point where she is confronted by the true nature of her mental illness and how it has affected her life. The relationships with her parents, siblings, friends and romantic interest are extensively explored, and through them Wickham’s character develops her own identity, particularly in realising the flaws in others and recognising them in herself. Contrasts are made by the female character between the life she remembers (pre-depression), the life that she has, and the life that she hopes to have. A major theme within the book is the female character’s inability to accept her own depression, and this is externalised by the Blackhole. Yet she is possessed by an objectiveness which allows her to understand this simply to allow her to function; she is terrified of her depression and of herself; this is communicated by frequent fourth-wall breaks where she directly addresses the reader.

Hands-On Programming with R

Write Your Own Functions and Simulations

Author: Garrett Grolemund

Publisher: "O'Reilly Media, Inc."

ISBN: 1449359108

Category: Computers

Page: 250

View: 3532

Learn how to program by diving into the R language, and then use your newfound skills to solve practical data science problems. With this book, you’ll learn how to load data, assemble and disassemble data objects, navigate R’s environment system, write your own functions, and use all of R’s programming tools. RStudio Master Instructor Garrett Grolemund not only teaches you how to program, but also shows you how to get more from R than just visualizing and modeling data. You’ll gain valuable programming skills and support your work as a data scientist at the same time.

- Work hands-on with three practical data analysis projects based on casino games
- Store, retrieve, and change data values in your computer’s memory
- Write programs and simulations that outperform those written by typical R users
- Use R programming tools such as if else statements, for loops, and S3 classes
- Learn how to write lightning-fast vectorized R code
- Take advantage of R’s package system and debugging tools
- Practice and apply R programming concepts as you learn them

Software for Data Analysis

Programming with R

Author: John Chambers

Publisher: Springer Science & Business Media

ISBN: 9780387759364

Category: Computers

Page: 500

View: 3597

John Chambers turns his attention to R, the enormously successful open-source system based on the S language. His book guides the reader through programming with R, beginning with simple interactive use and progressing by gradual stages, starting with simple functions. More advanced programming techniques can be added as needed, allowing users to grow into software contributors, benefiting their careers and the community. R packages provide a powerful mechanism for contributions to be organized and communicated. This is the only advanced programming book on R, written by the author of the S language from which R evolved.

Parallel R

Data Analysis in the Distributed World

Author: Q. Ethan McCallum,Stephen Weston

Publisher: "O'Reilly Media, Inc."

ISBN: 1449320333

Category: Computers

Page: 126

View: 9704

It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product, unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You’ll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don’t. With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.

- Snow: works well in a traditional cluster environment
- Multicore: popular for multiprocessor and multicore computers
- Parallel: part of the upcoming R 2.14.0 release
- R+Hadoop: provides low-level access to a popular form of cluster computing
- RHIPE: uses Hadoop’s power with R’s language and interactive shell
- Segue: lets you use Elastic MapReduce as a backend for lapply-style operations

Advanced R

Author: Hadley Wickham

Publisher: CRC Press

ISBN: 1498759807

Category: Mathematics

Page: 476

View: 3292

An Essential Reference for Intermediate and Advanced R Programmers

Advanced R presents useful tools and techniques for attacking many types of R programming problems, helping you avoid mistakes and dead ends. With more than ten years of experience programming in R, the author illustrates the elegance, beauty, and flexibility at the heart of R. The book develops the necessary skills to produce quality code that can be used in a variety of circumstances.

You will learn:

- The fundamentals of R, including standard data types and functions
- Functional programming as a useful framework for solving wide classes of problems
- The positives and negatives of metaprogramming
- How to write fast, memory-efficient code

This book not only helps current R users become R programmers but also shows existing programmers what’s special about R. Intermediate R programmers can dive deeper into R and learn new strategies for solving diverse problems while programmers from other languages can learn the details of R and understand why R works the way it does.

ggplot2

Elegant Graphics for Data Analysis

Author: Hadley Wickham

Publisher: Springer

ISBN: 3319242776

Category: Computers

Page: 260

View: 9947

This new edition of the classic book by ggplot2 creator Hadley Wickham highlights compatibility with knitr and RStudio. ggplot2 is a data visualization package for R that helps users create data graphics, including those that are multi-layered, with ease. With ggplot2, it's easy to:

- produce handsome, publication-quality plots with automatic legends created from the plot specification
- superimpose multiple layers (points, lines, maps, tiles, box plots) from different data sources with automatically adjusted common scales
- add customizable smoothers that use powerful modeling capabilities of R, such as loess, linear models, generalized additive models, and robust regression
- save any ggplot2 plot (or part thereof) for later modification or reuse
- create custom themes that capture in-house or journal style requirements and that can easily be applied to multiple plots
- approach a graph from a visual perspective, thinking about how each component of the data is represented on the final plot

This book will be useful to everyone who has struggled with displaying data in an informative and attractive way. Some basic knowledge of R is necessary (e.g., importing data into R). ggplot2 is a mini-language specifically tailored for producing graphics, and you'll learn everything you need in the book. After reading this book you'll be able to produce graphics customized precisely for your problems, and you'll find it easy to get graphics out of your head and on to the screen or page.

Compositional Data Analysis

Theory and Applications

Author: Vera Pawlowsky-Glahn,Antonella Buccianti

Publisher: John Wiley & Sons

ISBN: 0470711353

Category: Mathematics

Page: 378

View: 577

Compositional Data Analysis: Theory and Applications

Edited by Vera Pawlowsky-Glahn, Department of Computer Science and Applied Mathematics, University of Girona, Spain, and Antonella Buccianti, Department of Earth Sciences, University of Florence, Italy.

It is difficult to imagine that the statistical analysis of compositional data has been a major issue of concern for more than 100 years. It is even more difficult to realize that so many statisticians and users of statistics are unaware of the particular problems affecting compositional data, as well as their solutions. The issue of "spurious correlation", as the situation was phrased by Karl Pearson back in 1897, affects all data that measures parts of some whole, such as percentages, proportions, ppm and ppb. Such measurements are present in all fields of science, ranging from geology, biology, environmental sciences and forensic sciences to medicine and hydrology. This book presents the history and development of compositional data analysis along with Aitchison's log-ratio approach. "Compositional Data Analysis" describes the state of the art both in theoretical fields and in applications in the different fields of science.

Key Features:

- Reflects the state of the art in compositional data analysis.
- Gives an overview of the historical development of compositional data analysis, as well as basic concepts and procedures.
- Looks at advances in algebra and calculus on the simplex.
- Presents applications in different fields of science, including genomics, ecology, biology, geochemistry, planetology, chemistry and economics.
- Explores connections to correspondence analysis and the Dirichlet distribution.
- Presents a summary of three available software packages for compositional data analysis.
- Supported by an accompanying website featuring R code.

Applied scientists working on compositional data analysis in any field of science, both academics and professionals, will benefit from this book, along with graduate students in any field of science working with compositional data.

Testing R Code

Author: Richard Cotton

Publisher: CRC Press

ISBN: 1315350513

Category: Computers

Page: 196

View: 4980

Learn how to write R code with fewer bugs. The problem with programming is that you are always one typo away from writing something silly. Likewise with data analysis, a small mistake in your model can lead to a big mistake in your results. Combining the two disciplines means that it is all too easy for a missed minus sign to generate a false prediction that you don’t spot until it’s too late. Testing is the only way to be sure that your code, and your results, are correct. Testing R Code teaches you how to perform development-time testing using the testthat package, allowing you to ensure that your code works as intended. The book also teaches run-time testing using the assertive package, enabling your users to correctly run your code. After beginning with an introduction to testing in R, the book explores more advanced cases such as integrating tests into R packages, testing code that accesses databases, testing C++ code with Rcpp, and testing graphics. Each topic is explained with real-world examples and has accompanying exercises for readers to practise their skills. Only a small amount of experience with R is needed to get started!

An Introduction to R

Author: William N. Venables,David M. Smith

Publisher: Network Theory.

ISBN: 9780954612085

Category: Computers

Page: 144

View: 2185

This tutorial manual provides an introduction to R, a software package for statistical computing and graphics. This revised and updated edition is based on the April 2009 release of R (version 2.9.0).

Statistical Computing in C++ and R

Author: Randall L. Eubank,Ana Kupresanin

Publisher: CRC Press

ISBN: 1420066501

Category: Mathematics

Page: 556

View: 1437

With the advancement of statistical methodology inextricably linked to the use of computers, new methodological ideas must be translated into usable code and then numerically evaluated relative to competing procedures. In response to this, Statistical Computing in C++ and R concentrates on the writing of code rather than the development and study of numerical algorithms per se. The book discusses code development in C++ and R and the use of these symbiotic languages in unison. It emphasizes that each offers distinct features that, when used in tandem, can take code writing beyond what can be obtained from either language alone. The text begins with some basics of object-oriented languages, followed by a "boot-camp" on the use of C++ and R. The authors then discuss code development for the solution of specific computational problems that are relevant to statistics including optimization, numerical linear algebra, and random number generation. Later chapters introduce abstract data structures (ADTs) and parallel computing concepts. The appendices cover R and UNIX Shell programming.

Features

- Includes numerous student exercises ranging from elementary to challenging
- Integrates both C++ and R for the solution of statistical computing problems
- Uses C++ code in R and R functions in C++ programs
- Provides downloadable programs, available from the authors’ website

The translation of a mathematical problem into its computational analog (or analogs) is a skill that must be learned, like any other, by actively solving relevant problems. The text reveals the basic principles of algorithmic thinking essential to the modern statistician as well as the fundamental skill of communicating with a computer through the use of the computer languages C++ and R. The book lays the foundation for original code development in a research environment.

R Data Science Essentials

Author: Raja B. Koushik,Sharan Kumar Ravindran

Publisher: Packt Publishing Ltd

ISBN: 1785286366

Category: Computers

Page: 154

View: 8489

Learn the essence of data science and visualization using R in no time at all

About This Book

- Become a pro at making stunning visualizations and dashboards quickly and without hassle
- For better decision making in business, apply the R programming language with the help of useful statistical techniques
- From seasoned authors comes a book that offers you a plethora of fast-paced techniques to detect and analyze data patterns

Who This Book Is For

If you are an aspiring data scientist or analyst who has a basic understanding of data science and has basic hands-on experience in R or any other analytics tool, then R Data Science Essentials is the book for you.

What You Will Learn

- Perform data preprocessing and basic operations on data
- Implement visual and non-visual data exploration techniques
- Mine patterns from data using affinity and sequential analysis
- Use different clustering algorithms and visualize them
- Implement logistic and linear regression and find out how to evaluate and improve the performance of an algorithm
- Extract patterns through visualization and build a forecasting algorithm
- Build a recommendation engine using different collaborative filtering algorithms
- Make a stunning visualization and dashboard using ggplot and R shiny

In Detail

With organizations increasingly embedding data science across their enterprise and with management becoming more data-driven, it is an urgent requirement for analysts and managers to understand the key concepts of data science. The data science concepts discussed in this book will help you make key decisions and solve the complex problems you will inevitably face in this new world. R Data Science Essentials will introduce you to various important concepts in the field of data science using R. We start by reading data from multiple sources, then move on to processing the data, extracting hidden patterns, building predictive and forecasting models, building a recommendation engine, and communicating to the user through stunning visualizations and dashboards. By the end of this book, you will have an understanding of some very important techniques in data science, be able to implement them using R, understand and interpret the outcomes, and know how they help businesses make a decision.

Style and approach

This easy-to-follow guide contains hands-on examples of the concepts of data science using R.

Beginning R

An Introduction to Statistical Programming

Author: Larry Pace

Publisher: Apress

ISBN: 1430245557

Category: Computers

Page: 336

View: 761

Beginning R: An Introduction to Statistical Programming is a hands-on book showing how to use the R language, write and save R scripts, build and import data files, and write your own custom statistical functions. R is a powerful open-source implementation of the statistical language S, which was developed by AT&T. R has eclipsed S and the commercially-available S-Plus language, and has become the de facto standard for doing, teaching, and learning computational statistics. R is both an object-oriented language and a functional language that is easy to learn, easy to use, and completely free. A large community of dedicated R users and programmers provides an excellent source of R code, functions, and data sets. R is also being adopted into commercial tools such as Oracle Database. Your investment in learning R is sure to pay off in the long term as R continues to grow into the go-to language for statistical exploration and research.

- Covers the freely-available R language for statistics
- Shows the use of R in specific use cases such as simulations, discrete probability solutions, one-way ANOVA analysis, and more
- Takes a hands-on and example-based approach incorporating best practices with clear explanations of the statistics being done

Automated Data Collection with R

A Practical Guide to Web Scraping and Text Mining

Author: Simon Munzert,Christian Rubba,Peter Meißner,Dominic Nyhuis

Publisher: John Wiley & Sons

ISBN: 111883481X

Category: Computers

Page: 480

View: 2608

"This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences"--

Text Mining with R

A Tidy Approach

Author: Julia Silge,David Robinson

Publisher: "O'Reilly Media, Inc."

ISBN: 1491981628

Category: Computers

Page: 194

View: 1357

Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you’ll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You’ll learn how tidytext and other tidy tools in R can make text analysis easier and more effective. The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You’ll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

- Learn how to apply the tidy text format to NLP
- Use sentiment analysis to mine the emotional content of text
- Identify a document’s most important terms with frequency measurements
- Explore relationships and connections between words with the ggraph and widyr packages
- Convert back and forth between R’s tidy and non-tidy text formats
- Use topic modeling to classify document collections into natural groups
- Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages

Applied Asymptotics

Case Studies in Small-Sample Statistics

Author: A. R. Brazzale,A. C. Davison,N. Reid

Publisher: Cambridge University Press

ISBN: 9780521847032

Category: Business & Economics

Page: 236

View: 1308

First practical treatment of small-sample asymptotics, enabling practitioners to apply new methods with confidence.

Mixed-Effects Models in S and S-PLUS

Author: José C. Pinheiro,Douglas Bates

Publisher: Springer Science & Business Media

ISBN: 1441903178

Category: Computers

Page: 530

View: 7730

R, linear models, random, fixed, data, analysis, fit.
