Basic Statistics Using R for Crime Analysis
Jaeyoung Choi, West Chester University
Copyright Year:
Publisher: The Pennsylvania Alliance for Design of Open Textbooks (PA-ADOPT)
Language: English
Formats Available
Conditions of Use
Attribution-ShareAlike
CC BY-SA
Reviews
Choi's "Basic Statistics Using R for Crime Analysis" offers a comprehensive review of basic statistical procedures to analyze crime data through use of the R statistical programming language (R-Studio). The text begins with a user-friendly section... read more
Choi's "Basic Statistics Using R for Crime Analysis" offers a comprehensive review of basic statistical procedures to analyze crime data through use of the R statistical programming language (R-Studio). The text begins with a user-friendly section on the install of base R and R-Studio and then progresses into the statistical analysis of 2012 General Social Survey (GSS) data. Topics range from basic descriptive statistics, graphical analysis, and inferential statistics. The inferential statistics covered include ANOVA and Simple Linear Regression (single output factor).
Choi's latest textbook offers an accurate and up to date view of R and popular packages available for R-Studio. A thorough review concluded the accuracy of both the R-code and its relevancy to this criminal justice topic. This recent publication is noted to be written for future crime analysts who may be studying at the undergraduate level, however I would offer that this text is appropriate and value-add for certificate and graduate level students as well. The code and lessons are built such that data from any era may be imported and analyzed- a signature feature of this text.
Choi's "Basic Statistics Using R for Crime Analysis" is a fantastic primer to non-programmers and programmers alike. The approachable nature of this text allows for any level of R student to gain immediate and relevant experience in both the coding language and the subject matter itself. As an added bonus, the text makes use of the popular "Haven" package which allows users of other statistical packages such as Minitab or SPSS to analyze their datasets interchangeably; a bonus for experienced statisticians and analysts looking to make the jump to the open-source R code.
Choi's writing style and accompanying R-code is well received and clear. The learning outcomes of each chapter are clear and the R code is presented with annotations to ensure the reader is able to easily follow along (bonus points for the youtube video link on how to set the working directory; a common and painful failure mode for new R users). One improvement Choi may consider for future versions is to include the visual outputs of the R-code such as histograms, box plots, etc. generated throughout the text.
Choi's "Basic Statistics Using R for Crime Analysis" is written with consistent and modern R package code and is presented in a clear and incrementally building fashion. Students will likely have the most success following the text's chapters sequentially, unless they have prior experience in R. If that is the case, this text becomes and invaluable desktop guide for basic statistical operations within R for any subject matter.
Based on the structure of the text, Choi's "Basic Statistics Using R for Crime Analysis" may be best suited for modular use for those with an existing familiarity to R programming or statistical computing. This text can easily operate as a standalone desktop reference for basic statistical analysis as well as common R functions such as importing, tidying, and exporting data. This text may present a slight challenge for those without a basic statistical understanding or prior training; this may be also used as a companion/addition for a intro to statistics course in some cases.
Choi's text presents in logical, clear, and building fashion. R code blocks are appropriately shaded in gray to guide the reader. Section headers are well worded and easily identifiable by their bold and red font.
As reviewed the text was a single .pdf file suitable for browsing on a screen or printing. There were no interface issues detected.
No grammatical issues were detected.
No culturally insensitive or offensive material was detected in this text. GSS data is coded (0, 1, 2, ...) which can be used instead of renamed variables if desired.
Excellent book from Dr. Choi.
This OER textbook is a cohesive yet concise book that introduces key topics that you would expect to see in an applied introductory statistics course: Graphs, Descriptive Summary Statistics, Null Hypothesis Significance Testing, Analysis of... read more
This OER textbook is a cohesive yet concise book that introduces key topics that you would expect to see in an applied introductory statistics course: Graphs, Descriptive Summary Statistics, Null Hypothesis Significance Testing, Analysis of Variance (ANOVA), Correlation, and Linear Regression. In addition to an appropriate scope and sequence, the book has the added feature of being an “easy read.” The concepts are presented in a way that can be grasped by a broad audience, even by readers who do not have a background in criminology. The book is not intended as a theoretical text with derivations of each topic.
Readers are also introduced to a powerful, free, open-source statistics analysis tool called R. Whether a novice or an experienced computer coder, the numeric examples are made more accessible to all. The author not only specifies the location of pre-coded R libraries, they also show examples of how to use the libraries’ built-in functions in problem solving.
Thank you for listing Keywords for searching purposes.
No errors were found in the presentation of concepts.
Data-driven decision making, accessible tech tools (e.g. R), and crime topics are all hot topics in today’s American society. With the strong interweaving of different crime analysis topics in each chapter, future updates may be time-consuming.
With its clearly written, approachable prose, the book can be used for self-study, for review, for a quick reference book, or as a companion to a required statistics book.
A key observation is with the author’s use of Null Hypothesis Significance Testing (NHST). Once the NHST framework is introduced, it is repeated and adapted appropriately for different statistics concepts.
Once the chapter on ANOVA is presented, then the book’s readers can creatively select chapters to research. NHST and ANOVA form the foundations for the remaining chapters.
The topics are presented in an order such that knowledge of topics builds upon each other.
The outline view of the text (pdf) enables easy navigation throughout the textbook. Also, the active links to the data sets are very easy to navigate. Use the back arrow to return to the textbook.
The book’s content is written clearly. When read aloud, only a few places may benefit from a different word or phrase.
Crime statistics were analyzed for a specific cultural group. It could prove interesting to compare similar statistics for other cultural groups.
Suggestions to strengthen the book:
Include screenshots of R results and graphs to enable user to efficiently compare their results to the author’s results.
Add links to OER exercises, case studies, videos at the end of each chapter, to encourage further study.
Provide brief explanation for arguments to built-in functions (e.g. ggplot )
Table of Contents
- Preface
- Chapter 1. Introduction to Crime Data Analysis, R and RStudio
- Chapter 2. Introduction to Data Formations and Graphics
- Chapter 3. Creating a New Variable and Producing Summary Statistics
- Chapter 4. Central Tendency and Variability
- Chapter 5. Reliability of a Scale
- Chapter 6. Chi-Squared Test
- Chapter 7. T-Test
- Chapter 8. Analysis of Variance
- Chapter 9. Correlation
- Chapter 10. Linear Regression
- Conclusion
- References
Ancillary Material
Submit ancillary resourceAbout the Book
Limited access to subscription-based statistical software poses obstacles when students want to apply the skills they acquired in college. Although students may learn programs like SPSS or Stata while at the university, they often find themselves unable to continue using these programs after graduation, making their acquired skills obsolete. As an open-source software program, R offers a solution to this challenge. It is freely accessible to anyone, including students, after they graduate. Therefore, I decided to write a freely available book for those interested in becoming crime analysts, focusing on learning statistics without delving too deeply into mathematics. Moreover, this book emphasizes practical applications by utilizing R for data analysis, ensuring students can develop relevant skills beyond the university. I hope that students can easily follow the instructions in this book and replicate the same outcomes using the provided data. This practical experience will demonstrate the value of statistics and R, ideally inspiring students to further their learning in these areas.
About the Contributors
Author
Jaeyong Choi, Ph. D., is an Assistant Professor of Criminal Justice at West Chester University. He has received multiple teaching and research awards, including the 2023 Roslyn Muraskin Emerging Scholar Award from the Northeastern Association of Criminal Justice Sciences, the 2023 CJPR Policy Paper Award from the Sage Publication & Criminal Justice Policy Review, the 2020 Junior Faculty Research Award from the Korean Society of Criminology in America, and the 2018 Teaching Award from the Center for Teaching Excellence at the Indiana University of Pennsylvania. He has published more than 60 peer-reviewed articles in the areas of immigration and criminal justice, cybercrime, criminological theory, and comparative research. His recent work can be found in Criminal Justice & Behavior, Crime & Delinquency, Deviant Behavior, Journal of Criminal Justice, Journal of Interpersonal Violence, Police Quarterly, Policing & Society, and Prison Journal.
Dr. Choi enjoys traveling, taking a walk, and meditating with his wife in his spare time. With a new addition to the family, his days are filled with watching his daughter grow and reach new milestones.