img Leseprobe Leseprobe

Humanities Data Analysis

Case Studies with Python

Mike Kestemont, Folgert Karsdorp, Allen Riddell, et al.

ca. 43,99
Amazon iTunes Hugendubel Bü kobo Osiander Google Books Barnes&Noble Legimi
* Affiliatelinks/Werbelinks
Hinweis: Affiliatelinks/Werbelinks
Links auf sind sogenannte Affiliate-Links. Wenn du auf so einen Affiliate-Link klickst und über diesen Link einkaufst, bekommt von dem betreffenden Online-Shop oder Anbieter eine Provision. Für dich verändert sich der Preis nicht.

Princeton University Press img Link Publisher

Naturwissenschaften, Medizin, Informatik, Technik / Informatik, EDV


A practical guide to data-intensive humanities research using the Python programming language

The use of quantitative methods in the humanities and related social sciences has increased considerably in recent years, allowing researchers to discover patterns in a vast range of source materials. Despite this growth, there are few resources addressed to students and scholars who wish to take advantage of these powerful tools. Humanities Data Analysis offers the first intermediate-level guide to quantitative data analysis for humanities students and scholars using the Python programming language. This practical textbook, which assumes a basic knowledge of Python, teaches readers the necessary skills for conducting humanities research in the rapidly developing digital environment.

The book begins with an overview of the place of data science in the humanities, and proceeds to cover data carpentry: the essential techniques for gathering, cleaning, representing, and transforming textual and tabular data. Then, drawing from real-world, publicly available data sets that cover a variety of scholarly domains, the book delves into detailed case studies. Focusing on textual data analysis, the authors explore such diverse topics as network analysis, genre theory, onomastics, literacy, author attribution, mapping, stylometry, topic modeling, and time series analysis. Exercises and resources for further reading are provided at the end of each chapter.

An ideal resource for humanities students and scholars aiming to take their Python skills to the next level, Humanities Data Analysis illustrates the benefits that quantitative methods can bring to complex research questions.

  • Appropriate for advanced undergraduates, graduate students, and scholars with a basic knowledge of Python
  • Applicable to many humanities disciplines, including history, literature, and sociology
  • Offers real-world case studies using publicly available data sets
  • Provides exercises at the end of each chapter for students to test acquired skills
  • Emphasizes visual storytelling via data visualizations

Weitere Titel von diesem Autor



Statistic, Pairwise comparison, Parsing, Pattern matching, Bayesian inference, HTML, Cosine similarity, Hierarchical clustering, Statistics, Respondent, Function word, Bayes' theorem, Lexical analysis, Least squares, Topic model, Scikit-learn, Text corpus, XML, Probability, Calculation, Histogram, Ranking (information retrieval), Instance (computer science), Data set, Vector space, JSON, Sorting algorithm, Distance matrix, Addition, Normal distribution, Expectation–maximization algorithm, Mixture model, Literature, Recipe, Literary theory, Bigram, LibreOffice Calc, Random variable, Machine learning, Source lines of code, Cluster analysis, Pandas (software), Vector space model, Taxicab geometry, Principal component analysis, Stemming, Document-term matrix, Exploratory data analysis, Namespace, Box plot, NumPy, Probability distribution, Case study, Computational resource, Categorical distribution, Statistical classification, Vocabulary, High- and low-level, Negative binomial distribution, Information theory, Inference, Python (programming language), Writing style, Content analysis, Computation, Processing (programming language), Summary statistics, Publication, Categorical variable, Punctuation, Parameter (computer programming), Latent Dirichlet allocation, Variable (computer science), Result, Standard library, Writing, Pairwise, Quantitative research, Data model, Handbook, Chain letter, Interquartile range, Stylometry, Accuracy and precision, For loop, Ingredient, Subset, Cohen's kappa, Genre, Syntax error, Binomial distribution, Array data structure, Data analysis, Annotation, Conditional (computer programming), Family resemblance, Plain text, Variable (mathematics), Naming convention (programming), Bayesian