Home#
chatter: a Python library for applying information theory and AI/ML models to animal communication
The study of animal communication often involves categorizing units into types (e.g. syllables in songbirds, or notes in humpback whales). While this approach is useful in many cases, it necessarily flattens the complexity and nuance present in real communication systems. chatter is a new Python library for analyzing animal communication in continuous latent space using information theory and modern machine learning techniques. It is taxonomically agnostic, and has been tested with the vocalizations of birds, bats, whales, and primates. By leveraging a variety of different architectures, including variational autoencoders and vision transformers, chatter represents vocal sequences as trajectories in high-dimensional latent space, bypassing the need for manual or automatic categorization of units. The library provides an end-to-end workflow—from preprocessing and segmentation to model training and feature extraction—that enables researchers to quantify features like:
Complexity: path length of sequences in latent space per unit time.
Predictability: predictability of a transition in latent space.
Similarity: cosine similarity between units or sequences in latent space.
Novelty: inverse of predicted density of units or sequences in latent space.
Below is a basic diagram of the chatter workflow, showing the progression from spectrograms to latent features to visualizations in 2D space.

Additionally, chatter makes it easy to explore the latent space of a species’ vocalizations, either statically or with an interactive plot like the one below (of syllables in Cassin’s vireo song).

This project is heavily inspired by the work of folks like Nilo Merino Recalde and Tim Sainburg. Here is a list of related projects:
Sainburg, T., Thielk, M., Gentner, T. Q. (2020). Finding, visualizing, and quantifying latent structure across diverse animal vocal repertoires. PLOS Computational Biology. https://doi.org/10.1371/journal.pcbi.1008228
Goffinet, J., Brudner, S., Mooney, R., Pearson, J. (2021). Low-dimensional learned feature spaces quantify individual and group differences in vocal repertoires. eLife. https://doi.org/10.7554/eLife.67855
Merino Recalde, N. (2023). pykanto: a python library to accelerate research on wild bird song. Methods in Ecology and Evolution. https://doi.org/10.1111/2041-210X.14155
Alam, D., Zia, F., Roberts, T. F. (2024). The hidden fitness of the male zebra finch courtship song. Nature. https://www.doi.org/10.1038/s41586-024-07207-4
Please cite chatter as:
Youngblood, M. (2025). chatter: a Python library for applying information theory and AI/ML models to animal communication. arXiv. https://doi.org/10.48550/arXiv.2512.17935
@article{youngblood_chatter_2025,
author = {Youngblood, Mason},
title = {chatter: a Python library for applying information theory and AI/ML models to animal communication},
date = {2025},
journal = {arXiv},
doi = {10.48550/arXiv.2512.17935},
url = {https://doi.org/10.48550/arXiv.2512.17935}
}
Full Vignettes
Short Vignettes
Reference
Chatter: a Python library for applying information theory and AI/ML models to animal communication. |