Riding the Spider: A Network-Sampling Framework for Multi-Platform Data Collections

M&K Medien & Kommunikationswissenschaft

Volume 74 (2026), Issue 1


Authors:
Publisher
Nomos, Baden-Baden
Copyright Year
2026
ISSN-Online
2942-3317
ISSN-Print
1615-634X

Preview:

Research on the digital networked public sphere is hindered not only by challenges in data access but also by a lack of common standards for describing and implementing data collection independently of the form of access or the technologies employed. These challenges are particularly pronounced in cross-platform research. In this article, we propose a network-sampling framework to conceptualize, implement, and document explorative data collections in a generalizable, readily operationalizable, and interoperable way. Building on the theoretically established components of the networked public sphere, the concept of multilayer networks, explorative network sampling, and the legal and technical realities of cross-platform data access, we segment the data collection process into four modules: a Connector, a Parser, a Filter, and a Sampler. This framework enables researchers not only to describe their data collection in a precise and reproducible way but also to follow guidelines for developing interoperable software implementations of these modules or to propose new modules themselves.
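To make the four-module segmentation more concrete, the following is a minimal sketch of how a Connector, a Parser, a Filter, and a Sampler might be chained in an explorative collection loop. It is not the framework's actual interface: the preview names the modules but does not specify them, so the function names, signatures, the toy in-memory data, the blocklist, and the snowball-style selection rule are all assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple

Edge = Tuple[str, str]  # (source node, target node)

# Hypothetical toy "platform": node -> raw text with @mentions (made-up data).
TOY_DATA: Dict[str, str] = {
    "a": "hello @b and @c",
    "b": "reply to @a and @spam",
    "c": "see @d",
    "d": "",
}


def connector(node: str) -> str:
    """Assumed role: fetch raw data for a node (here a dict lookup, no network)."""
    return TOY_DATA.get(node, "")


def parser(node: str, raw: str) -> List[Edge]:
    """Assumed role: turn raw data into edges by extracting @mentions."""
    return [
        (node, token[1:])
        for token in raw.split()
        if token.startswith("@") and len(token) > 1
    ]


def edge_filter(edges: List[Edge]) -> List[Edge]:
    """Assumed role: drop edges pointing to unwanted nodes (hypothetical blocklist)."""
    blocked = {"spam"}
    return [edge for edge in edges if edge[1] not in blocked]


def sampler(edges: List[Edge], visited: Set[str]) -> List[str]:
    """Assumed role: pick the next seeds; here every unvisited target, sorted."""
    return sorted({target for _, target in edges} - visited)


def collect(seeds: List[str], max_rounds: int = 3) -> List[Edge]:
    """Run connector -> parser -> filter per node, then sample the next frontier."""
    visited: Set[str] = set()
    collected: List[Edge] = []
    frontier = list(seeds)
    for _ in range(max_rounds):
        if not frontier:
            break
        round_edges: List[Edge] = []
        for node in frontier:
            visited.add(node)
            round_edges.extend(edge_filter(parser(node, connector(node))))
        collected.extend(round_edges)
        frontier = sampler(round_edges, visited)
    return collected


if __name__ == "__main__":
    print(collect(["a"]))
```

Keeping each module behind a small function boundary means that, in this sketch, swapping the dictionary lookup for a real data source or the snowball rule for another sampling strategy would leave the collection loop unchanged.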

