Data Collection & Quality

SNA Module 2: A Conceptual Overview

A Quick Refresher

Network Theory

  1. Social relations are often more important than individual attributes.

  2. Social networks affect individual beliefs, perceptions, and behaviors.

  3. Relations are not static but rather occur as part of a dynamic process.

Network Terms

Data Collection Considerations

Boundaries, Ties & Data Sources

Boundary Specification

How do you determine who’s in the network that you will collect data about?

  • no individual is an island, independence is NOT assumed
  • who we know, are friends with, or talk with matters
  • the influence of these relations is shaped by the larger network
  • location in a social structure shapes one’s opportunities and outcomes

Defining Ties

Relationship & Strength

  • Friendships (friend, best friend)

  • Co-Author/Presenter (# of pubs)

  • Assignment Partner

  • Committee Members

Interaction & Frequency

  • Online Discussion (# of replies/DMs)

  • Support Seeking (how often)

  • Confidential Exchanges

  • Communications

Data Sources

Archival Records

  • Email Accounts

  • Discussion Forums

  • Social Network Platforms

  • School Records

Sociometric Instrument

  • Survey or Interview Protocol

  • Nomination or Roster

  • Binary or Ordinal

  • Simple or Multiplex

Discussion

Consider the following quesions about a network you may be interested in studying:

  • What is the boundary of this network?

  • What relations within this network might be of interest to your research?

  • What attributes about actors in this network might you want to capture?

  • How might you collect data about these actors and their relations?

Data Quality

Validity, Reliability, & Measurement Error

Validity

While research on construct validity is slim, and participant recall is suspect, approaches to improve instrument validity include:

  1. Focus on more stable relations, as opposed to ones that are time specific;

  2. Limit the number of alters one can nominate;

  3. Ask respondents to try to answer what the others would say;

  4. Consider observations or archival data (e.g. logs, student records).

Reliability

Recall that reliability is the extent to which an instrument yields a similar results every time its applied to the same participant.

  • A common approaches in SNA is Test-Retest reliability.

  • Closer ties tend to be reported more reliably than weaker ones

  • Reciprocated relations may be more trustworthy.

Measurement Error

Measurement Issues

  1. Recall vs. Recognition

  2. Fixed- vs. Free-Choice (Quantity)

  3. Ratings vs. Rankings (Intensity)

  4. Missing Data

Possible Solutions

  1. Use a roster if network size < 50

  2. No consensus, consider both

  3. Ranking if network size < 20

  4. Avoid if possible ;)

Discussion

Consider the following questions about your network interest described earlier:

  • What might be some threats to the quality of your data?

  • How might you address or avoid these threats?

Essential Readings

The following chapters in Carolan (2014) cover the topics introduced in this conceptual overview in much greater depth:

In preparation for the Module 2 Code-Along and Case Study, the research article by Kellogg, Booth, and Oliver (2014) is also highly recommended.

Acknowledgements

This work was supported by the National Science Foundation grants DRL-2025090 and DRL-2321128 (ECR:BCSER). Any opinions, findings, and conclusions expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

References

Carolan, Brian. 2014. “Social Network Analysis and Education: Theory, Methods & Applications.” https://doi.org/10.4135/9781452270104.
Kellogg, Shaun, Sherry Booth, and Kevin Oliver. 2014. “A Social Network Perspective on Peer Supported Learning in MOOCs for Educators.” International Review of Research in Open and Distributed Learning 15 (5): 263–89. https://www.erudit.org/en/journals/irrodl/2014-v15-n5-irrodl04945/1065545ar.pdf.