HyperText2009

November 5th, 2011

DATASET: Hypertext 2009 dynamic contact network

http://www.sociopatterns.org/datasets/hypertext-2009-dynamic-contact-network/

Release date: Oct 28, 2011

This dataset was collected during the ACM Hypertext 2009 conference, where the SocioPatterns project deployed the Live Social Semantics application. Conference attendees volunteered to wear radio badges that monitored their face-to-face proximity. The dataset published here represents the dynamical network of face-to-face proximity of ~110 conference attendees over about 2.5 days. No personal data are released here, and no metadata collected by the Live Social Semantics application are exposed. We provide two data files, described below.

Contact List. This is a tab-separated list representing the active contacts during 20-second intervals of the data collection. Each line has the form “t i j”, where i and j are the anonymous IDs of the persons in contact, and the interval during which this contact was active is [ t – 20s, t ]. If multiple contacts are active in a given interval, you will see multiple lines starting with the same value of t. Time is measured in seconds since 8am on Jun 29th 2009 (UNIX ctime 1246255200).
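As a rough illustration of the contact list format, here is a minimal Python sketch that reads lines of the form "t i j" and recovers the interval each one covers. The names parse_contacts and to_utc and the sample rows are invented for this example; only the 20-second interval and the offset 1246255200 come from the description above.

```python
from datetime import datetime, timezone

EPOCH_OFFSET = 1246255200  # UNIX time of t = 0, taken from the dataset description
INTERVAL = 20              # seconds covered by each record


def parse_contacts(lines):
    """Yield (start, end, i, j) for each 't i j' line, in dataset seconds."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        t, i, j = (int(field) for field in line.split())
        yield (t - INTERVAL, t, i, j)


def to_utc(t):
    """Convert dataset seconds to an absolute, timezone-aware UTC datetime."""
    return datetime.fromtimestamp(EPOCH_OFFSET + t, tz=timezone.utc)


# Invented sample rows, not taken from the real file.
sample = ["20\t1\t2", "20\t3\t4", "40\t1\t2"]
contacts = list(parse_contacts(sample))
```

Note that the offset 1246255200 corresponds to 06:00 UTC on Jun 29th 2009, so the "8am" in the description is a local time two hours ahead of UTC.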

Contact Intervals. This file is in JSON format and contains a dictionary. Each key is a person ID and the corresponding value is a dictionary of neighbors of that person in the contact network. This dictionary of neighbors has person IDs as keys and, for each key, the value gives the list of time intervals during which the corresponding contact was active. Time is measured as above.
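The nested dictionary described above could be reduced to a total connected time per pair along the following lines. This is only a sketch: the sample data is invented, and it assumes each interval is stored as a [start, end] pair in seconds and that every contact appears under both persons, neither of which is confirmed by the description.

```python
import json

# Invented sample in the described shape; the real file's interval encoding may differ.
raw = '{"1": {"2": [[0, 20], [60, 100]]}, "2": {"1": [[0, 20], [60, 100]]}}'
data = json.loads(raw)


def connected_time(data):
    """Total contact duration per unordered pair of person IDs.

    Assumes intervals are [start, end] pairs in seconds and that the
    dictionary is symmetric (each contact listed under both persons).
    """
    totals = {}
    for person, neighbors in data.items():
        for other, intervals in neighbors.items():
            pair = tuple(sorted((person, other)))
            if pair in totals:
                continue  # reverse direction of a pair already counted
            totals[pair] = sum(end - start for start, end in intervals)
    return totals


totals = connected_time(data)
```

JSON object keys are always strings, so the person IDs come back as strings here rather than integers.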

Terms and conditions

The data are distributed to the public under a Creative Commons Attribution-NonCommercial-ShareAlike license. If you use the data for research or visualization purposes, please cite the following paper: L. Isella et al., What’s in a crowd? Analysis of face-to-face behavioral networks, Journal of Theoretical Biology 271, 166 (2011).

Please also acknowledge the SocioPatterns Project and provide a link to http://www.sociopatterns.org .

The activity plot is shown below:
Hypertext2009 Connected Time plot with mean, median, 20th percentile and 80th percentile shown on log scale

The results of KCLIQUE community finding are shown below:
Community Breakdown Hypertext2009 KCLIQUE MCT: 80th Percentile, K=4
Grouped Community Hypertext2009 KCLIQUE MCT: 80th Percentile K=4

Hypertext2009 dataset, edges removed at 80th percentile, K=4, KCLIQUE clustering. Edge thickness relates to connected time, node colour relates to community assignment, node size relates to the number of communities assigned, edge colour relates to community assignment

The same graph, but for InfoMap:

Hypertext2009 dataset, edges removed at 80th percentile, InfoMap clustering. Edge thickness relates to connected time, node colour relates to community assignment, node size relates to the number of communities assigned, edge colour relates to community assignment
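Both graphs have edges removed at the 80th percentile before clustering. One possible reading of that step is sketched below in Python: compute the 80th percentile of connected time over all edges and keep only the edges at or above it. The function name threshold_edges, the sample weights and the choice to keep edges at or above the cutoff are assumptions for illustration, not a record of how the graphs above were produced.

```python
from statistics import quantiles

# Invented connected times in seconds, keyed by (i, j) pair.
connected = {
    (1, 2): 20,
    (1, 3): 40,
    (2, 3): 60,
    (2, 4): 200,
    (3, 4): 400,
    (4, 5): 1000,
}


def threshold_edges(connected, pct=80):
    """Keep edges whose connected time is at or above the pct-th percentile."""
    cuts = quantiles(connected.values(), n=100)  # cuts[k - 1] is the k-th percentile
    cutoff = cuts[pct - 1]
    return {edge: w for edge, w in connected.items() if w >= cutoff}


kept = threshold_edges(connected)
```

The surviving edges would then be passed to the community finding algorithm (KCLIQUE or InfoMap), which is not shown here.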
