Skip to main content
CircleNet Analytics uses three interconnected datasets to model a social media platform and analyze user behavior, relationships, and activity patterns.

The Three Datasets

The CircleNet Analytics system is built on three core datasets:
  1. CircleNetPage - User profile information for 200,000 users
  2. Follows - One-sided follow relationships with 20 million records
  3. ActivityLog - User actions and page accesses with 10 million records

Dataset Relationships

These datasets are designed to work together for comprehensive social network analytics.
CircleNetPage (200K users)
    |
    +--- Referenced by ---> Follows.ID1 & Follows.ID2 (20M relationships)
    |
    +--- Referenced by ---> ActivityLog.ByWho & ActivityLog.WhatPage (10M actions)

How They Connect

  • CircleNetPage serves as the master user directory with unique IDs (1-200,000)
  • Follows records use ID1 and ID2 to reference CircleNetPage users
    • ID1 follows ID2 (one-directional relationship)
    • The relationship is not symmetric: ID1 → ID2 is different from ID2 → ID1
  • ActivityLog tracks when users interact with pages
    • ByWho references the user performing the action
    • WhatPage references the CircleNetPage being accessed

Scale Information

DatasetRecordsPurpose
CircleNetPage200,000User profiles and demographics
Follows20,000,000Social graph relationships
ActivityLog10,000,000User interaction history
This scale is designed to test big data processing capabilities while remaining manageable for educational and development purposes.

Data Format

All datasets are stored as CSV files without headers:
  • Values are comma-separated
  • No column names in the files
  • Column position determines the attribute
  • String values do not contain commas

Use Cases

These datasets enable analytics such as:
  • User popularity rankings
  • Hobby-based user segmentation
  • Follow-back analysis
  • Activity patterns and engagement metrics
  • Regional network analysis
  • Inactive user identification

Next Steps

CircleNetPage

User profile schema

Follows

Relationship schema

ActivityLog

Activity tracking schema

Build docs developers (and LLMs) love