Mobile phone Data

Features

Individual

  • active_days
  • number_of_contacts
  • number_of_interactions
  • call_duration
  • percent_nocturnal
  • percent_initiated_conversations
  • response_delay_text
  • response_rate_text
  • entropy_of_contacts
  • Interactions_per_contact
  • percent_pareto_interactions (percentage of user’s contact that account for 80% of its interactions)
  • percent_pareto_durations

Spatial

  • Number of unique places (antennas) visited
  • Entropy of antennas
  • percent at home
  • radius of gyration (the equivalent distance of the mass from the center of gravity, for all visited places)
  • frequent_antennas - location that accounts for 80% of the locations the user was
  • churn_rate - Computes the frequency spent at every towers each week, and returns the distribution of the cosine similarity between two consecutives week

Network (These are graph network analysis)

  • Directed, weighted matrix for call, text etc
  • Directed, Unweighted matrix
  • Undirected, weighted matrix
  • Undirected, Unweighted matrix
  • Clustering coefficient - Measure of the degree to which nodes in a graph tend to cluster together
  • clustering coefficient unweighted of users weighted undirected network
  • clustering coefficient weighted (undirected)
  • assortativity of indicators(The extent to which nodes of a graphlink to others of the same degree)
  • assortativity of attributes

Recharges

  • Recharge amounts
  • Time between recharges
  • percent pareto recharges
  • Number of recharges
  • Average daily balance estimated from all recharges

Challenges

  • Mobile data will not be uniform across different networks. A different model may be required for different network.