Interpretable Poverty Mapping Using Social Media, Satellite Images and Geospatial Information

Satellite images are costly to acquire and training a deep learning model requires costly GPU resources
Deep learning models do not provide interpretability
In this study these challenges are overcome by combining social media and geospatial data sources with cost efficient ML methods as an interpretable and inexpensive approach to poverty estimation

Data used

Linear Regression, Lasso Regression, Ridge Regression, Random Forest, and LightGBM. The models were trained on social media data, remote sensing data, and point of interest data, first separately then combined, with the hypothesis that integrating multiple data sources will lead to improved model performance over using any one data source alone.

Using multiple data sources provided better results than using a single data source
Important features in this study to predict wealth are night time light values, proportion of population with 4G access, presence of public schools