2022
Masters Projects
@Computer Science
+Political Science
+Economics
+Sociology
#veracity
#value
#big data
#quality
#social scientists
Project Summary
A particular recent trend for social scientists is to understand the potential of big data in complementing traditional research methods and their value in making decisions. Several major issues have to be closely investigated around big data in social sciences, including political polarization, viral information diffusion, and economic performance. The veracity and value characteristics of big data are the main concerns for social scientists. This master internship will focus on urban data, particularly the NYC taxi dataset, to develop technical procedures that help social scientists deal with this and similar urban datasets. Social scientists have used the NYC dataset in the past and yet left many dimensions unexplored. Most problematically, they have not yet provided a technology that allows for fast, flexible data access and a strategy for ensuring the quality of the data. Once such an infrastructure is in place, the NYC taxi dataset can lead to better understanding of core questions in the social sciences, such as economic decision-making and labor mobility, as well as a strategy for how social scientists can work with novel datasets. In this work, we would study data quality issues in NYC taxi big dataset by considering all of data inconsistencies, data inaccuracies, and data incompletenesses. We will propose a veracity assessment model with a veracity score calculus and veracity assessment approaches that correlate the NYC taxi data veracity to their various business queries without repairing data.
Soror Sahri
Projects in the same discipline
Diffusion Models Based Visual Counterfactual Explanations
2024 Masters Projects @Computer Science #Visual counterfactual explanations #Diffusion Models #Identification of subtle phenotypesProject Summary to be updated Valerie MezgerProjects in the same discipline
OpenStreetMap and Sentinel-2 data for the production of environmental indices for demographic studies
2023Masters Projects@Computer Science +Demography #Remote sensing#Demography#Deep learning#Sentinel 2#OpenStreetMap#Local climate zones#Africa Project Summaryto be updated. Sylvain Lobry Projects in the same discipline
Diffusion Models Based Unpaired Image-to-Image Translation to Reveal Subtle Phenotypes
2023Masters Projects@Computer Science +Mathematics/Statistics+Biology+neurodevelopment #Image-to-image translation#Deep generative models#Diffusion models#Subtle Phenotypes#Neurodevelopment Project SummaryUnpaired image-to-image translation methods aim at learning a...
Generalization of a method enabling to update vineyard geographic databases from satellite data
2023Masters Projects@Computer Science +Earth Sciences/Geosciences #image time series analysis#deep learning#optical satellite imagery#agriculture monitoring#crop type mapping#vineyard#VENUS images Project Summaryto be updated. Camille Kurtz Projects in the same...