Geo-linguistic fingerprint and the evolution of languages in Twitter

Having access to content of messages sent by some given group of subscribers of a social network may be used to identify (and quantify) some features of that group. The feature can stand for the level of interest in some event or product, or for the popularity of some idea, or a musical hit or of a political figure. The feature can also stand for the way the written language is used and transformed, the way words are spelled and grammer is used. In this paper we shall be interested in identifying features of groups of subscribers that have their geographic location and their language in common. We develop a methodology that allows one to perform such a study using a statistical tool which is freely available, and which makes use of a part of all tweets which twitter makes available for free over the Internet. The methodology is based on the fact that one can differentiate among some geographic areas according to the activity pattern of tweets during the time of the day. We present an application of this methodology to the study of new spellings or of new words created in twitter messages

Data and Resources

Additional Info

Field Value
Source https://inria.hal.science/hal-00674853
Author Altman, Eitan, Portilla, Yonathan
Maintainer CCSD
Last Updated May 20, 2026, 23:45 (UTC)
Created May 20, 2026, 23:45 (UTC)
Identifier hal-00674853
Language en
Rights https://about.hal.science/hal-authorisation-v1/
contributor Models for the performance analysis and the control of networks (MAESTRO) ; Centre Inria d'Université Côte d'Azur ; Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
creator Altman, Eitan
date 2012-02-28T00:00:00
harvest_object_id 771453cb-be3b-4b29-b33a-552f984a84a4
harvest_source_id 3374d638-d20b-4672-ba96-a23232d55657
harvest_source_title test moissonnage SELUNE
metadata_modified 2025-08-26T00:00:00
set_spec type:REPORT