ALLASSONNIÈRE-TANG Marc

Affiliation

UMR 7206 - Diversité et évolution culturelles (DivEC)

Specialty

Linguistics, Natural Language Processing, Digital Humanities, Phylogenetics

Contact

Social media

Twitter

Website(s)

https://marctang.github.io/

Email

marc.allassonniere-tang [at] mnhn.fr

Overview

PROFESSIONAL EXPERIENCE

2021 – present: CNRS Researcher. EA (Ecological Anthropology, UMR 7206) lab at the Muséum National d’Histoire Naturelle (MNHN) in Paris, France.
2019 – 2021: Postdoctoral Researcher. DDL (Dynamics of Language) lab of University Lumière Lyon 2, France. (80% research, 20% teaching/supervision).
2016 – 2019: Ph.D. Student. Department of Linguistics and Philology, Uppsala University, Sweden. (80% research, 20% teaching; full-time funded working contract).
2013 – 2015: French Instructor. Chinese Institute of European Languages, Taiwan. (Group courses 5-20 members, 30 hours/week, level A1-C2).
2013 – 2015: Research Assistant. Syntax/Phonology lab, National ChengChi University, Taiwan. (Experiment design, database maintenance, and data analysis)
2011 – 2012: Product Manager. North Africa division, Asustek Computer, Taiwan. (Product planning, sales and marketing for notebooks and tablets)

EDUCATION

2016 – 2019: Ph.D. in linguistics, Uppsala University, Sweden / National Institute of Oriental Languages and Civilizations, France.
Thesis: A typology of classifiers and gender: From description to computation.
2013 – 2015: M.A. in linguistics, National ChengChi University, Taiwan.
Thesis: A GIS typological analysis of the convergence and divergence among numeral classifier, genders and plural markers in the world’s languages.
2006 – 2011: B.A. in Diplomacy/Arabic Language and Literature (double major), National ChengChi University, Taiwan.

SKILLS

Languages: French (Native), Chinese (Native), English (TOEIC 990/990), Arabic (CEFR B1), Swedish (CEFR B1).
Data visualization and analysis: The R tidyverse and its extensions, classification and regression (Generalized Linear Mixed Models, Random Forests, Neural Networks, Support Vector Machines), clustering methods, QGIS.
Natural Language Processing: Stylometry, topic modelling, word embeddings (GloVe, word2vec, fastText), text processing (Word segmentation, POS tagging, dependency parsing), web data harvesting (Docker, Selenium).
Computational methods: Bayesian phylogenetic inference (BayesTraits, MrBayes, BEAST), Bayesian agent-based modelling with network analysis.
Linguistics: CLAN (Computerized Language Analysis), ELAN (EUDICO Linguistic Annotator), Praat, Toolbox, VTL (VocalTractLab).
Computer: Programming languages R and Python; operating systems Linux, Mac, and Windows; LaTeX.

TEACHING

Linguistics

Fieldwork practice session with native speakers: Linguistic summer school teaching module, HT19 (International School in Linguistic Fieldwork, Paris). Instructor for 2-hour daily sessions on planning and conducting fieldwork over one week.
Current Research in Linguistics: Undergraduate course VT19 (Department of Linguistics and Philology, Uppsala University). Instructor in charge of the full course. The students learn qualitative and quantitative methods to develop and test linguistic hypotheses.
Cognitive Linguistics: Undergraduate course VT19 (Department of Linguistics and Philology, Uppsala University). Instructor in charge of the full course. This course provides basic theoretical and methodological knowledge in the area of cognitive linguistics.

Quantitative methods

Visualisation and Statistics: Postgraduate course HT18 (Faculty of Languages, Uppsala University). Instructor for weekly two-hour lab sessions on R programming for data visualization and statistical analysis.
An Introduction to Random Forests in R: Postgraduate teaching module, VT18 (Department of Linguistics and Philology, Uppsala University). Instructor for 90-minute sessions introducing the random forest classifier.

Language courses

French: High school/University group courses, HT2013 – VT2015 (Chinese Institute of European Languages). Instructor for group courses (A1-C2 levels). The teaching covered conversation, writing, and grammar courses, as well as preparation for the DELF diplomas.

CV

marc_allassonniere-tang_cv.pdf (PDF, 230.08 KB)

Projects

EVOGRAM

The role of linguistic and non-linguistic factors in the evolution of nominal classification systems (Grant: ANR, 166 936 euros) - Principal Investigator - ANR-20-CE27-0021

This project (2021-2023) is hosted at the DDL (Dynamics of Language) lab in Lyon and aims at building a database on nominal classification systems to identify the factors affecting their evolution.

MACDIT

Multi-agent models and social media data: Collective dynamics and individual trajectories in linguistic populations (Grant: Labex ASLAN, 229 916 euros) - Principal Investigator (with J-P Magué)

The goal of this project (2021-2024) is to study the interaction between individual and collective levels of language variation and change in Twitter and Wikipedia data using Bayesian agent-based models.

RELI

Recherche En Linguistique Illustrée [Research In Linguistics Illustrated] (Grant: Labex ASLAN, 6 000 euros + extension 2 000 euros) - Principal Investigator (with R Anselme)

This project (2020-2021) contributes to science outreach by popularizing the research of the ASLAN laboratories in Lyon in the form of short comics and comic strips.

FIELDLING

Funded international school in linguistic fieldwork - Organising committee

FieldLing has been organised on a yearly basis since 2010 (involved units: LLACAN, SEDYL, LACITO, DDL). It is at present the only regular (and free) intensive training program in France preparing students to study theories, methods, and the use of technological tools for language description through fieldwork.

Publications

2023

Hutin Mathilde & Allassonnière-Tang Marc, August 2023. L’apport des données participatives pour l’étude linguistique des français du monde : le cas de l’opposition /a∼/. Abstract: French is a language spoken by several hundred million speakers in Europe, Africa, and America… Journal of French Language Studies, p. 1-24

ISSN: 0959-2695, 1474-0079

https://dx.doi.org/10.1017/S0959269523000200
Levshina Natalia, Namboodiripad Savithry, Allassonnière-Tang Marc, Kramer Mathew, Talamo Luigi, Verkerk Annemarie, Wilmoth Sasha, Rodriguez Gabriela Garrido, Gupton Timothy Michael, Kidd Evan, Liu Zoey, Naccarato Chiara, Nordlinger Rachel, Panova Anastasia & Stoynova Natalia, July 2023. Why we need a gradient approach to word order. Abstract: This article argues for a gradient approach to word order, which treats word order preferences, both within and across… Linguistics vol. 61, no. 4, p. 825-883

ISSN: 0024-3949, 1613-396X

https://dx.doi.org/10.1515/ling-2021-0098
Touraille Priscille & Allassonnière-Tang Marc, 2023. Chapitre 8. Idéer une catégorie épicène et la matérialiser cohéremment dans la langue. Une nécessité épistémologique autant que politique. In Qu’est-ce qu’une femme ? Catégories homme femme : débats contemporains, p. 167-233. Éditions Matériologiques.

https://dx.doi.org/10.3917/edmat.lemar.2023.01.0169
Ulrich Natalja, Pellegrino François & Allassonnière-Tang Marc, April 2023. Intra- and inter-speaker variation in eight Russian fricatives. Acoustic variation is central to the study of speaker characterization. In this respect, specific phonemic classes such as… The Journal of the Acoustical Society of America vol. 153, no. 4, p. 2285

ISSN: 0001-4966

https://dx.doi.org/10.1121/10.0017827
Parajuli Krishna Prasad & Allassonnière-Tang Marc, February 2023. A corpus-based quantitative study of numeral classifiers in Nepali. Abstract: Nepali is typologically rare in terms of nominal classification systems, as it is one of the few languages of the… Corpus Linguistics and Linguistic Theory

ISSN: 1613-7027, 1613-7035

https://dx.doi.org/10.1515/cllt-2022-0064
Allassonnière-Tang Marc & Kilarski Marcin, 2023. Nominal Classification in Asia and Oceania: Functional and Diachronic Perspectives. John Benjamins.

ISBN: 978-90-272-1437-9

2022

Wichers Schreur Jesse, Allassonnière-Tang Marc, Bellamy Kate & Rochant Neige, December 2022. Predicting grammatical gender in Nakh languages: Three methods compared. The Nakh languages Chechen and Tsova-Tush each have a five-valued gender system: masculine, feminine, and three “neuter”… Linguistic Typology at the Crossroads vol. 2, no. 2, p. 93-126

https://dx.doi.org/10.6092/ISSN.2785-0943/14545
Her One-Soon, Hammarström Harald & Allassonnière-Tang Marc, November 2022. Defining numeral classifiers and identifying classifier languages of the world. Abstract: This paper presents a precise definition of numeral classifiers, steps to identify a numeral classifier language, and… Linguistics Vanguard vol. 8, no. 1, p. 151-164

ISSN: 2199-174X

https://dx.doi.org/10.1515/lingvan-2022-0006
Hutin Mathilde & Allassonnière-Tang Marc, September 2022. Operation LiLi: Using Crowd-Sourced Data and Automatic Alignment to Investigate the Phonetics and Phonology of Less-Resourced Languages. Less-resourced languages are usually left out of phonetic studies based on large corpora. We contribute to the recent efforts… Languages vol. 7, no. 3, p. 234

ISSN: 2226-471X

https://dx.doi.org/10.3390/languages7030234
