Articles in Peer-Reviewed Journals

Bouchet-Valat Milan, forthcoming, “General Marginal-free Association Indices for Contingency Tables: From the Altham Index to the Intrinsic Association Coefficient”, Sociological Methods & Research. Sage Online First Pre-print [open access]

Tables and code

Notwithstanding a large body of literature on log-linear models and odds ratios, no general marginal-free index of the association in a contingency table has gained a wide acceptance. Building on a framework developed by L. A. Goodman, we put into light the direct links between odds ratios, the Altham index, the intrinsic association coefficient, and coefficients in log-multiplicative models including Unidiff and row-column association models. We devise a normalized version of the latter coefficient varying between 0 and 1, which offers a simpler interpretation than existing indices similar to the correlation coefficient. We illustrate with the case of educational and socioeconomic homogamy among 149 European regions how this index can be used either alone in a non- or semiparametric approach or combined with models, and how it can protect against incorrect conclusions based on models which rely on strong assumptions to summarize the strength of association as a single parameter.

Bouchet-Valat Milan, 2018, “Hypergamie et célibat selon le statut social en France depuis 1969 : une convergence entre femmes et hommes ?” [“Hypergamy and Singlehood by Social Position in France Since 1969: A Convergence Between Women and Men?”], Revue de l’OFCE, 160. OFCE [open access] Cairn [open access]

Tables and code

Couples in which the woman is more educated than her partner are now the majority in France: female hypergamy has reversed. This article updates this result based on Labor Force Survey data, and shows a similar though less marked trend regarding occupational status: the proportion of women belonging to a social class (EGP schema) higher than their partner’s has increased from 13% in 1969 to 23% in 2016. No hypergamy is observed in terms of social class of origin. We confirm the classical theory relating hypergamy to a gender-asymmetric distribution of singlehood. Although singles used to be more frequent among women with a higher status, this relation has disappeared or even reversed over time. The distribution of singles among women has become closer to that among men.

Bouchet-Valat Milan, 2018, “Educational and socioeconomic homogamy, development level and metropolisation across 149 European regions”, European Journal of Social Sciences/Revue européenne des sciences sociales, 56(1), p. 53-84. Cairn Pre-print [open access]

Working paper in French: Bouchet-Valat Milan, 2018, “Homogamie éducative et socioéconomique, niveau de développement et métropolisation dans 149 régions d’Europe”, Ined, Working paper, n°236, p. 71-90. Ined [open access]

Tables and code

Considered as a key index of social stratification, social homogamy (i.e. resemblance of partners within couples) has been used by a sociological research stream to put to the test modernisation and industrialisation theories, which hold that development comes together with a less rigid class structure and a weaker role of kinship in individual destinies. This article applies for the first time this reasoning at the sub-national scale using data regarding 149 regions of 26 European countries in 2014-2016 (European Labour Force Surveys, Eurostat). We show that educational and socioeconomic homogamy get weaker when disposable income per inhabitant increases, but that it stabilises beyond a level corresponding to the average income of studied regions. This relation also holds between regions belonging to a given country once the stronger homogamy of capitals and large metropolises is controlled for.

Bouchet-Valat Milan, 2017, “Does women’s employment growth increase wage inequalities between couples? The case of France between 1982 and 2014”, Economics and Statistics, 493, pp. 67-86.

Insee [open access]

Tables and code

It has often been argued that women’s employment growth is a factor that contributes to the increase in inequalities between households due, in particular, to an alleged reinforcement of social homogamy. In contrast to this idea, an accounting approach to inequality decomposition, based on Insee’s Labour Force surveys (enquêtes Emploi) shows that wage inequalities between couples aged 30 to 59 remained stable between 1982 and 2014 in France, whereas they would have increased had women’s employment rate not risen. This overall stability results from two converse developments, which are themselves linked to the strong growth in women’s employment over this period: a fall in wage inequality between women and an increase in the correlation of partners’ wages within couples. However, the almost uniform increase in women’s employment rate, regardless of their partner’s wage level, has limited the increase in the correlation of partners’ wages and prevented an increase in wage inequalities between couples.

Bouchet-Valat Milan, 2015, “Fewer Singles among Highly Educated Women. A Gender Reversal of Hypergamy across Cohorts in France”, Population, 70(4), pp. 705-730.

Cairn [open access]

Tables and code

In France today, women are more highly educated than men. This has affected first union formation and the traditional tendency – known as female hypergamy – for women to form unions with men who are more educated than themselves. Using data from the Family History survey (Étude de l’histoire familiale, INSEE-INED, 1999), Milan Bouchet-Valat analyses the educational characteristics of partners in a first union and the rate of permanent singlehood by educational level across cohorts, starting with those born in the 1920s. Disentangling structural effects from gender effects, he questions the existence of a norm of female hypergamy capable of withstanding the changing gender composition of the highly educated, and reveals deep-seated changes in the prevalence of male and female permanent singlehood by educational level.


Bouchet-Valat Milan, 2015, “L’analyse statistique des tables de contingence carrées - L’homogamie socioprofessionnelle en France - I. L’analyse des correspondances” [“Statistical Analysis of Square Contingency Tables - Socioeconomic Homogamy in France - I. Correspondence Analysis”], Bulletin of Sociological Methodology, 125, pp. 65–88.

SAGE Pre-print [open access]

Tables and code

Correspondence analysis (CA) is a powerful technique, commonly used by sociologists to summarize the structure of the association observed in a contingency table (also known as cross-table). In contrast, approaches derived from CA, or similar in their use, but offering superior properties, are only known by a restricted audience in France: the CA of incomplete contingency table and log-multiplicative association models. This first article uses classic CA and CA of incomplete contingency table to analyze a socioeconomic homogamy table (based on the two-digit catégories socioprofessionnelles of the spouses) from the INSEE Labour Force Surveys conducted between 2003 and 2010. It shows the interest of giving a special status to endogamous couples (in which the spouses belong exactly to the same group), in order to represent better the social space.

Bouchet-Valat Milan, 2015, “L’analyse statistique des tables de contingence carrées - L’homogamie socioprofessionnelle en France II. L’apport des modèles d’association” [“Statistical Analysis of Square Contingency Tables - Social-professional Homogamy in France II. The Contribution of Association Models”], Bulletin of Sociological Methodology, 126, pp. 5–27.

SAGE Pre-print [open access]

Tables and code

Contrary to correspondence analysis (CA), log-multiplicative association models are only known by a restricted audience in France. This second article uses the latter technique to analyze a socioeconomic homogamy table (based on the two-digit catégories socioprofessionnelles of the spouses) from the INSEE Labour Force Surveys conducted between 2003 and 2010. It illustrates the strong resemblance between the two approaches, and highlights the advantages brought by the greater flexibility of association models. They provide a view of the social space both more accurate and easier to interpret than CA, in particular by giving special status to endogamous couples (in which the spouses belong exactly to the same group). They also allow us to analyze specifically men-women asymmetries which emerge from the choice of a spouse (hypergamy), an essential issue poorly apprehended by classical methods.


Bouchet-Valat Milan, 2014, “Changes in Educational, Social Class and Social Class of Origin Homogamy in France (1969–2011): Greater Openness Overall but Increased Closure of Elites”, Revue française de sociologie, 55(3), pp. 324–364.


Tables and code     Other homogamy tables

Despite the relevance of homogamy for the study of social structure, the few existing French studies on the evolution of homogamy have significant weaknesses. This article aims to fill this gap by studying the evolution of educational, social class and social class of origin homogamy (following the Erikson–Goldthorpe–Portocarero—EGP classification) within cohabiting couples, using 42 Labour Force Surveys conducted by INSEE between 1969 and 2011. Homogamy is shown to have clearly declined in all three of the components studied, both in absolute terms and beyond what would be expected as a result of structural changes to the population. Endogamy has declined for almost all groups and the dimensions of social space for partner choice have become largely unimportant. In particular, the specificity of persons with no qualifications, of the self-employed, and of agricultural occupations has very clearly diminished. Only the endogamy of graduates of grandes écoles has increased. Finally, these results are related to changes in class-consciousness, socio-spatial segregation and economic inequality.


Bastin Gilles, Bouchet-Valat Milan, 2014, “Media Corpora, Text Mining, and the Sociological Imagination - A Free Software Text Mining Approach to the Framing of Julian Assange by Three News Agencies Using R.TeMiS”, Bulletin of Sociological Methodology, 122, p. 5‑25. SAGE

In this paper, we introduce R.TeMiS, a free software solution aimed at exploring new dimensions in text mining with a particular focus on media framing analysis. R.TeMiS is especially designed to provide help in a) the automation of corpus construction and management procedures based on the use of large media content data bases, and b) the extension of the range of statistical tools available to social scientists exploring texts through R coding (one and two-way tables, time series, hierarchical clustering, correspondence analysis, geographical mapping…). A case study on the media framing of Julian Assange from January 2010 to December 2011 is conducted. It is based on the analysis of a corpus of 667 news dispatches published in English by the three top international news agencies: Agence France-Presse (AFP), Reuters and Associated Press (AP).

Bouchet-Valat Milan, Bastin Gilles, 2013, “RcmdrPlugin.temis, a Graphical Integrated Text Mining Solution in R”, The R Journal, 5(1), pp. 188–197.

R Journal [open access]

We present the package RcmdrPlugin.temis, a graphical user interface for user-friendly text mining in R. Built as a plug-in to the R Commander provided by the Rcmdr package, it brings together several existing packages and provides new features streamlining the process of importing, managing and analyzing a corpus, in addition to saving results and plots to a report file. Beyond common file formats, automated import of corpora from the Dow Jones Factiva content provider and Twitter is supported. Featured analyses include vocabulary and dissimilarity tables, terms frequencies, terms specific of levels of a variable, term co-occurrences, time series, correspondence analysis and hierarchical clustering.


Book Chapters

Bouchet-Valat Milan, Peugny Camille, Vallet Louis-André, 2016, “Inequality of Educational Returns in France. Changes in the Effect of Education and Social Background on Occupational Careers”, in Bernardi Fabrizio, Ballarino Gabriele (eds.), Education, Occupation and Social Origin. A Comparative Analysis of the Transmission of Socio-Economic Inequalities, Cheltenham, Edward Elgar.

Elgar Online [Introduction in open access]

Online appendix (long version in OSC Notes & Documents)

Summary of findings on Population Europe blog

Auzière Alexia, Bouchet-Valat Milan, Chevalier Martin, De Lisi Laura, Poullaouec Tristan, 2013, “Les formes de sélection des normaliens par le concours d’entrée à l’ENS Cachan” [“The Forms of Selection of ENS Cachan Students Through the Entrance Examination”], in Le Bot Florent, Albe Virginie, Bodé Gérard, Brucy Guy, Chatel Elisabeth (eds.), L’ENS Cachan : Le siècle d’une grande école pour les sciences, les techniques, la société, Rennes, Presses Universitaires de Rennes, pp. 315–329.



Ph.D. Thesis

Bouchet-Valat Milan, 2015, The Wheels of Love and Chance. Homogamy and Hypergamy in Contemporary France and Europe: Socioeconomic and Educational Dimensions, Variations and Mechanisms, Ph.D. Thesis in Sociology, Paris, Sciences Po, 610 p.

Table of contents and summary [in English] Full manuscript [in French]

Magna cum laude. Supervisor: Louis-André Vallet (OSC-CNRS). Jury: Francesco Billari (University of Oxford), Philippe Coulangeon (OSC-CNRS), Brendan Halpin (University of Limerick), Éva Lelièvre (Ined), Louis-André Vallet (OSC-CNRS).

The persistence of social homogamy is regularly observed; the tendency to form couples in which the woman occupies a lower position than her partner – female hypergamy – is also well documented: love is far from blind to social distinctions. Yet, it was concluded somewhat too fast from this major result that these phenomena have remained stable over time. This thesis shows, based on large-scale survey data, that education, social class and social class of origin homogamy has significantly weakened over the last forty years in France. This trend goes beyond what changes in the population structure would have implied (relative homogamy): the composition of couples has become closer to a random mating situation. Hypergamy has also clearly diminished, and has even reversed regarding education; females hold higher qualifications than their partners since the year 2000. As a consequence, the higher celibacy rate of women with a high social status, in particular that of the higher educated, which stemmed from their unfavorable position on the conjugal market, has converged to that of other groups. The variations of these two dimensions of partner choice over 64 regions of Europe in the 2000s are also sizable. Educational homogamy is higher than socioeconomic homogamy; hypergamy is the most frequent situation, but it is not an absolutely general pattern. The degree of cultural liberalism and of social openness appears as the strongest determinant of these two phenomena. They are negatively correlated with female labor force participation, but their relationship with economic inequalities is ambiguous.

Recensions d'ouvrages

Bouchet-Valat Milan, 2018, “European Classes. About C. Hugrée, É. Penissat and A. Spire, Les classes sociales en Europe. Tableau des nouvelles inégalités sur le vieux continent, Agone”, Books & Ideas, May 3rd. Open access

Bouchet-Valat Milan, 2018, “About: A. Schmitz, 2017, The Structure of Digital Partner Choice. A Bourdieusian Perspective, New York, Springer, 219 p.”, Population, 73(1), p. 162-164. Cairn