Diff
checker
Text
Text
Images
Documents
Excel
Folders
Legal
Enterprise
Desktop
Pricing
Sign in
Download Diffchecker Desktop
Compare text
Find the difference between two text files
Tools
History
Real-time editor
Hide whitespace changes
Hide unchanged lines
Disable line wrap
Layout
Split
Unified
Diff precision
Smart
Word
Char
Text styles
Change appearance
Syntax highlighting
Choose syntax
Ignore
Transform text
Go to first change
Edit input
Diffchecker Desktop
The most secure way to run Diffchecker. Get the Diffchecker Desktop app: your diffs never leave your computer!
Get Desktop
Untitled Diff
Created
7 years ago
Diff never expires
Clear
Export
Share
Explain
456 removals
Lines
Total
Removed
Characters
Total
Removed
To continue using this feature, upgrade to
Diff
checker
Pro
View Pricing
463 lines
Copy
499 additions
Lines
Total
Added
Characters
Total
Added
To continue using this feature, upgrade to
Diff
checker
Pro
View Pricing
464 lines
Copy
Gender Differences in Recognition for Group Work
Gender Differences in Recognition for Group Work
Copy
Copied
Copy
Copied
Heather Sarsons
Heather Sarsons
, Klarita Gërxhani, Ernesto Reuben, and Arthur Schram
∗
∗
Copy
Copied
Copy
Copied
February 4
, 2019
September 15
, 2019
Abstract
Abstract
Does gender influence how credit for group work is allocated? Using data from
Does gender influence how credit for group work is allocated? Using data from
Copy
Copied
Copy
Copied
academic economists’ CVs,
I
test whether coauthored and solo-authored publica
tions
academic economists’ CVs,
we
test whether coauthored and solo-authored publica
-
matter differently for tenure for men and women.
Because coauthors are listed
alpha
-
tions
matter differently for tenure for men and women.
Because coauthors are listed
betically in economics, coauthored papers do not provide specific information
about
alpha
betically in economics, coauthored papers do not provide specific information
each contributor’s skills or ability.
Solo-authored papers,
on the other hand,
provide
about
each contributor’s skills or ability.
Solo-authored papers,
on the other hand,
a
relatively
clear
signal
of
ability.
I
find
that
conditional
on
publication
quality and
provide
a
relatively
clear
signal
of
ability.
We
find
that
conditional
on
publication
other observables,
men are tenured at roughly the same rate regardless of
whether
quality and
other observables,
men are tenured at roughly the same rate regardless of
they coauthor or solo-author.
Women, however, become less likely to receive
tenure
whether
they coauthor or solo-author.
Women, however, become less likely to receive
the more they coauthor.
The result is most pronounced for women coauthoring
with
tenure
the more they coauthor.
The result is most pronounced for women coauthoring
men and less pronounced among women who coauthor with other women.
An on-
with
men and less pronounced among women who coauthor with other women.
Two
line experiment finds similar patterns when women perform male-stereotyped tasks.
experiments suggest that both stereotypes surrounding a task as well as the evalua-
However, women receive equal credit for joint work with men when the task is per-
tors’ gender affect who receives credit. Taken together, our results are best explained
ceived to be gender-neutral. Taken together, the results suggest that gender and stereo-
by gender and stereotypes having a noticeable influence on
the allocation of credit for
types influence
the allocation of credit for
group work.
group work.
∗
∗
Copy
Copied
Copy
Copied
I
especially thank
Roland Fryer, Claudia Goldin, Larry
Katz,
David Laibson,
and Amanda Pallais for
Sarsons, University of Chicago Booth (heather.sarsons@chicagobooth.edu); Gërxhani, European Uni-
their guidance and encouragement.
I
also thank Mitra
Akhtari, Amitabh Chandra, John Coglianese, Oren
versity Institute; Reuben, New York University Abu Dhabi and LISER; Schram, Amsterdam School of Eco-
Danieli, Ellora Derenoncourt, Florian Ederer, Ben Enke,
Raissa Fabregas, Nicole Fortin,
Peter Ganong, Ed
-
nomics and European University Institute. Sarsons
especially thank
s
Roland Fryer, Claudia Goldin, Larry
ward Glaeser, Siri Isaksson,
Sara Lowes, Rob McMillan, Eduardo Montero, Gautam Rao, Alex Segura, Nihar
Katz,
David Laibson,
and Amanda Pallais for
their guidance and encouragement.
We
also thank Mitra
Shah, Peter Tu,
Justin Wolfers, and
participants
at SOLE 2016, the Early Behavioural Economics Conference,
Akhtari, Amitabh Chandra, John Coglianese, Oren
Danieli, Ellora Derenoncourt, Florian Ederer, Ben Enke,
the Harvard Business School Gender Initiative, and the Harvard Inequality Seminar for
helpful com
ments
Raissa Fabregas, Nicole Fortin,
Nickolas Gagnon,
Peter Ganong, Ed
ward Glaeser, Siri Isaksson,
Emir Ka-
and suggestions.
This paper is intentionally solo-authored.
menica,
Sara Lowes, Rob McMillan, Eduardo Montero, Gautam Rao, Alex Segura, Nihar
Shah, Peter Tu,
Jeroen van de Ven,
Justin Wolfers, and
various conference and seminar
participants
for their
helpful com
-
ments
and suggestions.
1
1
1 Introduction
1 Introduction
Do employers use gender when allocating credit for group work, particularly when in-
Do employers use gender when allocating credit for group work, particularly when in-
dividual contributions are unobserved? Organizations increasingly rely on group work
dividual contributions are unobserved? Organizations increasingly rely on group work
for production (Lazear and Shaw, 2007), yet there is little empirical evidence document-
for production (Lazear and Shaw, 2007), yet there is little empirical evidence document-
ing how credit for group work is allocated. Unless employers can perfectly observe each
ing how credit for group work is allocated. Unless employers can perfectly observe each
worker’s contribution to the team’s output, they must decide how to allocate credit with-
worker’s contribution to the team’s output, they must decide how to allocate credit with-
out having full information as to what each worker did. This could leave room for demo-
out having full information as to what each worker did. This could leave room for demo-
graphic characteristics, such as gender, to influence the allocation of credit.
graphic characteristics, such as gender, to influence the allocation of credit.
Copy
Copied
Copy
Copied
In this paper,
I
test whether uncertainty over an individual’s contribution to a project
In this paper,
we
test whether uncertainty over an individual’s contribution to a project
leads to differential attribution of credit that contributes to the gender promotion gap. In
leads to differential attribution of credit that contributes to the gender promotion gap. In
many industries, women are not only hired at lower rates than men are, they are also
many industries, women are not only hired at lower rates than men are, they are also
promoted at lower rates.
promoted at lower rates.
1
1
This paper explores whether gender differences in credit for
This paper explores whether gender differences in credit for
group work exist and whether they explain part of the promotion gap.
group work exist and whether they explain part of the promotion gap.
Copy
Copied
Copy
Copied
I
primarily look at the tenure decisions of academic economists to test whether gen
der
We
primarily look at the tenure decisions of academic economists to test whether gen
-
influences
the
allocation
of
credit
for
coauthored
papers.
Economics
is
a
relevant
setting
der
influences
the
allocation
of
credit
for
coauthored
papers.
Economics
is
a
relevant
as there is a large tenure gap between men and women, and because the amount
of coau
-
setting
as there is a large tenure gap between men and women, and because the amount
thoring has risen dramatically in recent years (Ginther and Kahn,
2004;
Hammer
mesh,
of coau
thoring has risen dramatically in recent years (Ginther and Kahn,
2004;
Hammer
-
2013). Using data from economists’ CVs,
I
track individuals’ career trajectories
and com
-
mesh,
2013). Using data from economists’ CVs,
we
track individuals’ career trajectories
pare whether the trajectory is different for individuals who coauthor versus solo-
author,
and com
pare whether the trajectory is different for individuals who coauthor versus solo-
and whether there is a difference by gender.
author,
and whether there is a difference by gender.
Within economics,
I
find that men and women who solo-author most of their work
Within economics,
we
find that men and women who solo-author most of their work
have similar tenure rates conditional on a proxy for the quality of papers. However, an
have similar tenure rates conditional on a proxy for the quality of papers. However, an
additional coauthored paper is correlated with an 8.2% increase in tenure probability for
additional coauthored paper is correlated with an 8.2% increase in tenure probability for
men but only a 5.6% increase for women. This gap is significantly less pronounced for
men but only a 5.6% increase for women. This gap is significantly less pronounced for
women who coauthor with women, suggesting that the attribution of credit is related
women who coauthor with women, suggesting that the attribution of credit is related
to the gender mix of coauthors. Furthermore, a man who coauthors is no less likely to
to the gender mix of coauthors. Furthermore, a man who coauthors is no less likely to
receive tenure than a comparable man who solo-authors even though there is presumably
receive tenure than a comparable man who solo-authors even though there is presumably
more uncertainty as to how much work he did. A counterfactual exercise suggests that
more uncertainty as to how much work he did. A counterfactual exercise suggests that
this difference in credit allocation explains 60% of the unconditional gender gap in tenure
this difference in credit allocation explains 60% of the unconditional gender gap in tenure
rates and 84% of the gap that remains after controlling for average paper quality, citations,
rates and 84% of the gap that remains after controlling for average paper quality, citations,
tenure and PhD institution ranks, and field.
tenure and PhD institution ranks, and field.
Copy
Copied
Copy
Copied
To ensure that
I am
not picking up on ability differences between men and women,
I
To ensure that
we are
not picking up on ability differences between men and women,
control for the quality of papers using both journal rankings and citations, allowing
for
we
control for the quality of papers using both journal rankings and citations, allowing
1
1
Blau and DeVaro (2007), for example, find that across jobs, women are less likely to be promoted than
Blau and DeVaro (2007), for example, find that across jobs, women are less likely to be promoted than
men conditional on worker’s performance and ability ratings. In the UK, female managers are nearly 40%
men conditional on worker’s performance and ability ratings. In the UK, female managers are nearly 40%
Copy
Copied
Copy
Copied
less likely to be promoted than male managers (Elmins et al.
2016).
less likely to be promoted than male managers (Elmins et al.
,
2016).
2
2
Copy
Copied
Copy
Copied
a comparison of men and women with similar research portfolios.
The results are also
for
a comparison of men and women with similar research portfolios.
The results are also
robust to including other individual-level controls such as length of time to tenure and
robust to including other individual-level controls such as length of time to tenure and
the
the
seniority of one’s coauthors, as well as tenure year, tenure institution, and primary
seniority of one’s coauthors, as well as tenure year, tenure institution, and primary
field
field
fixed effects.
fixed effects.
I
argue that these results are most consistent with a story of women receiving less
credit
We
argue that these results are most consistent with a story of women receiving less
for their joint work with men because of bias. To show this,
I
first use current
CV and
credit
for their joint work with men because of bias. To show this,
we
first use current
citation data to compare the productivity of men and women who did and did
not receive
CV and
citation data to compare the productivity of men and women who did and did
tenure at the institution where they initially went up for tenure. While the
estimates are
not receive
tenure at the institution where they initially went up for tenure. While the
imprecise,
I
find suggestive evidence that women who coauthor and are
denied tenure
estimates are
imprecise,
we
find suggestive evidence that women who coauthor and are
produce more solo-authored papers that publish in high-ranking journals
than men who
denied tenure
produce more solo-authored papers that publish in high-ranking journals
are denied tenure. Data on citations show a similar result.
than men who
are denied tenure. Data on citations show a similar result.
I
then rule out several alternative explanations for the empirical patterns.
For ex
ample,
We
then rule out several alternative explanations for the empirical patterns.
For ex
-
several papers have demonstrated that selection into coauthorship in economics
is not
ample,
several papers have demonstrated that selection into coauthorship in economics
random.
is not
random.
2
2
Copy
Copied
Copy
Copied
I
test for selection into coauthorship and do not find any evidence that
women
We
test for selection into coauthorship and do not find any evidence that
coauthor with high ability or more senior men.
I
also look at the timing of coau
thorship
women
coauthor with high ability or more senior men.
We
also look at the timing of coau
-
and find no evidence that women begin coauthoring if they have a slower start
to their
thorship
and find no evidence that women begin coauthoring if they have a slower start
careers. The empirical patterns are also inconsistent with taste-basted discrimina
tion.
to their
careers. The empirical patterns are also inconsistent with taste-basted discrimina
-
Because the CV data do
es
not allow
me
to rule out the possibility that women actually
tion.
contribute less to papers that are coauthored with men,
I
conduct
an online
experiment
Because the CV data do
not allow
us
to rule out the possibility that women actually
de
signed to test whether
this
drive
s
credit allocation.
In the
experiment,
I
first hire indi
-
contribute less to papers that are coauthored with men,
we
conduct
two
experiment
s de-
viduals to complete quizzes on topics that are ei
ther male or female-stereotyped.
I
then
signed to test whether
real or perceived differences in contributions
drive
credit allocation.
hire participants who act as “predictors” and
are randomized into a
solo
treatment or a
In the
first
experiment,
we
first hire indi
viduals to complete quizzes on topics that are ei
-
group
treatment.
Predictors in the
solo
treatment are shown two individual’s separate
ther male or female-stereotyped.
We
then
hire participants who act as “predictors” and
quiz scores while predictors in the
group
treatment are shown the combined score of two
are randomized into a
n individual
treatment or a
joint
treatment.
Predictors in the
indi-
individuals. They are then asked to
predict the performance of each participant on future
vidual
treatment are shown two individual’s separate
quiz scores while predictors in the
quizzes.
joint
treatment are shown the combined score of two
individuals. They are then asked to
I find that in the group
treatment, women are predicted to perform worse than their
predict the performance of each participant on future
quizzes.
male counter
parts for male-stereotyped quizzes, suggesting that
participants making the
In the joint
treatment, women are predicted to perform worse than their
male counter
-
predict
ion
s believe that women con
tributed less to the combined score (that is,
they per
-
parts for male-stereotyped quizzes, suggesting that
predict
or
s believe that women con
-
formed worse).
However,
if pairs
performed a female-stereotyped quiz, women and men
tributed less to the combined score (that is,
they per
formed worse).
However,
if pairs
are given equal credit.
To under
stand whether these results are driven by participants’
performed a female-stereotyped quiz, women and men
are given equal credit.
To under
-
beliefs about the ability distribu
tions of men and women,
I
randomly provide some
par-
stand whether these results are driven by participants’
beliefs about the ability distribu
-
tions of men and women,
we
randomly provide some
participants with the distribution of
scores on the initial quiz by gender. Women appear to be given equal credit in the female-
stereotyped quiz because participants view it as being gender-neutral. That is, they do not
2
2
Copy
Copied
Copy
Copied
Boschini and Sjögren (2007)
test whether coauthorship patterns in economics are gender neutral. They
See, for example,
Boschini and Sjögren (2007)
,
Garcia and Sherman (2015)
, and Bikard et al (2015).
find that women are more likely to solo-author than men and that gender homophily in coauthoring in-
creases with women’s representation in a field.
Garcia and Sherman (2015)
argue that the alphabetical pub-
lishing norms in economics influence both the types of people authors work with and the types of projects
they will work on.
3
3
Copy
Copied
Copy
Copied
ticipants with the distribution of scores on the initial quiz by gender. Women appear to
realize that women tend to outperform men.
Show
ing participants the gender distribution
be given equal credit in the female-stereotyped quiz because participants view it as being
of scores corrects this belief and women are then
predicted
to
have
a
better
performance
gender-neutral. That is, they do not
realize that women tend to outperform men.
Show
-
in
future
female-stereotyped
quizzes
but it
does not affect the predicted performance gap
ing participants the gender distribution
of scores corrects this belief and women are then
for women and men
performing male-stereotyped tasks.
predicted
to
have
a
better
performance
in
future
female-stereotyped
quizzes
. Surpris-
The second experiment is conducted in a more natural setting with human resources
ingly, this treatment
does not affect the predicted performance gap
for women and men
personnel. Following a similar design, we again test whether women are less likely than
performing male-stereotyped tasks.
men to receive credit for good group performance. We additionally elicit the HR person-
nels’ beliefs about male and female performance and find that differences in the allocation
of credit are largely driven by differences in beliefs. We also find that male HR personnel
are more likely to hire in favor of men, and women in favor of women.
This paper replicates and builds off of the results in Sarsons (2017), which shows the
This paper replicates and builds off of the results in Sarsons (2017), which shows the
Copy
Copied
Copy
Copied
basic
correlational
patterns
between
paper
composition
and
tenure.
In
this
paper,
I
repli
-
basic
correlational
patterns
between
paper
composition
and
tenure.
In
this
paper,
we
cate the results using more data and then use the C.V. data and
an
experiment
to es
tablish a
repli
cate the results using more data and then use the C.V. data and
two
experiment
s
to es
-
channel through which gender influences the allocation of credit. The paper also
relates to
tablish a
channel through which gender influences the allocation of credit. The paper also
a large literature seeking to understand difference in labor market outcomes be
tween men
relates to
a large literature seeking to understand difference in labor market outcomes be
-
and women.
Factors such as productivity, personality and behavioural differ
ences (such
tween men
and women.
Factors such as productivity, personality and behavioural differ
-
as competition aversion), and fertility preferences have been shown to explain
some differ
-
ences (such
as competition aversion), and fertility preferences have been shown to explain
ences in career choice and progression.
some differ
ences in career choice and progression.
3
3
Copy
Copied
Copy
Copied
In academia in particular, studies have
pointed
In academia in particular, studies have
to both supply-side factors, including differences in subject matter interest (Dy
nan and
pointed
to both supply-side factors, including differences in subject matter interest (Dy
-
Rouse, 1997) and the availability of role models (Hale and Regev, 2014; Carrell
, Page, and
nan and
Rouse, 1997) and the availability of role models (Hale and Regev, 2014; Carrell
et
West
, 2010); demand-side factors, such as implicit bias (Milkman
, Akinola, and Chugh,
al.
, 2010); demand-side factors, such as implicit bias (Milkman
et al.,
2015; Moss-Racusin et
2015; Moss-Racusin et
al., 2012); and institutional factors (Antecol et al., 2018). This paper
al., 2012); and institutional factors (Antecol et al., 2018). This paper
directly tests whether
directly tests whether
the differential treatment of work output contributes to the gender
the differential treatment of work output contributes to the gender
gap.
gap.
The remainder of the paper is organized as follows. Section 2 describes the data and
The remainder of the paper is organized as follows. Section 2 describes the data and
Copy
Copied
Copy
Copied
shows that a tenure gap exists between male and female economists.
In Section 3,
I
show
shows that a tenure gap exists between male and female economists.
In Section 3,
we
that the tenure gap closes as women produce more solo-authored papers but does
not
show
that the tenure gap closes as women produce more solo-authored papers but does
close as they produce more coauthored papers.
Women have a consistently lower
proba
-
not
close as they produce more coauthored papers.
Women have a consistently lower
bility of tenure for each additional coauthored paper than men.
I
show that the
results are
proba
bility of tenure for each additional coauthored paper than men.
We
show that the
robust to accounting for attrition, and to using different journal rankings and
definitions
results are
robust to accounting for attrition, and to using different journal rankings and
of tenure.
In Section 4,
I
argue that the results are in line with a story in which
women
definitions
of tenure.
In Section 4,
we
argue that the results are in line with a story in which
receive less credit for joint work with men.
I
test alternative explanations of the
relation
-
women
receive less credit for joint work with men.
We
test alternative explanations of the
ship between coauthorship and tenure and argue that none can fully explain the
observed
relation
ship between coauthorship and tenure and argue that none can fully explain the
empirical patterns. Section 5 discusses the
structure
and results of the experiment
.
Section
observed
empirical patterns. Section 5 discusses the
design
and results of the experiment
s.
6 concludes.
Section
6 concludes.
3
3
There is a large literature documenting gender differences in productivity, attitudes toward different
There is a large literature documenting gender differences in productivity, attitudes toward different
Copy
Copied
Copy
Copied
types of work, and family choices. See, for example, Niederle and Vesterlund (2007),
Antecol et al.
(201
6),
types of work, and family choices. See, for example, Niederle and Vesterlund (2007),
Buser et al. (2014),
Ceci et al. (2014)
, and Ginther and Kahn (2004).
Antecol et al.
(201
8),
Ceci et al. (2014)
, Reuben et al. (2017)
, and Ginther and Kahn (2004).
4
4
2 Data
2 Data
Copy
Copied
Copy
Copied
To examine the relationship between paper composition and tenure,
I
construct a dataset
To examine the relationship between paper composition and tenure,
we
construct a dataset
using the CVs of economists who came up for tenure between 1985 and 2014 at one of the
using the CVs of economists who came up for tenure between 1985 and 2014 at one of the
top 35 U.S. PhD-granting universities
top 35 U.S. PhD-granting universities
4
4
. The academic progression documented in the CVs
. The academic progression documented in the CVs
makes it possible to evaluate the relationship between an individual’s research output and
makes it possible to evaluate the relationship between an individual’s research output and
Copy
Copied
Copy
Copied
career progression.
I
can then compare the degree of collaborative work and reward for
career progression.
We
can then compare the degree of collaborative work and reward for
that work, and compare these results for men versus women.
that work, and compare these results for men versus women.
2.1 Sample Selection and Data Overview
2.1 Sample Selection and Data Overview
Copy
Copied
Copy
Copied
I
include
only
PhD-granting
institutions
in
the
sample
as
tenure
evaluation
at
these
schools
We
include
only
PhD-granting
institutions
in
the
sample
as
tenure
evaluation
at
these
is primarily based on research output, of which
I
have a clear measure.
Other
institutions
schools
is primarily based on research output, of which
we
have a clear measure.
Other
like
liberal
arts
colleges
place
greater
weight
on
teaching
ability
for
tenure,
something
institutions
like
liberal
arts
colleges
place
greater
weight
on
teaching
ability
for
tenure,
that
I
cannot measure.
I
exclude business and public policy schools for
similar
reasons.
something
that
we
cannot measure.
We
exclude business and public policy schools for
similar
reasons.
5
5
Copy
Copied
Copy
Copied
It
is
reasonable
to
assume
that
the
top
35
economics
departments
in
the U.S. emphasize
It
is
reasonable
to
assume
that
the
top
35
economics
departments
in
research which is measured by the number and quality of papers one
produces.
the U.S. emphasize
research which is measured by the number and quality of papers one
produces.
One problem in collecting tenure information is that the CVs of individuals who went
One problem in collecting tenure information is that the CVs of individuals who went
up for tenure, were denied it, and left to industry or government are difficult to find, lead-
up for tenure, were denied it, and left to industry or government are difficult to find, lead-
Copy
Copied
Copy
Copied
ing to a sample selection problem.
To deal with this issue,
I
collected historical faculty
lists
ing to a sample selection problem.
To deal with this issue,
we
collected historical faculty
from 23 of the 35 schools and locate over 90% of faculty who had ever gone up for
tenure
lists
from 23 of the 35 schools and locate over 90% of faculty who had ever gone up for
at these 23 institutions. For the remaining 12 schools that did not have historical
faculty
tenure
at these 23 institutions. For the remaining 12 schools that did not have historical
lists available,
I
looked at the top 75 U.S. institutions, the top 5 Canadian institu
tions, and
faculty
lists available,
we
looked at the top 75 U.S. institutions, the top 5 Canadian institu
-
the top 5 European institutions to locate anyone who went up for tenure at a top
35 U.S.
tions, and
the top 5 European institutions to locate anyone who went up for tenure at a top
school and then moved to another institution.
I
also checked economists’ CVs at
the ma
-
35 U.S.
school and then moved to another institution.
We
also checked economists’ CVs at
jor Federal Reserve Boards and other large research institutes, such as Mathemat
ica, in the
the ma
jor Federal Reserve Boards and other large research institutes, such as Mathemat
-
U.S. While there might still be a sample selection problem,
I
show in Section
3.2.1
that
the
ica, in the
U.S. While there might still be a sample selection problem,
we
show in Section
results
are
robust
to
using
only
the
sample
for
which
I
have
historical
faculty lists.
3.2.1
that
the
results
are
robust
to
using
only
the
sample
for
which
we
have
historical
From individuals’ CVs,
I
code where and when they received their PhDs, their em-
faculty lists.
From individuals’ CVs,
we
code where and when they received their PhDs, their em-
ployment and publication history, and their primary and secondary fields. When looking
ployment and publication history, and their primary and secondary fields. When looking
Copy
Copied
Copy
Copied
at the relationship between publications and tenure in the main analysis,
I
only include
at the relationship between publications and tenure in the main analysis,
we
only include
papers that were published up to and including the year an individual goes up for tenure.
Book chapters are not included in the paper count. In a robustness check, I include papers
that were published one and two years after tenure.
4
4
The list of institutions are taken from the RePEc/IDEAS Economics Department rankings. The list of
The list of institutions are taken from the RePEc/IDEAS Economics Department rankings. The list of
Copy
Copied
Copy
Copied
schools included can be found in Appendix
A
schools included can be found in Appendix
C.
5
5
Business and policy schools might also value teaching differently and put weight on different types of
Business and policy schools might also value teaching differently and put weight on different types of
journals.
journals.
5
5
Copy
Copied
Copy
Copied
To control for the quality of a person’s publications,
I
primarily use the “AER equiv-
papers that were published up to and including the year an individual goes up for tenure.
Book chapters are not included in the paper count. In a robustness check, we include
papers that were published one and two years after tenure.
To control for the quality of a person’s publications,
we
primarily use the “AER equiv-
alent” ranking measure developed by Kalaitzidakis et al. (2003). This measure converts
alent” ranking measure developed by Kalaitzidakis et al. (2003). This measure converts
Copy
Copied
Copy
Copied
journal publications into their equivalent number of American Economic Review papers
journal publications into their equivalent number of American Economic Review papers
.
6
6
Copy
Copied
Copy
Copied
.
Less than 10% of journal articles cannot be converted because the journal does not appear
Less than 10% of journal articles cannot be converted because the journal does not appear
Copy
Copied
Copy
Copied
in the ranking. In these cases
I
give the publication a ranking of zero.
in the ranking. In these cases
we
give the publication a ranking of zero.
7
7
Using the AER-equivalent measure instead of a list journal rank allows for different
Using the AER-equivalent measure instead of a list journal rank allows for different
distances between journal ranks and for multiple journals to hold the same rank. For
distances between journal ranks and for multiple journals to hold the same rank. For
example, the top field journals can all hold the same rank. Other journal rankings force
example, the top field journals can all hold the same rank. Other journal rankings force
a ranking among these even though the journals might count the same amount toward
a ranking among these even though the journals might count the same amount toward
Copy
Copied
Copy
Copied
tenure
depending
on
one’s
field.
For
robustness,
I
replace
this
paper
quality
measure
tenure
depending
on
one’s
field.
For
robustness,
we
replace
this
paper
quality
measure
with the RePEc/IDEAS ranking of economics journals in Section 3.2.2.
with the RePEc/IDEAS ranking of economics journals in Section 3.2.2.
Copy
Copied
Copy
Copied
Finally,
I
include citations, measured in 2015, of pre-tenure papers as a control vari
able.
Finally,
we
include citations, measured in 2015, of pre-tenure papers as a control vari
-
These citations were scraped from Google Scholar.
able.
These citations were scraped from Google Scholar.
I
supplement this dataset with results from a survey designed to measure individu
als’
We
supplement this dataset with results from a survey designed to measure individu
-
beliefs about the returns to various types of papers. The survey also contains informa
tion
als’
beliefs about the returns to various types of papers. The survey also contains informa
-
on how frequently individuals present their papers.
The exact questions and nature
of the
tion
on how frequently individuals present their papers.
The exact questions and nature
survey are discussed in greater detail in Section 4.
of the
survey are discussed in greater detail in Section 4.
2.2 Construction of Tenure
2.2 Construction of Tenure
Copy
Copied
Copy
Copied
To determine whether someone received tenure,
I
follow the guidelines on each school’s
To determine whether someone received tenure,
we
follow the guidelines on each school’s
website (as of 2015) as to when tenure decisions are made. The majority of schools require
website (as of 2015) as to when tenure decisions are made. The majority of schools require
Copy
Copied
Copy
Copied
faculty to apply for tenure 7 years after their initial appointment.
I
therefore consider
faculty to apply for tenure 7 years after their initial appointment.
We
therefore consider
years 6-8 to be the “tenure window” in which someone applies for tenure to account for
years 6-8 to be the “tenure window” in which someone applies for tenure to account for
Copy
Copied
Copy
Copied
people who go up for tenure early or late (because of a leave of absence, for example).
I
as
-
people who go up for tenure early or late (because of a leave of absence, for example).
sume that an individual is denied tenure if s/he moves to a university ranked 5
positions
We
as
sume that an individual is denied tenure if s/he moves to a university ranked 5
below the initial institution during the tenure window. Similarly,
I
assume
that an indi
-
positions
below the initial institution during the tenure window. Similarly,
we
assume
vidual is denied tenure if
he moves from academia to industry during the
tenure window.
that an indi
vidual is denied tenure if
s/
he moves from academia to industry during the
Defining tenure in this way accounts for the fact that some people switch
institutions 2-3
tenure window.
Defining tenure in this way accounts for the fact that some people switch
years after their initial appointment, not because they were denied tenure
but for personal
institutions 2-3
years after their initial appointment, not because they were denied tenure
preferences, and that some people might choose to move to a comparable school around
the time of tenure even though they were offered tenure at their original institution. For
6
6
The American Economic Review is regarded as one of the top journals in economics. Most journal
The American Economic Review is regarded as one of the top journals in economics. Most journal
publications are therefore converted to be some fraction of an AER paper.
publications are therefore converted to be some fraction of an AER paper.
7
7
Copy
Copied
Copy
Copied
If someone does not have any solo or coauthored papers,
I
set the relevant journal ranking to zero and
If someone does not have any solo or coauthored papers,
we
set the relevant journal ranking to zero and
include a dummy variable indicating that the individual has no solo (or coauthored) papers. This enables
include a dummy variable indicating that the individual has no solo (or coauthored) papers. This enables
Copy
Copied
Copy
Copied
me
to keep using the full sample.
us
to keep using the full sample.
6
6
Copy
Copied
Copy
Copied
example,
someone who moves from MIT to Harvard after 7 years was
presumably offered
but for personal preferences, and that some people might choose to move to a comparable
tenure at MIT but chose to move to Harvard for other reasons.
school around the time of tenure even though they were offered tenure at their original
institution. For
example,
someone who moves from MIT to Harvard after 7 years was
presumably offered
tenure at MIT but chose to move to Harvard for other reasons.
As mentioned, a person who moves 5 or fewer years after his or her initial appointment
As mentioned, a person who moves 5 or fewer years after his or her initial appointment
is not assumed to have been denied tenure since s/he moved before the tenure window
is not assumed to have been denied tenure since s/he moved before the tenure window
Copy
Copied
Copy
Copied
starts.
If someone moves before the tenure window,
I
use the second institution they
were
starts.
If someone moves before the tenure window,
we
use the second institution they
at to determine tenure.
For example,
if a person’s first job is at University A but s/he
were
at to determine tenure.
For example,
if a person’s first job is at University A but s/he
moves to University B after three years,
I
use University B as the tenure institution but
moves to University B after three years,
we
use University B as the tenure institution but
do not start the tenure clock over.
I
do not restart the clock because the data shows that
do not start the tenure clock over.
We
do not restart the clock because the data shows that
in over 80% of cases, the individual still appears to go up for tenure within 8 years of his
in over 80% of cases, the individual still appears to go up for tenure within 8 years of his
Copy
Copied
Copy
Copied
or her appointment at the first institution. However,
I
do extend this tenure clock in a
or her appointment at the first institution. However,
we
do extend this tenure clock in a
robustness check.
robustness check.
Individuals who move from an academic institution into industry before the tenure
Individuals who move from an academic institution into industry before the tenure
window are excluded from the sample.
window are excluded from the sample.
2.3 Summary Statistics
2.3 Summary Statistics
Table 1 presents summary statistics of the data. Approximately 68% of the full sample
Table 1 presents summary statistics of the data. Approximately 68% of the full sample
received tenure, but this masks a stark difference between men and women. Only 52% of
received tenure, but this masks a stark difference between men and women. Only 52% of
women received tenure while 73% of men did.
women received tenure while 73% of men did.
Total Papers,Solo-authored, andCoauthoredare the number of papers in each group that
Total Papers,Solo-authored, andCoauthoredare the number of papers in each group that
an individual had published by the time of tenure. These publication counts do not in-
an individual had published by the time of tenure. These publication counts do not in-
clude books or book chapters. Papers published in non-economics journals (such as a
clude books or book chapters. Papers published in non-economics journals (such as a
political science journal) are included but receive a ranking of 0 (the lowest ranking). The
political science journal) are included but receive a ranking of 0 (the lowest ranking). The
results are robust to excluding publications in non-economics journals.
results are robust to excluding publications in non-economics journals.
There is no statistically significant difference in the number of papers that men and
There is no statistically significant difference in the number of papers that men and
women produce. Panel B looks at differences in the quality of papers. Men are no more
women produce. Panel B looks at differences in the quality of papers. Men are no more
Copy
Copied
Copy
Copied
likely to publish their papers in
"
Top 5
"
journals (American Economic Review, Economet-
likely to publish their papers in
“
Top 5
”
journals (American Economic Review, Economet-
rica, Journal of Political Economy, Quarterly Journal of Economics, and The Review of
rica, Journal of Political Economy, Quarterly Journal of Economics, and The Review of
Economic Studies) than women. The only statistically significant productivity difference
Economic Studies) than women. The only statistically significant productivity difference
is that men tend to publish their coauthored papers in slightly higher-ranking journals.
is that men tend to publish their coauthored papers in slightly higher-ranking journals.
Specifically, men’s coauthored papers have an average ranking of 0.34 AER-equivalents
Specifically, men’s coauthored papers have an average ranking of 0.34 AER-equivalents
Copy
Copied
Copy
Copied
while women’s coauthored papers have an average ranking of 0.30 AER-equivalents.
I
while women’s coauthored papers have an average ranking of 0.30 AER-equivalents.
We
therefore control for the quality of papers, measured using the AER-equivalent ranking as
therefore control for the quality of papers, measured using the AER-equivalent ranking as
well as average citations, throughout the analysis.
well as average citations, throughout the analysis.
Copy
Copied
Copy
Copied
7
Panel C displays differences in coauthoring patterns between men and women.Num-
Panel C displays differences in coauthoring patterns between men and women.Num-
ber Unique CAsis the number of unique coauthors an individual has had by tenure. Men
ber Unique CAsis the number of unique coauthors an individual has had by tenure. Men
Copy
Copied
Copy
Copied
7
and women have roughly the same number of coauthors but there are some differences in
and women have roughly the same number of coauthors but there are some differences in
the types of people men and women coauthor with. For example, women are less likely to
the types of people men and women coauthor with. For example, women are less likely to
coauthor with senior faculty and more likely to coauthor with other assistant professors.
coauthor with senior faculty and more likely to coauthor with other assistant professors.
This could in part be driven by the fact that they are also more likely to coauthor with
This could in part be driven by the fact that they are also more likely to coauthor with
other women, many of whom are also junior professors.
other women, many of whom are also junior professors.
Copy
Copied
Copy
Copied
For
illustrative
purposes,
I
plot
the
number
of
women
and
men
who
have
various
For
illustrative
purposes,
we
plot
the
number
of
women
and
men
who
have
various
combinations of solo and coauthored papers in Figure 1,
as well as the average proba
-
combinations of solo and coauthored papers in Figure 1,
as well as the average proba
bility
bility
of receiving tenure for each paper combination in Figure 2. Most men and women
of receiving tenure for each paper combination in Figure 2. Most men and women
have
have
a similar combination of solo and coauthored papers. Figure 2 illustrates that in
-
a similar combination of solo and coauthored papers. Figure 2 illustrates that in
dividu
-
dividu
als with a large number of either solo or coauthored papers are likely to receive
als with a large number of either solo or coauthored papers are likely to receive
tenure.
tenure.
However,
Panel A suggests that women with a higher fraction of their papers that
However,
Panel A suggests that women with a higher fraction of their papers that
are
are
solo-authored have a better chance of receiving tenure than women with a mix of solo
solo-authored have a better chance of receiving tenure than women with a mix of solo
and coauthored papers.
I
examine this claim formally in the next section.
and coauthored papers.
We
examine this claim formally in the next section.
3 Empirical Strategy and Results
3 Empirical Strategy and Results
3.1 Main Results
3.1 Main Results
Copy
Copied
Copy
Copied
I
show three main results.
I
first establish that a significant tenure gap exists between
We
show three main results.
We
first establish that a significant tenure gap exists between
men and women.
I
then show that the gap becomes more pronounced the more women
men and women.
We
then show that the gap becomes more pronounced the more women
coauthor, and that women who solo-author all of their papers have comparable tenure
coauthor, and that women who solo-author all of their papers have comparable tenure
Copy
Copied
Copy
Copied
rates to men.
Finally,
I
show that the gender of a woman’s coauthor matters.
Women
who
rates to men.
Finally,
we
show that the gender of a woman’s coauthor matters.
Women
coauthor with other women do not suffer a coauthor penalty.
who
coauthor with other women do not suffer a coauthor penalty.
3.1.1 The Tenure Gap
3.1.1 The Tenure Gap
Figure 3 plots the coefficient
Figure 3 plots the coefficient
ˆ
ˆ
β
β
1
1
from estimating
from estimating
T
T
if st
if st
=β
=β
1
1
TotPapers
TotPapers
i
i
+β
+β
2
2
TotPapers
TotPapers
2
2
Copy
Copied
Copy
Copied
i
+γ
+γ
′
′
Z
Z
i
i
+θ
+θ
f
f
+θ
+θ
s
s
+θ
+θ
t
t
+
+
if st
if st
(1)
(1)
separately for men and women using OLS. The dependent variable,T
separately for men and women using OLS. The dependent variable,T
if st
if st
, is an indicator
, is an indicator
that individualiin fieldfat schoolsreceives tenure in yeart.TotPapers
that individualiin fieldfat schoolsreceives tenure in yeart.TotPapers
i
i
is the number of
is the number of
papers (both coauthored and solo-authored) individualihas at the time he or she went up
papers (both coauthored and solo-authored) individualihas at the time he or she went up
for tenure. A quadratic in the number of papers is included to capture non-linearities in
for tenure. A quadratic in the number of papers is included to capture non-linearities in
how publications matter for tenure. The vector of individual-level controls,Z
how publications matter for tenure. The vector of individual-level controls,Z
i
i
, includes
, includes
Copy
Copied
Copy
Copied
8
average journal rank (measured as average AER-equivalents), the log of total citations,
average journal rank (measured as average AER-equivalents), the log of total citations,
Copy
Copied
Copy
Copied
the number of years it tookito go up for tenure, and the
average
number of coauthors
the number of years it tookito go up for tenure, and the
total
number of coauthors
oni’s
8
papers.
Tenure institution (θ
oni’s
papers.
Tenure institution (θ
s
s
), tenure year (θ
), tenure year (θ
t
t
), and field fixed effects (θ
), and field fixed effects (θ
f
f
Copy
Copied
Copy
Copied
) are also
) are also
included
included
as tenure standards likely vary over time and by field and department.
as tenure standards likely vary over time and by field and department.
The figure shows that a significant tenure gap exists between men and women even
The figure shows that a significant tenure gap exists between men and women even
after controlling for productivity, primary field, tenure institution, and tenure year. While
after controlling for productivity, primary field, tenure institution, and tenure year. While
an additional paper is correlated with a 13-16 percentage point increase in tenure proba-
an additional paper is correlated with a 13-16 percentage point increase in tenure proba-
bility for men and women, women are consistently 10-13 percentage points less likely to
bility for men and women, women are consistently 10-13 percentage points less likely to
receive tenure than men conditional on having written the same number and quality of
receive tenure than men conditional on having written the same number and quality of
papers. The lower intercept for women could stem from tenure committees starting with a
papers. The lower intercept for women could stem from tenure committees starting with a
lower prior about women’s ability. However, if all papers were clear signals of ability and
lower prior about women’s ability. However, if all papers were clear signals of ability and
tenure committees are Bayesian, we would expect the slope of the relationship between
tenure committees are Bayesian, we would expect the slope of the relationship between
papers and tenure to be steeper for women. Put differently, if men and women received
papers and tenure to be steeper for women. Put differently, if men and women received
equal credit for papers, the coefficient onTotPapers
equal credit for papers, the coefficient onTotPapers
i
i
should be significantly larger for
should be significantly larger for
women than for men.
women than for men.
Copy
Copied
Copy
Copied
I
provide a formal test for the difference in slopes for men and women in Column 1
of
We
provide a formal test for the difference in slopes for men and women in Column 1
Table 2, where
I
present the estimates from
of
Table 2, where
we
present the estimates from
T
T
if st
if st
=β
=β
1
1
TotPapers
TotPapers
i
i
+β
+β
2
2
fem
fem
i
i
+β
+β
3
3
(TotPapers
(TotPapers
i
i
×fem
×fem
i
i
) +β
) +β
4
4
TotPapers
TotPapers
2
2
Copy
Copied
Copy
Copied
i
+γ
+γ
′
′
Z
Z
i
i
+θ
+θ
f
f
+θ
+θ
s
s
+θ
+θ
t
t
+
+
if st
if st
(2)
(2)
Copy
Copied
Copy
Copied
This is similar to estimating equation 1 except that
I
interact total papers with a female
This is similar to estimating equation 1 except that
we
interact total papers with a female
dummy,fem
dummy,fem
i
i
rather than splitting the sample. There is no significant difference in the
rather than splitting the sample. There is no significant difference in the
marginal benefit of an additional paper to men and women.
marginal benefit of an additional paper to men and women.
3.1.2 The Tenure Gap and Paper Composition
3.1.2 The Tenure Gap and Paper Composition
Copy
Copied
Copy
Copied
To test whether coauthored papers matter differently for men and women,
I
separate
pa-
To test whether coauthored papers matter differently for men and women,
we
separate
pers into those that are solo-authored and those that are coauthored and estimate
pa
pers into those that are solo-authored and those that are coauthored and estimate
T
T
ifst
ifst
=β
=β
1
1
S
S
i
i
+β
+β
2
2
(fem
(fem
i
i
×S
×S
i
i
) +β
) +β
3
3
CA
CA
i
i
+β
+β
4
4
(fem
(fem
i
i
×CA
×CA
i
i
) +δ
) +δ
1
1
fem
fem
i
i
+γ
+γ
′
′
Z
Z
i
i
+θ
+θ
f
f
+θ
+θ
s
s
+θ
+θ
t
t
+
+
ifst
ifst
(3)
(3)
using OLS. Here,S
using OLS. Here,S
i
i
andCA
andCA
i
i
are the number of solo-authored and coauthored papers an
are the number of solo-authored and coauthored papers an
individual has at the time of tenure.
individual has at the time of tenure.
The results are presented in Table 2. An additional solo-authored paper is associated
The results are presented in Table 2. An additional solo-authored paper is associated
with a 9.7 percentage point increase in men’s tenure rates and a 15.4 percentage point
with a 9.7 percentage point increase in men’s tenure rates and a 15.4 percentage point
increase in women’s tenure rates (who start from a lower base tenure rate). If the lower
increase in women’s tenure rates (who start from a lower base tenure rate). If the lower
initial tenure rate for women is due to employers holding the belief that women are lower
initial tenure rate for women is due to employers holding the belief that women are lower
Copy
Copied
Copy
Copied
9
ability, it seems that the signals from solo papers begin to outweigh the employer’s prior.
ability, it seems that the signals from solo papers begin to outweigh the employer’s prior.
This is consistent with a model in which employers start with a lower prior about women
This is consistent with a model in which employers start with a lower prior about women
Copy
Copied
Copy
Copied
9
and update as they receive clear signals about a woman’s ability, giving women full credit
and update as they receive clear signals about a woman’s ability, giving women full credit
for this solo work. This is further discussed in the next section.
for this solo work. This is further discussed in the next section.
Copy
Copied
Copy
Copied
If coauthored papers are an
unclear signal of ability, an employer must make a judg-
If coauthored papers are an
ment call as to how much each coauthor contributed to the paper which could lead to
differential attribution of credit. Indeed, we see that while an additional coauthored pa-
per helps both men and women, men benefit much more than women. Men’s tenure
rates increase by 8.2 percentage points when they produce a coauthored paper whereas
women’s increase by 5.6 percentage points.
However, the fact
Saved diffs
Original text
Open file
Gender Differences in Recognition for Group Work Heather Sarsons ∗ February 4, 2019 Abstract Does gender influence how credit for group work is allocated? Using data from academic economists’ CVs, I test whether coauthored and solo-authored publications matter differently for tenure for men and women. Because coauthors are listed alpha- betically in economics, coauthored papers do not provide specific information about each contributor’s skills or ability. Solo-authored papers, on the other hand, provide a relatively clear signal of ability. I find that conditional on publication quality and other observables, men are tenured at roughly the same rate regardless of whether they coauthor or solo-author. Women, however, become less likely to receive tenure the more they coauthor. The result is most pronounced for women coauthoring with men and less pronounced among women who coauthor with other women. An on- line experiment finds similar patterns when women perform male-stereotyped tasks. However, women receive equal credit for joint work with men when the task is per- ceived to be gender-neutral. Taken together, the results suggest that gender and stereo- types influence the allocation of credit for group work. ∗ I especially thank Roland Fryer, Claudia Goldin, Larry Katz, David Laibson, and Amanda Pallais for their guidance and encouragement. I also thank Mitra Akhtari, Amitabh Chandra, John Coglianese, Oren Danieli, Ellora Derenoncourt, Florian Ederer, Ben Enke, Raissa Fabregas, Nicole Fortin, Peter Ganong, Ed- ward Glaeser, Siri Isaksson, Sara Lowes, Rob McMillan, Eduardo Montero, Gautam Rao, Alex Segura, Nihar Shah, Peter Tu, Justin Wolfers, and participants at SOLE 2016, the Early Behavioural Economics Conference, the Harvard Business School Gender Initiative, and the Harvard Inequality Seminar for helpful comments and suggestions. This paper is intentionally solo-authored. 1 1 Introduction Do employers use gender when allocating credit for group work, particularly when in- dividual contributions are unobserved? Organizations increasingly rely on group work for production (Lazear and Shaw, 2007), yet there is little empirical evidence document- ing how credit for group work is allocated. Unless employers can perfectly observe each worker’s contribution to the team’s output, they must decide how to allocate credit with- out having full information as to what each worker did. This could leave room for demo- graphic characteristics, such as gender, to influence the allocation of credit. In this paper, I test whether uncertainty over an individual’s contribution to a project leads to differential attribution of credit that contributes to the gender promotion gap. In many industries, women are not only hired at lower rates than men are, they are also promoted at lower rates. 1 This paper explores whether gender differences in credit for group work exist and whether they explain part of the promotion gap. I primarily look at the tenure decisions of academic economists to test whether gender influences the allocation of credit for coauthored papers. Economics is a relevant setting as there is a large tenure gap between men and women, and because the amount of coau- thoring has risen dramatically in recent years (Ginther and Kahn, 2004; Hammermesh, 2013). Using data from economists’ CVs, I track individuals’ career trajectories and com- pare whether the trajectory is different for individuals who coauthor versus solo-author, and whether there is a difference by gender. Within economics, I find that men and women who solo-author most of their work have similar tenure rates conditional on a proxy for the quality of papers. However, an additional coauthored paper is correlated with an 8.2% increase in tenure probability for men but only a 5.6% increase for women. This gap is significantly less pronounced for women who coauthor with women, suggesting that the attribution of credit is related to the gender mix of coauthors. Furthermore, a man who coauthors is no less likely to receive tenure than a comparable man who solo-authors even though there is presumably more uncertainty as to how much work he did. A counterfactual exercise suggests that this difference in credit allocation explains 60% of the unconditional gender gap in tenure rates and 84% of the gap that remains after controlling for average paper quality, citations, tenure and PhD institution ranks, and field. To ensure that I am not picking up on ability differences between men and women, I control for the quality of papers using both journal rankings and citations, allowing for 1 Blau and DeVaro (2007), for example, find that across jobs, women are less likely to be promoted than men conditional on worker’s performance and ability ratings. In the UK, female managers are nearly 40% less likely to be promoted than male managers (Elmins et al. 2016). 2 a comparison of men and women with similar research portfolios. The results are also robust to including other individual-level controls such as length of time to tenure and the seniority of one’s coauthors, as well as tenure year, tenure institution, and primary field fixed effects. I argue that these results are most consistent with a story of women receiving less credit for their joint work with men because of bias. To show this, I first use current CV and citation data to compare the productivity of men and women who did and did not receive tenure at the institution where they initially went up for tenure. While the estimates are imprecise, I find suggestive evidence that women who coauthor and are denied tenure produce more solo-authored papers that publish in high-ranking journals than men who are denied tenure. Data on citations show a similar result. I then rule out several alternative explanations for the empirical patterns. For example, several papers have demonstrated that selection into coauthorship in economics is not random. 2 I test for selection into coauthorship and do not find any evidence that women coauthor with high ability or more senior men. I also look at the timing of coauthorship and find no evidence that women begin coauthoring if they have a slower start to their careers. The empirical patterns are also inconsistent with taste-basted discrimination. Because the CV data does not allow me to rule out the possibility that women actually contribute less to papers that are coauthored with men, I conduct an online experiment designed to test whether this drives credit allocation. In the experiment, I first hire indi- viduals to complete quizzes on topics that are either male or female-stereotyped. I then hire participants who act as “predictors” and are randomized into a solo treatment or a group treatment. Predictors in the solo treatment are shown two individual’s separate quiz scores while predictors in the group treatment are shown the combined score of two individuals. They are then asked to predict the performance of each participant on future quizzes. I find that in the group treatment, women are predicted to perform worse than their male counterparts for male-stereotyped quizzes, suggesting that participants making the predictions believe that women contributed less to the combined score (that is, they per- formed worse). However, if pairs performed a female-stereotyped quiz, women and men are given equal credit. To understand whether these results are driven by participants’ beliefs about the ability distributions of men and women, I randomly provide some par- 2 Boschini and Sjögren (2007) test whether coauthorship patterns in economics are gender neutral. They find that women are more likely to solo-author than men and that gender homophily in coauthoring in- creases with women’s representation in a field. Garcia and Sherman (2015) argue that the alphabetical pub- lishing norms in economics influence both the types of people authors work with and the types of projects they will work on. 3 ticipants with the distribution of scores on the initial quiz by gender. Women appear to be given equal credit in the female-stereotyped quiz because participants view it as being gender-neutral. That is, they do not realize that women tend to outperform men. Show- ing participants the gender distribution of scores corrects this belief and women are then predicted to have a better performance in future female-stereotyped quizzes. Surpris- ingly, this treatment does not affect the predicted performance gap for women and men performing male-stereotyped tasks. This paper replicates and builds off of the results in Sarsons (2017), which shows the basic correlational patterns between paper composition and tenure. In this paper, I repli- cate the results using more data and then use the C.V. data and an experiment to establish a channel through which gender influences the allocation of credit. The paper also relates to a large literature seeking to understand difference in labor market outcomes between men and women. Factors such as productivity, personality and behavioural differences (such as competition aversion), and fertility preferences have been shown to explain some differ- ences in career choice and progression. 3 In academia in particular, studies have pointed to both supply-side factors, including differences in subject matter interest (Dynan and Rouse, 1997) and the availability of role models (Hale and Regev, 2014; Carrell, Page, and West, 2010); demand-side factors, such as implicit bias (Milkman, Akinola, and Chugh, 2015; Moss-Racusin et al., 2012); and institutional factors (Antecol et al., 2018). This paper directly tests whether the differential treatment of work output contributes to the gender gap. The remainder of the paper is organized as follows. Section 2 describes the data and shows that a tenure gap exists between male and female economists. In Section 3, I show that the tenure gap closes as women produce more solo-authored papers but does not close as they produce more coauthored papers. Women have a consistently lower proba- bility of tenure for each additional coauthored paper than men. I show that the results are robust to accounting for attrition, and to using different journal rankings and definitions of tenure. In Section 4, I argue that the results are in line with a story in which women receive less credit for joint work with men. I test alternative explanations of the relation- ship between coauthorship and tenure and argue that none can fully explain the observed empirical patterns. Section 5 discusses the structure and results of the experiment. Section 6 concludes. 3 There is a large literature documenting gender differences in productivity, attitudes toward different types of work, and family choices. See, for example, Niederle and Vesterlund (2007), Antecol et al. (2016), Ceci et al. (2014), and Ginther and Kahn (2004). 4 2 Data To examine the relationship between paper composition and tenure, I construct a dataset using the CVs of economists who came up for tenure between 1985 and 2014 at one of the top 35 U.S. PhD-granting universities 4 . The academic progression documented in the CVs makes it possible to evaluate the relationship between an individual’s research output and career progression. I can then compare the degree of collaborative work and reward for that work, and compare these results for men versus women. 2.1 Sample Selection and Data Overview I include only PhD-granting institutions in the sample as tenure evaluation at these schools is primarily based on research output, of which I have a clear measure. Other institutions like liberal arts colleges place greater weight on teaching ability for tenure, something that I cannot measure. I exclude business and public policy schools for similar reasons. 5 It is reasonable to assume that the top 35 economics departments in the U.S. emphasize research which is measured by the number and quality of papers one produces. One problem in collecting tenure information is that the CVs of individuals who went up for tenure, were denied it, and left to industry or government are difficult to find, lead- ing to a sample selection problem. To deal with this issue, I collected historical faculty lists from 23 of the 35 schools and locate over 90% of faculty who had ever gone up for tenure at these 23 institutions. For the remaining 12 schools that did not have historical faculty lists available, I looked at the top 75 U.S. institutions, the top 5 Canadian institutions, and the top 5 European institutions to locate anyone who went up for tenure at a top 35 U.S. school and then moved to another institution. I also checked economists’ CVs at the ma- jor Federal Reserve Boards and other large research institutes, such as Mathematica, in the U.S. While there might still be a sample selection problem, I show in Section 3.2.1 that the results are robust to using only the sample for which I have historical faculty lists. From individuals’ CVs, I code where and when they received their PhDs, their em- ployment and publication history, and their primary and secondary fields. When looking at the relationship between publications and tenure in the main analysis, I only include papers that were published up to and including the year an individual goes up for tenure. Book chapters are not included in the paper count. In a robustness check, I include papers that were published one and two years after tenure. 4 The list of institutions are taken from the RePEc/IDEAS Economics Department rankings. The list of schools included can be found in Appendix A 5 Business and policy schools might also value teaching differently and put weight on different types of journals. 5 To control for the quality of a person’s publications, I primarily use the “AER equiv- alent” ranking measure developed by Kalaitzidakis et al. (2003). This measure converts journal publications into their equivalent number of American Economic Review papers 6 . Less than 10% of journal articles cannot be converted because the journal does not appear in the ranking. In these cases I give the publication a ranking of zero. 7 Using the AER-equivalent measure instead of a list journal rank allows for different distances between journal ranks and for multiple journals to hold the same rank. For example, the top field journals can all hold the same rank. Other journal rankings force a ranking among these even though the journals might count the same amount toward tenure depending on one’s field. For robustness, I replace this paper quality measure with the RePEc/IDEAS ranking of economics journals in Section 3.2.2. Finally, I include citations, measured in 2015, of pre-tenure papers as a control variable. These citations were scraped from Google Scholar. I supplement this dataset with results from a survey designed to measure individuals’ beliefs about the returns to various types of papers. The survey also contains information on how frequently individuals present their papers. The exact questions and nature of the survey are discussed in greater detail in Section 4. 2.2 Construction of Tenure To determine whether someone received tenure, I follow the guidelines on each school’s website (as of 2015) as to when tenure decisions are made. The majority of schools require faculty to apply for tenure 7 years after their initial appointment. I therefore consider years 6-8 to be the “tenure window” in which someone applies for tenure to account for people who go up for tenure early or late (because of a leave of absence, for example). I as- sume that an individual is denied tenure if s/he moves to a university ranked 5 positions below the initial institution during the tenure window. Similarly, I assume that an indi- vidual is denied tenure if he moves from academia to industry during the tenure window. Defining tenure in this way accounts for the fact that some people switch institutions 2-3 years after their initial appointment, not because they were denied tenure but for personal preferences, and that some people might choose to move to a comparable school around the time of tenure even though they were offered tenure at their original institution. For 6 The American Economic Review is regarded as one of the top journals in economics. Most journal publications are therefore converted to be some fraction of an AER paper. 7 If someone does not have any solo or coauthored papers, I set the relevant journal ranking to zero and include a dummy variable indicating that the individual has no solo (or coauthored) papers. This enables me to keep using the full sample. 6 example, someone who moves from MIT to Harvard after 7 years was presumably offered tenure at MIT but chose to move to Harvard for other reasons. As mentioned, a person who moves 5 or fewer years after his or her initial appointment is not assumed to have been denied tenure since s/he moved before the tenure window starts. If someone moves before the tenure window, I use the second institution they were at to determine tenure. For example, if a person’s first job is at University A but s/he moves to University B after three years, I use University B as the tenure institution but do not start the tenure clock over. I do not restart the clock because the data shows that in over 80% of cases, the individual still appears to go up for tenure within 8 years of his or her appointment at the first institution. However, I do extend this tenure clock in a robustness check. Individuals who move from an academic institution into industry before the tenure window are excluded from the sample. 2.3 Summary Statistics Table 1 presents summary statistics of the data. Approximately 68% of the full sample received tenure, but this masks a stark difference between men and women. Only 52% of women received tenure while 73% of men did. Total Papers,Solo-authored, andCoauthoredare the number of papers in each group that an individual had published by the time of tenure. These publication counts do not in- clude books or book chapters. Papers published in non-economics journals (such as a political science journal) are included but receive a ranking of 0 (the lowest ranking). The results are robust to excluding publications in non-economics journals. There is no statistically significant difference in the number of papers that men and women produce. Panel B looks at differences in the quality of papers. Men are no more likely to publish their papers in "Top 5" journals (American Economic Review, Economet- rica, Journal of Political Economy, Quarterly Journal of Economics, and The Review of Economic Studies) than women. The only statistically significant productivity difference is that men tend to publish their coauthored papers in slightly higher-ranking journals. Specifically, men’s coauthored papers have an average ranking of 0.34 AER-equivalents while women’s coauthored papers have an average ranking of 0.30 AER-equivalents. I therefore control for the quality of papers, measured using the AER-equivalent ranking as well as average citations, throughout the analysis. Panel C displays differences in coauthoring patterns between men and women.Num- ber Unique CAsis the number of unique coauthors an individual has had by tenure. Men 7 and women have roughly the same number of coauthors but there are some differences in the types of people men and women coauthor with. For example, women are less likely to coauthor with senior faculty and more likely to coauthor with other assistant professors. This could in part be driven by the fact that they are also more likely to coauthor with other women, many of whom are also junior professors. For illustrative purposes, I plot the number of women and men who have various combinations of solo and coauthored papers in Figure 1, as well as the average proba- bility of receiving tenure for each paper combination in Figure 2. Most men and women have a similar combination of solo and coauthored papers. Figure 2 illustrates that in- dividuals with a large number of either solo or coauthored papers are likely to receive tenure. However, Panel A suggests that women with a higher fraction of their papers that are solo-authored have a better chance of receiving tenure than women with a mix of solo and coauthored papers. I examine this claim formally in the next section. 3 Empirical Strategy and Results 3.1 Main Results I show three main results. I first establish that a significant tenure gap exists between men and women. I then show that the gap becomes more pronounced the more women coauthor, and that women who solo-author all of their papers have comparable tenure rates to men. Finally, I show that the gender of a woman’s coauthor matters. Women who coauthor with other women do not suffer a coauthor penalty. 3.1.1 The Tenure Gap Figure 3 plots the coefficient ˆ β 1 from estimating T if st =β 1 TotPapers i +β 2 TotPapers 2 +γ ′ Z i +θ f +θ s +θ t + if st (1) separately for men and women using OLS. The dependent variable,T if st , is an indicator that individualiin fieldfat schoolsreceives tenure in yeart.TotPapers i is the number of papers (both coauthored and solo-authored) individualihas at the time he or she went up for tenure. A quadratic in the number of papers is included to capture non-linearities in how publications matter for tenure. The vector of individual-level controls,Z i , includes average journal rank (measured as average AER-equivalents), the log of total citations, the number of years it tookito go up for tenure, and the average number of coauthors 8 oni’s papers. Tenure institution (θ s ), tenure year (θ t ), and field fixed effects (θ f ) are also included as tenure standards likely vary over time and by field and department. The figure shows that a significant tenure gap exists between men and women even after controlling for productivity, primary field, tenure institution, and tenure year. While an additional paper is correlated with a 13-16 percentage point increase in tenure proba- bility for men and women, women are consistently 10-13 percentage points less likely to receive tenure than men conditional on having written the same number and quality of papers. The lower intercept for women could stem from tenure committees starting with a lower prior about women’s ability. However, if all papers were clear signals of ability and tenure committees are Bayesian, we would expect the slope of the relationship between papers and tenure to be steeper for women. Put differently, if men and women received equal credit for papers, the coefficient onTotPapers i should be significantly larger for women than for men. I provide a formal test for the difference in slopes for men and women in Column 1 of Table 2, where I present the estimates from T if st =β 1 TotPapers i +β 2 fem i +β 3 (TotPapers i ×fem i ) +β 4 TotPapers 2 +γ ′ Z i +θ f +θ s +θ t + if st (2) This is similar to estimating equation 1 except that I interact total papers with a female dummy,fem i rather than splitting the sample. There is no significant difference in the marginal benefit of an additional paper to men and women. 3.1.2 The Tenure Gap and Paper Composition To test whether coauthored papers matter differently for men and women, I separate pa- pers into those that are solo-authored and those that are coauthored and estimate T ifst =β 1 S i +β 2 (fem i ×S i ) +β 3 CA i +β 4 (fem i ×CA i ) +δ 1 fem i +γ ′ Z i +θ f +θ s +θ t + ifst (3) using OLS. Here,S i andCA i are the number of solo-authored and coauthored papers an individual has at the time of tenure. The results are presented in Table 2. An additional solo-authored paper is associated with a 9.7 percentage point increase in men’s tenure rates and a 15.4 percentage point increase in women’s tenure rates (who start from a lower base tenure rate). If the lower initial tenure rate for women is due to employers holding the belief that women are lower ability, it seems that the signals from solo papers begin to outweigh the employer’s prior. This is consistent with a model in which employers start with a lower prior about women 9 and update as they receive clear signals about a woman’s ability, giving women full credit for this solo work. This is further discussed in the next section. If coauthored papers are an unclear signal of ability, an employer must make a judg- ment call as to how much each coauthor contributed to the paper which could lead to differential attribution of credit. Indeed, we see that while an additional coauthored pa- per helps both men and women, men benefit much more than women. Men’s tenure rates increase by 8.2 percentage points when they produce a coauthored paper whereas women’s increase by 5.6 percentage points. However, the fact that men benefit nearly as much from a coauthored paper as they do from a solo-authored paper is at odds with the story that employers are dividing credit for projects among authors. If employers do divide credit, not all men can get 100% of the credit, particularly for those papers coauthored with other men. 8 This result could point to an alternative mechanism. For example, if employers exhibit taste-based discrimination, they could use joint projects as an excuse to promote men over women. I discuss and test several such alternative stories in Section 4. The relationship between paper composition and tenure is summarized in Figure 4. This figure plots the relationship between the fraction of an individual’s papers that are solo-authored and tenure, controlling for the total number of papers, citations, journal quality, number of coauthors, and tenure institution, year, and field fixed effects. For men, it does not matter if one coauthors or solo-authors: tenure rates are comparable conditional on the quality of papers. Women who write all of their papers alone have similar tenure rates to men. However, women who coauthor all of their papers have an approximately 37% tenure rate, substantially lower than that of men who coauthor all of their papers ( 72%). The slope for women is ˆ β=0.780and is statistically significant at the 1% level (s.e.=0.184). 3.1.3 Does Coauthor Gender Matter? The probability of receiving tenure is not lower for all women who coauthor. In Table 3, I categorize coauthored papers into those written with only men, only women, or a mix of men and women: 8 It could be the case that because tenure committees are evaluating one person, they always assume that the man they evaluate deserves full credit for the paper (and we do not see the amount of credit they would have given to the other man). It is impossible to evaluate such theories with these data. 10 T if st =β 1 S i +β 2 (fem i ×S i ) +β 3 CAmale i +β 4 (fem×CAmale i ) +β 5 CAmix i +β 6 (fem×CAmix i ) +β 7 CAfem i +β 8 (fem i ×CAfem i ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + if st (4) As before,S i is the number of solo-authored papers individualihas at the time of tenure. CAfem i is the number of coauthored papers individualihas in which all of the coauthors are female. Similarly,CAmale i is the number of papersihas in which all of the coauthors are male andCAmix i is the number of papersihas in which the coauthors consist of men and women. The estimated coefficients on the interaction terms show that the negative relationship between coauthoring and tenure for women is driven almost entirely by papers that are coauthored with men. While a coauthored paper with another man is associated with an 8.7 percentage point increase in tenure probability for a man, it is associated with a 3.1 percentage point increase in tenure probability for a woman. 9 An additional paper with a woman, however, is associated with an 11.6 percentage point increase in tenure proba- bility for a woman. While this estimate is imprecise due to sample size, I can say that an additional coauthored paper with a woman has a more positive impact on tenure than an additional coauthored paper with a man. Any explanation as to why women have lower tenure rates than men when they coauthor must therefore be correlated with coauthor gender. The estimates are robust to including all of the control variables discussed earlier. 3.1.4 Counterfactual Analysis I conduct a counterfactual analysis to estimate how much of the gender gap in tenure rates can be explained by the different treatment of coauthored papers. I first estimate T if st =β 1 S i +β 2 CA i +δ 1 fem i +γ ′ Z i +θ f +θ s +θ t + if st (5) and use the estimates to predict the probability of tenure, ˆ T i , for everyone in the sample. I then let the female dummyfem i be 0 for everyone and predict tenure rates again (call this ̃ T i ). The difference ˆ T i − ̃ T i gives the gender gap in tenure rates conditional on all observable characteristics but not allowing for differences in the marginal impact of solo 9 These results again show the puzzling pattern that the amount of credit that is divided among male coauthors adds up to more than one. 11 and coauthored papers for men and women. 10 I then repeat this exercise using the estimates from equation 4, first letting the female dummy equal one and then predicting tenure rates again letting the female dummy (and therefore all of the interactions) equal zero. This second set of predicted tenure probabil- ities tells us what women’s predicted tenure rate would be if their papers were treated in the same way that men’s papers are treated. The unconditional gender gap in tenure rates is 22 percentage points. The conditional gap in tenure rates from equation 5 is approximately 16 percentage points. Thus, observ- able characteristics such as differences in time to tenure and paper quality account for about 27% of the gap. The results from using equation 4 to predict tenure probabilities suggest that the gap would close by a further 13.5 percentage points if men and women’s papers were treated similarly. The different assignment of credit thus accounts for ap- proximately 60% of the unconditional tenure gap and 84% of the conditional gap. 3.2 Robustness Checks One may be concerned that the results are a product of the types of productivity measures used or are affected by missing data. In this section, I show that the results are robust to using only the sample for which I have historical faculty lists, to using different journal rankings, to accounting for papers published shortly after tenure, and to using different measures of paper counts. 11 3.2.1 Attrition The results will be biased if the sample excludes individuals who are denied tenure and go into industry, government, or other institutions where I do not observe them. This would be particularly problematic if men who go to industry after being denied tenure disproportionately coauthored their papers. If this is true, I would be overestimating the benefit of coauthoring for men. I would have a similar problem if women who go to industry after being denied tenure typically wrote solo-authored papers. As discussed in Section 2.1, I attempted to find such individuals by searching institu- tions outside of the top 35 U.S. schools, federal reserves, and other research institutes. To 10 Interacting all variables except for the number of solo/coauthored papers with the female dummy does not substantially change the results. 11 In Appendix Table A1, I also test whether the results vary by school rank and over time. The estimates suggest that the coauthoring penalty is driven largely by schools outside of the top 10, although the esti- mates are imprecise. The coauthorship penalty is also stronger in later years but again the estimates are imprecise. 12 further allay concerns about sample selection, I run the analysis on the sample for which I received historical faculty lists. These lists allow me to track who went up for tenure and find them even if they left academia. The results, presented in Column 1 of Table 4, do not change when run on the sample for which there should be very few missing observations. The coefficient on theFemale×Coauthoredinteraction is significant only at the 10% level due to the smaller sample, but the direction and magnitude do not change. 3.2.2 Journal Rankings In the main analysis, I use a flexible journal ranking that allows multiple journals to hold the same rank. However, while the economics profession largely agrees on what the “top” journals are, rankings of field journals or lower-tier journals have changed over time. In Columns 2-4 of Table 4, I show that the results are robust to using three alternative journal ranking metrics as controls. In Column 2, I use the current RePEc-IDEAS journal ranking. This ranking forces a linear relationship between journals and tenure but also contains a larger number of journals. The main results do not change when using this ranking. In Column 3, I allow journal rankings to change over time. I use historical rankings of economics journals (drawn from Laband and Piette, 1994, and combined with current rankings) and match each paper with its journal ranking at the time it was published. Us- ing these rankings accounts for journals moving in rank over time as well as new journals being added. The coefficient on theFemale×Coauthoredinteraction is slightly smaller but the same pattern persists. An additional coauthored paper is associated with an 8.1 percentage point increase in tenure probability for men and a 5.6 percentage point increase for women. In section 4, I also separate papers into "Top 5s" and "non-Top 5s". Finally, in Column 4, I divide the AER-equivalent measure into deciles and control for the number of solo and coauthored papers an individual has in each decile. For example, if an individual publishes one solo-authored paper in the AER and another in the lowest- rank journal, she will have one paper in the tenth bin, one in the first bin, and zero in the others. Thus, instead of having a single coauthored or solo-authored paper rank control, I include ten variables controlling for the quality of an individual’s solo-authored papers (the number of solo papers in each AER-equivalent bin) and ten variables controlling for the quality of an individual’s coauthored paper (the number of coauthored papers in each AER-equivalent bin). Again, the results hold. 13 3.2.3 Tenure Definition In the main analysis, I only consider papers that were published up to and including the year that an individual goes up for tenure. If an individual goes up for tenure in 1995, for example, papers published in 1996 are not included in the paper count even though they may have been “revise and resubmits” at the time of tenure. This could affect the results if men who coauthor have several promising unpublished papers at the time of tenure but women who coauthor do not, in which case I am not actually comparing people with similar publication records. In Columns 5 and 6 of Table 4, I include papers that are published one and two years after a person’s tenure year in the paper count variables. The magnitude of the coefficients are smaller but the results do not change: women continue to benefit less from coauthored papers than men do. 3.2.4 Paper Count Variable While I control for journal quality, the main independent variables (number of solo and coauthored papers) may not accurately reflect how tenure committees decide on tenure cases. For example, institutions might trade off the quantity and quality of papers in different ways. In Column 7 of Table 4, I use an alternative measure for the number of papers. Specifically, after converting each publication to its AER-equivalent, I add up the AER-equivalent measure to give the total number of "AERs" an individual has at the time of tenure. For example, if an individual published two solo-authored papers and one is worth 0.25 AERs and the other worth 0.8 AERs, the individual will have 1.05 solo- authored AERs at the time of tenure. Again, the patterns are the same. An additional coauthored “AER” paper is correlated with an 8.9 percentage point increase in a man’s tenure probability but a 5.3 percentage point increase in a woman’s tenure probability. 3.3 Testing Against Other Disciplines and Coauthoring Conventions Many disciplines use different coauthoring conventions, such as listing authors in order of contribution. However, these disciplines differ on several other dimensions, such as the fraction of women in the disciplines and what is most important for tenure (publications, grants, conference proceedings, etc.). In Appendix A, I conduct the same analysis for a sample of sociologists, a discipline that order authors by contribution. The sample and results are discussed in more detail in the Appendix, but I do not find evidence of women being penalized for coauthoring. What matters is being first author on a paper: being first author is correlated with a 5% increase in tenure probability for both men and women. 14 Because sociology differs from economics in many ways, though, it is difficult to interpret whether these results suggest that ordering authors by contribution helps eliminate bias or whether the larger presence of women helps to eliminate it. 4 Channels The previous section established three facts: 1. For very few papers, women have a lower tenure probability than men; 2. As women produce more solo-authored papers, their tenure probability converges to that of comparable men; 3. Women benefit less than men from work coauthored with men. There are several explanations for these patterns. In this section, I argue that the results are most consistent with a story of women receiving less credit for their joint work with men rather than a story of women contributing less when they work with men. I assume that tenure committees begin with the prior that women are on average lower ability than men, and that solo-authored papers provide a clear signal of one’s ability whereas coauthored papers provide an unclear signal. Employers then misattribute credit for work produced by a man and a woman as the man is assumed to be higher ability. I first test the claim by comparing the productivity of men and women who were de- nied tenure. I then explore and rule out several threats to this story. Specifically, I test for ability and preference-based sorting, women receiving less exposure by presenting less, and taste-based discrimination. Finally, I present evidence from an experiment designed to shut down the possibility that women put in less effort when working with men, and find that even in this context, women receive less credit than men when they perform a stereotypically male task. However, women receive at least as much credit as men when they perform a stereotypically female task. 4.1 Do Men Get the Credit or Do Women Contribute Less? If tenure committees hold the prior that women are lower ability than men and if solo- authored papers provide clear signals of ability, we will see differences in tenure rates for men and women with few publications. However, additional solo-authored publications of the same quality will have a larger marginal impact on a woman’s tenure probability 15 than a man’s. As these clear signals begin to dominate the committee’s prior, tenure rates between men and women will converge. If committees are biased toward giving men more credit for work coauthored with women, we would expect to see the following. Assuming that there is some fixed amount of credit that can be given for a paper, a man will benefit more than a woman from joint work between them. In addition, both men and women will benefit more from their coau- thored work with women than their coauthored work with men, as two men who coau- thor will be assumed to have contributed similarly while a woman will be assumed to have contributed less. These two claims largely play out in the data. Table 2 shows that the marginal solo- authored paper helps women more than it helps men as they start from a lower baseline tenure rate. Table 3 shows that men benefit the most from coauthoring with women (an increase in tenure probability of 9.7% when coauthoring with a woman vs. 8.7% when coauthoring with a man) although this difference is insignificant. Similarly, women ben- efit more from coauthoring with other women than with men. One result that it is in- consistent with a story of credit allocation is the fact that the total amount of credit that can be allocated, at least when all coauthors are men, seems to add up to more than one. Men benefit as much from a coauthored paper as they do from a solo-authored paper, suggesting that tenure committees are either making a mistake when dividing credit (for example, each committee assumes that the male author under consideration for tenure at its school did most of the work), or that there is an alternative mechanism behind the results. In Section 4.2, I test several potential mechanisms. We would see these same empirical patterns if women contribute less to projects that are joint with men. Comparing the productivity of men and women who were denied tenure helps to disentangle these two stories. If women who coauthor are given less credit, then women who coauthor and are denied tenure should on average be more productive than men who are denied tenure. If women who coauthor simply contribute less, we would not expect to see productivity differences between men and women who are denied tenure. I use two productivity measures to test whether women who coauthor and are denied tenure are more productive than men: the number of solo-authored AER-equivalents an individual publishes after the tenure decision and the log number of citations an individ- ual has as of 2015. 1213 Individuals who leave academia and do not publish after tenure are excluded from the AER-equivalent outcome sample, but including them and setting 12 Citations were scraped from Google scholar in 2015. 13 For the top 5 papers outcome, I do not compare coauthored papers as these can reflect the ability of one’s coauthors. Citation data includes both solo and coauthored papers as the data came in this structure. 16 their number of post-tenure papers to zero does not change the results. Table 5 shows the results from estimating Y if st =β 1 fem i +β 2 FracCA it +β 3 T i +β 4 (fem i ×FracCA it ) +β 5 (fem i ×T i ) +β 6 (FracCA it ×T i ) +β 7 (FracCA it ×T i ×Fem i ) +X ′ i γ+θ f +θ t +θ p + if st (6) where the outcome variableY if st is one of the two productivity measures described above andT i is a tenure dummy. I include a post-tenure institution fixed effect,θ p , to account for the fact that individuals will have access to different resources depending on where they go after the initial tenure decision. Column 1 shows the results from estimating equation 6 with the number of solo- authored AER-equivalents as the outcome. Women who are denied tenure and coauthor have 0.4 more solo-authored AER-equivalents than men who are denied tenure and coau- thor. Column 2, which has log citations as the outcome variable, shows a similar pattern although the results are much noisier. Together, these results provide some suggestive evidence that these women receive less credit for joint projects. 4.2 Alternative Stories There are other possible explanations for the above findings, not all of which can be tested with these particular data. Here I shed light on four standard and testable channels: ability-based sorting, preference-based sorting, women not claiming credit for their work, and taste-based discrimination. The empirical patterns are inconsistent with all of the proposed explanations. 4.2.1 Ability-Based Sorting Employers might rationally deny women who coauthor tenure if individuals sort such that only lower ability women coauthor with men. This could arise for several reasons. For example, if coauthoring lowers the cost of producing a paper, but women know that they receive less credit for papers, high ability women might forego the cost savings and choose to work alone. They know they can produce high quality papers by themselves and send the employer a clearer signal of their ability. However, if low ability women can only produce high quality papers with the help of a high ability man, they might coauthor even if they receive less credit. High ability men will agree to coauthor with them if it reduces the cost of the paper without reducing the quality. Employers would 17 then know that any woman coauthoring with a man is lower ability. In what follows, I test whether women anticipate receiving less credit, whether high ability women sort out of coauthoring with men, and whether men coauthor with women whose careers begin more slowly. To do so, I first present survey evidence suggesting that women do not know that the returns to coauthoring are lower than solo-authoring. I then show that women do receive some credit for papers that publish well, suggesting that employers might believe that there is some assortative matching. I also provide evi- dence that even when women tend to work with men who are slightly higher ability than themselves this unequal match does not explain the gender gap in tenure. Survey Evidence on Knowledge of Returns to CoauthoringIf women know that their returns to coauthoring with men are low, it is plausible that high ability women would choose to solo-author or only work with other women. Here I test whether women antici- pate receiving less credit for collaborative work using a survey conducted with economists currently working at the top 35 U.S. economics departments. The survey was sent to all professors, regardless of rank, at these institutions and received an 32% response rate. The gender composition of the sample is representative of the profession today, with 89 respondents being female and 300 being male. In the survey, economists were asked the following question: Suppose a solo-authored AER increases your chance of receiving tenure by 15%. For each of the following, please give an estimate of how much you think the described paper would increase your chance of receiving tenure. Respondents then go through five types of papers (coauthored AER, coauthored AER with senior faculty, coauthored AER with junior faculty, solo-authored top field, and coau- thored top field) and record their beliefs about the returns to these papers 14 . In Table 6, I test the difference in the mean beliefs of men and women. 15 There is no statistically significant difference in the beliefs of men and women for any type of paper. Men believe that a coauthored AER will increase their chance of receiving tenure by 12.1%, and women by 12.2%. Women believe that there are slightly lower returns to AER papers coauthored with senior faculty (8.8% versus 9.1% for men), but the difference is not statistically significant. These results suggest that, in this context, women are unaware of 14 I did not ask respondents about paper coauthored with men/women so that they would not be primed to think about gender 15 Because the survey was anonymous, the answers can not be linked to the CV data. I can therefore only test for differences in means without controls. 18 the true returns to coauthoring. C.V. Evidence on Sorting by AbilityA second test of whether women know that they will receive less credit for papers and sort accordingly is to look at the correlation between propensity to coauthor and ability. I first test whether high ability women are less likely to coauthor than low ability women and then test for assortative matching among coau- thors. I proxy for ability using the quality of journal that an individual’s job market paper was published in. I assume that the job market paper is the first solo-authored paper an individual publishes after he or she graduates. If women anticipate discrimination, ability and the fraction of one’s papers that are coauthored will be negatively correlated. High ability women should be less likely to coauthor. In Figure 5.A I plot the coefficients ˆ β 1 and ˆ β 2 from estimating FracCA if st =β 1 a i +β 2 (fem i ×a i ) +β 3 fem i +β 4 TotPapers i +θ f +θ s +θ t + if st (7) whereFracCA if st is the fraction of personi’s papers that are coauthored anda i is person i’s ability (job market paper rank). If high ability women anticipate receiving less credit, we expect ˆ β 2 <0. In Figure 5.A, however, we see that ability is uncorrelated with the fraction of papers that are coauthored for both men and women: both estimates are precise zeros. There is no evidence that women along the ability distribution act strategically in their choice to coauthor versus solo author. I also find no evidence that high ability women strategically coauthor with other women rather than men. Figure 5.B plots the results from equation 7 using the fraction of papers that are coauthored with women as the dependent variable. Women are more likely to coauthor with other women than men are but there is no sorting by ability. While women do not seem to be sorting according to ability, it is possible that women tend to work with higher-ability or more prominent coauthors who then receive more credit for a paper. I test for this by correlating a person’s ability with that of his or her coauthors. While I do not have the job market paper information for all coauthors in the dataset, I can see where the coauthors were working at the time the individual went up for tenure. As a measure of average coauthor ability, I take the average school rank of all of an individual’s pre-tenure coauthors. For example, ificoauthors withjandkandjworks at the 5th-ranked institution andkworks at the 15th-ranked institution, the average ability ofi’s coauthors is 10. I correlatei’s ability with the average ability of her coauthors in Figure 6. The line of best fit is plotted controlling for number of coauthored and solo-authored publications, 19 time until tenure, and field, institution, and tenure year fixed effects. Men and women both sort positively on ability but women are more likely to collab- orate with individuals at more highly-ranked institutions than men are. To see whether this explains the main results, I estimate T if st =β 1 S i +β 2 (fem i ×S i ) +β 3 CA i +β 4 (fem i ×CA i ) +β 5 rank iJ +β 6 (CA i ×rank iJ ) +β 7 (fem i ×CA i ×rank iJ ) +β 8 (fem i ×rank iJ ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + if st (8) whererank iJ is the average institution rank ofi’s coauthors and all other variables are defined as before. The results are reported in Table 7. If men receive more credit because they are coauthoring with lower ability women, ˆ β 7 should be negative. However, ˆ β 7 is close to zero, indicating that the ability or prominence of one’s coauthor is not driving the tenure gap for coauthoring women. Returns to Top PapersFor high ability women to receive no credit for their coauthored papers, employers would have to believe that there is no assortative matching by abil- ity. Otherwise, employers would receive a signal that women who coauthor with high ability men are also high ability, and be more likely to promote them. Figure 6 shows that assortative matching does occur, but it is possible that employers do not recognize this. I test for this by looking at how credit for top 5 publications is allocated. If employ- ers know that there is assortative matching, they should believe that women coauthoring with high-ability men are also likely to be high ability. Table 8 shows the results from estimating T ifst =β 1 TopS i +β 2 (fem i ×TopS i ) +β 3 TopCA i +β 4 (fem i ×TopCA i ) +β 5 NonTopS i +β 6 NonTopCA i +β 7 (fem i ×NonTopS i ) +β 8 (fem i ×NonTopCA i ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + ifst (9) whereTopS i andTopCA i are the number of solo and coauthored papers that individuali has published in a top 5 journal. Similarly,NonTopS i andNonTopCA i are the number of solo and coauthored papers the individual has published in non-top 5 journals. In Table 8, the “nop-top 5” interaction terms are presented in the second column. Power becomes an issue as (1) there are relatively few people publishing in the top 5 journals, and (2) cutting by gender means that there are even fewer women in each category. Table 8 shows that coauthored papers published in a top 5 journal help women much 20 more than those published in non-top 5 journals. Non-top 5 coauthored papers do not have any positive influence on women’s tenure probability. It seems that employers re- ceive some signal when a woman publishes her coauthored papers in top journals which is at odds with the hypothesis that only low ability women coauthor with men. Overall, there is little evidence that ability-based sorting is driving the results. 16 If anything, employers seem to recognize that high ability men and women might work together and are therefore more likely to grant these women tenure. However, their tenure rate is still lower than that of high ability men. 4.2.2 Preference-Based Sorting If women prefer to coauthor with senior faculty, we could reasonably expect that women would have lower tenure rates. Assuming senior faculty are more likely to be credited for a paper, the fact that most senior faculty are men would drive the correlation between coauthoring with a man and tenure. That is, women receive less credit because they enjoy coauthoring with senior faculty and these senior faculty are predominantly male. The basic summary statistics showed that women were not more likely to coauthor with senior faculty than men. However, I conduct an additional test as to whether coau- thorship with senior faculty could be driving the results. I reestimate equation 3 but con- trol for the fraction of a person’s coauthors who are senior. The results are presented in Column 3 of Table 7. The seniority of women’s coauthors does not explain the results. Controlling for seniority, an additional coauthored paper increases a man’s probability of tenure by 8 percentage points but a woman’s by 5 percentage points. 4.2.3 Timing of Coauthorship It is possible that men offer to work with women who are struggling to publish. If this is the case, we should see women who have few publications in the early years of their appointment being more likely to coauthor with men. I test for this possibility by looking at differences in early publications and by testing whether women with a longer time lag between their initial appointment and first publication are more likely to coauthor with men. 16 Garcia and Serman (2015) show that there could be selection into coauthorship driven by a desire to be first author on a paper (that is, depending on where you are in the alphabet relative to your coauthors). This would be an issue in this setting if, for example, men are more likely to be strategic than woman and are therefore more likely to be first author on a paper (which is correlated with having more citations). I test whether men are more likely to be first author on their papers than women and whether men have a “higher” author position overall. I find that men in my sample are first author 57% of the time while women are first author 55% of the time (p-value = 0.907). 21 Figure 7 descriptively shows the timing of publications for men and women, split by whether they received tenure at their initial tenure institution. More formally, I test whether women have fewer publications early in their careers by estimating Y if st =β 1 Fem i +β 2 T is +β 3 (Fem i ×T is ) +β 4 Papers i +β 5 ̄q i +θ f +θ s +θ t + if st (10) whereY if st is the number of years between individuali’s initial appointment andi’s first post-appointment publication. 17 I test whether women who did not receive tenure had a longer publishing lag by interacting the female dummy term with an indicator for re- ceiving tenure at schools,T is . I control for the number of papers published pre-tenure (Papers i ) and the average quality of those papers ( ̄q i ) All other variables are defined as before. The results are presented in Table 9. Women who do not receive tenure do have a longer lag (approximately 0.5 years) between their first appointment and their first publi- cation although the result is noisily estimated. I test whether women with a longer lag are more likely to coauthor with men by estimating FracM if st =β 1 Fem i +β 2 T is +β 3 (Fem i ×T is ) +β 4 Y i +β 5 (Fem i ×Y i ) +β 6 (Fem i ×T is ×Y i ) +β 4 Papers i +β 5 ̄q i +θ f +θ s +θ t + if st (11) where the outcome variable,Y i in equation 10 is used as a regressor. If men bring women with a slow start to publishing onto their projects, we would expect to see ˆ β 5 >0. The results, presented in Column 2 of Table 9, do not support the hypothesis that women who struggle to publish initially are more likely to begin publishing with men. The coefficient onβ 5 is negative, suggesting that women with a longer publishing lag are less likely to coauthor with men although this result is again insignificant. 4.2.4 Women Not Claiming Credit for Papers Women might be given less credit for their work if they are less likely to claim it as their own. For example, if women present less frequently than men, people might associate a paper with the male coauthor who presents it more. The survey discussed in Section 4.2.1 also asked individuals how many times per year they present their work and whether they are more or less likely to present their coauthored papers than their coauthor. Panel B of Table 6 shows that men and women report the same likelihood of presenting their joint papers relative to their coauthors. Interestingly, though, women present their solo- 17 I exclude papers that were published before the person’s first appointment. 22 authored papers fewer times per year than men do. It is possible that women do not "advertise" their work as much as men do and this leads to women receiving less recogni- tion for their work in general. If this were true, though, women who solo author should also be less likely to receive tenure. 4.2.5 Taste-Based Discrimination If some employers have a distaste for tenuring women, as in Becker (1971), we should see women who write solo-authored papers being denied tenure as well. If employers cannot plausibly deny a woman who solo-authored several well-published papers, however, they might be constrained to deny tenure only to those for whom they can make a reasonable case. If it can be argued that a woman who coauthors did little of the work, taste-based discrimination could help to explain the results as employers have an excuse for denying tenure to coauthoring women. However, as shown in Table 3, only women who coauthor with men have lower tenure rates. This would imply that employers have a particular distaste for tenuring women who coauthor with men, which seems unlikely. 5 Experimental Evidence In the previous section, I provided suggestive evidence that factors like sorting and taste- based discrimination do not explain why women who coauthor with men are less likely to receive tenure. I instead argue that the results are most consistent with women receiv- ing less credit for joint work with men. Specifically, because coauthored signals are an unclear signal of ability, women receive less credit for their joint work with men if they are believed to be lower ability. I cannot rule out, though, that real of perceived differ- ences in effort explain the results. For example, tenure committees might hold the belief that women contribute less or lower effort when they work with men, regardless of their beliefs about a woman’s ability. In addition, tenure committees might believe that low ability women choose to work with high ability men even if the empirical evidence sug- gests otherwise. To shed light on whether effort or perceptions of effort and sorting are driving the results, I run an experiment on mTurk that is designed to shut down these channels. I also test whether differences in the allocation of credit depend on whether the associated task is male or female-stereotyped, which speaks to whether beliefs about ability are driving the results. In a female-stereotyped or gender-neutral task, we would not expect there to be differences in beliefs about men and women’s abilities. Although this setting is different 23 from academia, it provides additional evidence that gender plays a role in the allocation of credit and that differences in credit are not driven by sorting or effort. 5.1 Experiment Design The experiment consists of two incentivized parts and follows a two-by-two randomiza- tion design. In the first step, mTurk workers are recruited to complete two math quizzes (male-stereotyped) or two grammar quizzes (female-stereotyped), each containing five questions. 18 I refer to the two quizzes as Quiz 1 and Quiz 2. These workers, hereafter called “quiz-takers”, received $0.30 for participating in the study as well as $0.05 for each question they answer correctly. In the second step, designed to test whether people misallocate credit for joint work, a different set of mTurk workers, referred to as “predictors”, are recruited to predict how well quiz-takers will do on Quiz 2. Predictors are randomized into a solo treatment or a group treatment, described in greater detail below. In each treatment, predictors are shown the Quiz 1 questions and correct answers. They are then either shown two randomly-drawn quiz-takers’ individual scores on Quiz 1, or a single score that is the sum of the two quiz-takers’ Quiz 1 scores. Upon seeing these Quiz 1 scores, as well as the questions that were asked on Quizzes 1 and 2, the predictors are asked to estimate how well the quiz-takers will do on Quiz 2. Predictors were paid a participation fee of $0.50 and receive $0.10 for each score they correctly predict 19 . I also run a cross-treatment in which predictors in both the solo and group treatments are shown the average score of all men and women on each quiz to understand whether predictors’ guesses are driven by incorrect beliefs. 5.1.1 Treatments Solo TreatmentThe solo treatment tests whether predictors correctly predict scores when they see a clear signal of each quiz-taker’s ability. This parallels the solo-author paper analysis: if predictors correctly assign credit when they see a clear signal of ability (i.e. they assume that the quiz-taker was responsible for his or her quiz score), there should be no difference in how men and women are evaluated conditional on Quiz 1 scores. In the solo treatment, predictors are shown a randomly chosen man and woman’s score on Quiz 1. Both the man and the woman took the same quiz (either math or grammar). The predictor is first shown the questions asked on Quiz 1. Next, the predictor sees each 18 The math and grammar quizzes are shown in Appendix B. 19 Differences in payments reflect differences in the time it took to complete the tasks 24 quiz-taker’s score on Quiz 1, as well as the distribution of quiz scores across all partici- pants. Note that the distribution of scores is not broken out by gender. Predictors are then shown the Quiz 2 questions and are asked to predict each quiz-taker’s score on Quiz 2. The full set of instructions that predictors receive are shown in Appendix D. Group TreatmentThe group treatment is designed to understand how the predictors assign credit for performance when they cannot observe individual contributions. Quiz- takers are randomly paired with a member of the opposite sex. However, there is no interaction between the two: each quiz-taker completes the same five-question quiz and is paid based on his or her individual score. The other person’s score does not affect them in any way. In this treatment, though, the predictor is shown the sum of the two scores rather than the individual scores. For example, if Person A scored 3/5 and Person B scored 4/5, the predictor would see the score 7/10 for that pair. Importantly, the predictor is told that each quiz-taker was paid based on the number of questions that she answered correctly on herownquiz. Thus, the predictors know that quiz-takers are randomly paired, but worked independently and were individually incentivized. This removes any worry a predictor might have about selection into pairs (such as high-ability men working with low-ability women) and free-riding. The predictor’s guess should therefore only reflect his or her beliefs about each quiz-taker’s score and ability. To draw a parallel between this treatment and the main analysis, the individual scores that make up the joint score can be thought of as each person’s “contribution” to the group, but in this case cannot be driven by selection or effort. Cross-Randomization: Gender DistributionTo understand whether predictors’ guesses are driven by (possibly incorrect) beliefs about ability or taste-based animus, I run a cross- randomization in which I provide participants with information about the average per- formance of men and women on the quizzes. All predictors are shown the overall score distribution, but in this treatment, they are additionally shown the average male and fe- male scores, shown in Figure B1. All other predictors are shown the same figure but without the lines indicating the mean performance of men and women. If predictors exhibit taste-based animus, providing them with information about men and women’s average performance will not change their prediction. In addition, this treatment also helps to understand whether differences in attribution are driven by incor- rect beliefs about gender differences in performance. If participants hold incorrect beliefs about men and women’s average performances, the gender distribution treatment should 25 correct those beliefs and the predictors should adjust their guesses accordingly. To summarize, the specific steps of the experiment are as follows: 1. Quiz-takers are randomly assigned to take two math quizzes or two grammar quizzes 2. Predictors are shown the questions and correct answers from Quiz 1 3. Predictors are shown the quiz scores of two randomly drawn quiz-takers. If predic- tors are in the solo treatment, they see each quiz-taker’s score. If predictors are in the group treatment, they see the sum of the two scores. 4. Predictors are shown the distribution of quiz scores. If predictors are in the “gender distribution” cross treatment, they also see the average scores of all men and women who took the quiz. 5. Predictors are shown the Quiz 2 questions. 6. Predictors guess what each quiz-taker’s score will be on Quiz 2. 5.2 Results Quiz Results Women had a lower average score than men on the math quizzes (2.51/5 vs. 2.72/5) and a higher average score on the grammar quizzes (2.41 vs. 2.17). The distribution of scores on Quiz 1 are shown in Figure B1. This is the same figure that predictors are shown. If predictors are in the gender distribution cross-treatment, they also see the two lines indicating the mean male and female scores. Analysis The main experimental results are presented in Table 10. Columns (1) and (3) show how predictors’ guesses vary based on the quiz-taker’s gender and by whether they were in the gender distribution treatment. Specifically, I estimate Q2 ij =β 1 fem i +β 2 D j +β 3 (fem i ×D j ) +β 4 Q1 i + ij (12) separately for the sample of individuals who took math quizzes (Column 1) and grammar quizzes (Column 3). The outcome variable,Q2 ij , is predictorj’s estimate of quiz-takeri’s Quiz 2 score. An indicator for the quiz-taker being female,fem i , is interacted with an 26 indicator for the predictor being in the gender distribution cross-treatment,D j . I also control fori’s Quiz 1 score (Q1 i ). Women are predicted to have lower average math scores and higher average gram- mar scores than men, although the gender difference in math scores is insignificant. The difference in grammar score predictions is driven by the gender distribution treatment, suggesting that predictors did not hold the prior that women outperform men on gram- mar quizzes. In Columns (2) and (4), I interact all of the independent variables in equation 12 with an indicator for predictorjbeing in the group treatment. Note that in the group treatment, predictors do not see the individual Quiz 1 scores as they only see a man and a woman’s joint score. I control for this variable regardless and interact it with the group treatment dummy, as it should only have a significant effect for predictors in the solo treatment. The predicted score difference for math quizzes is entirely driven by the group treat- ment. When predictors see only the sum of a man and a woman’s Quiz 1 score, they pre- dict that women will score 0.35 points lower on the second math quiz than men. However, there is no significant difference in men and women’s predicted performance in the solo treatment (if anything, women are predicted to perform slightly better than men). This is consistent with the finding that women suffer a coauthor penalty when their contribution to a paper is unobserved but are not discriminated against when their contributions are observed, as in solo-authored papers. Predicting that the woman will do worse than the man in the group treatment suggests that predictors believe that the woman’s first score was lower; that is, she is worse at the task and therefore contributed less to the joint score. Puzzlingly, showing predictors the mean scores of male and female quiz-takers does not change the predictions for the second math quiz. Predictors guess that women will score 0.35 points below men regardless of whether they saw the mean scores. The grammar results in Column (4), though, suggest that the results are not driven by taste-based animus in which women are always penalized in collaborative situations. Women in the group treatment are not predicted to perform worse than men on the second quiz, indicated by a positive, though insignificant, coefficient on Female×Group Treatment. Predictors in the group treatment who see the actual score distribution also predict that women will outperform men on the second quiz. While the experimental context is different from the academic context, the results pro- vide evidence that individuals make different inferences about men and women’s contri- butions to a joint project which are rooted in stereotypes and bias. Even after shutting down effort and selection channels, and providing the distribution of scores, participants still used gender to assign credit. High scores on joint math quizzes were assumed to be 27 driven by men’s individual scores rather than women’s while high scores on join gram- mar quizzes were assumed to be driven by women’s individual scores. While there are gender differences in scores, the actual differences are smaller than the predicted differ- ences even when predictors saw the actual mean differences. In the individual treatment, as in the case of solo-authored papers, men and women are evaluated similarly. 6 Conclusion Women receive tenure at significantly lower rates than men in many academic fields. As discussed in the introduction, this phenomenon is not exclusive to academia. Several explanations have been put forward for the gap, but it persists even after accounting for observable characteristics such as fertility preferences and productivity. This paper proposes an alternative explanation. I argue that women receive less credit for group work when employers can not perfectly observe their contribution. When sig- nals are noisy, employers have to infer each worker’s ability or productivity. Coauthored papers provide employers with a noisy signal. The fact that women who work specifically with men receive tenure at lower rates than comparable women who work alone or with other women suggests that gender enters into the employer’s inference process. However, when employers receive clear signals, men and women are treated similarly. For exam- ple, men and women receive the same amount of credit for solo-authored papers, which provide a clear signal of ability. Evidence from an online experiment suggest that these results are not explained by sorting or differences in effort. The online experiment further suggests that this phenomenon is not specific to women. Men also suffer a penalty when working with women on a female-stereotyped task. Being aware of this phenomenon is important in a world that is increasingly relying on group work for production. The tech industry, for example, prides itself on collabora- tion. In such male-dominated fields, however, group work could result in fewer women moving up the career ladder if credit is not properly attributed. The same could be true for men in female-dominated industries. The unequal attribution of credit would then contribute to and help maintain gender segregation in occupations. 28 References [1] Antecol, Heather, Kelly Bedard, and Jenna Stearns. 2016. "Equal but Inequitable: Who Benefits from Gender-Neutral Tenure Clock Stopping Policies?" IZA Discussion Paper No. 9904. [2] Becker, Gary. 1971.The Economics of Discrimination.Chicago: University of Chicago Press. [3] Blau, Francine D. and Jed DeVaro. 2007. "New Evidence on Gender Differences in Pro- motion Rates: An Empirical Analysis of a Sample of New Hires." Industrial Relations, 46(3): 511-550. [4] Boschini, Anne and Ana Sjögren. 2007. "Is Team Formation Gender Neutral? Evidence from Coauthorship Patterns." Journal of Labor Economics, 25(2): 325-365. [5] Carrell, Scott E., Marianne E. Page, and James E. West. 2010. "Sex and Science: How Professor Gender Perpetuates the Gender Gap." Quarterly Journal of Economics, 125(3): 1101-1144. [6] Ceci, Stephen J., Donna K. Ginther, Shulamit Kahn, and Wendy M. Williams. 2014. "Women in Academic Science: A Changing Landscape." Psychological Science in the Public Interest, 15(3): 1-67. [7] Dynan, Karen and Cecilia Rouse. 1997. "The Underrepresentation of Women in Eco- nomics: A Study of Undergraduate Economics Students." The Journal of Economic Education, 28(4): 350-368. [8] Elmins, William, Robert Joyce, and Monica Costa Bias. 2016. "The Gender Wage Gap." Institute for Fiscal Studies Briefing Note 186. [9] Garcia, Daniel and Joshua Sherman. 2015. "Norms and Team Formation: Evidence from Research Partnerships." working paper. [10] Ginther, Donna K. and Shulamit Kahn. 2004. "Women in Economics: Moving Up or Falling Off the Academic Career Ladder?" Journal of Economic Perspectives, 18(3): 193-214. [11] Hale, Galina and Tali Regev. 2014. "Gender Ratios at Top PhD Programs in Eco- nomics." Economics of Education Review, 41: 55-70. 29 [12] Hammermesh, Daniel. 2013. "Six Decades of Top Economics Publishing: Who and How?" Journal of Economic Literature, 51(1): 162-172. [13] Kalaitzidakis, Pantelis, Theofanis Mamuneas, and Thanasis Stengos. 2003. "Rankings of Academic Journals and Institutions in Economics." Journal of the European Eco- nomic Assocation, 1(6): 1346-1366. [14] Lazear, Edward P. and Kathryn L. Shaw. 2007. "Personnel Economics:The Economist’s View of Human Resources." Journal of Economic Perspectives, 21(4): 91- 114. [15] MacLeod, W. Bentley. 2003. "Optimal Contracting with Subjective Evaluation." Amer- ican Economic Review, 93(1): 216-240. [16] Milkman, K.L., M. Akinola, and D. Chugh. 2015. "What Happens Before? A Field Experiment Exploring How Pay and Representation Differentially Shape Bias on the Pathway into Organizations." Journal of Applied Psychology, 100(6): 1678-1712. [17] Moss-Racusin, Corinne, John Dovidio, Victoria Brescoll, Mark Graham, and Jo Han- delsman. 2012. "Science Faculty’s Subtle Gender Biases Favor Male Students." PNAS, 109(41): 16474-16479. [18] Niederle, Muriel and Lise Vesterlund. 2007. "Do Women Shy Away From Competi- tion? Do Men Compete Too Much?" Quarterly Journal of Economics, 122(3): 1067- 1101. [19] Prendergast, Canice. 1999. "The Provision of Incentives in Firms." Journal of Eco- nomic Literature, 104(5): 7-63. [20] Sarsons, Heather. 2017. "Recognition for Group Work:Gender Differences in Academia." AER Papers & Proceedings, 107(5): 141-145. 30 Figures FIGURE1: DISTRIBUTION OFPAPERCOMBINATIONS Notes: This figure shows the number of women (Panel A) and men (Panel B) who had various combinations of solo and coauthored papers at the time of tenure. Each dot represents a specific combination of papers with the number of coauthored papers measured on the x-axis and the number of solo-authored papers measured on the y-axis. The shading of the dots represents how many individuals had that combination of papers at the time they went up for tenure, with darker shades indicating a larger number of individuals with that combination. In the legend, “n” is the minimum and maximum number of individuals who have a specific paper combination. Panel A is constructed using the full sample of women (N=143) and Panel B is constructed using the full sample of men (N=501). 31 FIGURE2: TENUREPROBABILITIES BYPAPERCOMBINATIONS Notes: This figure plots the unconditional tenure probability for women (Panel A) and men (Panel B) who have various combinations of papers at the time they go up for tenure. Coauthored papers are counted along the x-axis and solo-authored papers are countred along the y-axis. A darker shade indicates a higher probability of receiving tenure. For example, if a dot is the darkest shade, it indicates that individuals with that combination of solo and coauthored papers receives tenure with probability one. Panel A is constructed using the full sample of women (N=143) and Panel B is constructed using the full sample of men (N=501). 32 FIGURE3: TOTALPAPERS ANDTENURE Notes: This binned scatterplot shows the correlation between the total number of publications an individual has at the time they go up for tenure and the probability of receiving tenure. The y-variable, tenure, is a binary variable that equals one if an individual received tenure at their initial institution of employment. For more details on how the tenure variable is constructed, see Section 2. To construct the plot, tenure is first residualized with respect to the following controls: number of years it took to go up for tenure, average journal rank of pre-tenure publications, log citations, total coauthors, and tenure school, tenure year, and field fixed effects. The x-variable, number of publications, is then divided into twenty equal-sized groups. Within each of these groups, I plot the mean of the y-variable (tenure) residuals against the mean of the x-variable (also within each bin). I then add back the unconditional mean of Tenure to help with the interpretation of the line of best fit. The lines of best fit are estimated using the full sample (N=621) and have slopes ofβ=0.132(s.e. = 0.016) for men andβ=0.165(s.e. = 0.043) for women. 33 FIGURE4: RELATIONSHIPBETWEENPAPERCOMPOSITION ANDTENURE Notes: This figure is a binned scatterplot of the correlation between tenure and the fraction of an individual’s papers that are solo-authored, split by gender. The y-variable is a binary variable indicating whether an individual received tenure. To construct the plot, tenure is first residualized with respect to the following controls: total number of papers an individual published by the time of tenure, number of years it took to go up for tenure, average journal rank of pre-tenure publications, log citations, total coauthors, and tenure school, tenure year, and field fixed effects. The x-variable, fraction of papers that are solo-authored, is then divided into twenty equal-sized groups. Within each of these groups, I plot the mean of the y-variable (tenure) residuals against the mean of the x-variable (also within each bin). I then add back the unconditional mean of Tenure to help with the interpretation of the line of best fit. The line of best fit using OLS is shown separately for men and women. The lines of best fit are estimated using the full sample (N=621) and have slopes ofβ=0.521(s.e. = 0.158) for women andβ=0.023(s.e. = 0.748) for men. 34 FIGURE5: ABILITY ANDSORTING Notes: This binned scatterplot shows the correlation between an individual’s ability and the propensity to coauthor (Panel A) or the propensity to coauthor with women (Panel B). The outcome variable in Panel A is the fraction of an individual’s papers that were published by tenure that are coauthored. The outcome variable in Panel B is the fraction of an individual’s pre-tenure papers that are coauthored with only women. I proxy for an individual’s ability with the rank of the journal in which the individual’s job market paper was published. The plot is constructed as described in Figure 3 with the y-variable residualized on the following controls before plotting: total solo and coauthored papers, the number of years it took to go up for tenure, log citations, and tenure school, tenure year, and field fixed effects. The lines of best fit using OLS are shown separately for men and women. The estimates for Fig. 5A areβ=−0.0001(s.e. = 0.0003) for women andβ=0.0002(s.e. = 0.0002) for men. The estimates for Fig. 5B areβ=−0.00004(s.e. = 0.0008) for women andβ=0.0002(s.e. = 0.0003) for men. 35 FIGURE6: ASSORTATIVEMATCHING Notes: This binned scatterplot shows the correlation between an individual’s ability, proxied by the journal in which their job market paper is published, and their coauthor’s ability, proxied by the average school rank of their coauthors. The school rank of coauthors are measured at the time that individualiwent up for tenure. School rankings are taken from IDEAS/RePEc. The plot is constructed as described in Figure 3 with the y-variable residualized on the following controls before plotting: total solo and coauthored papers, the number of years it took to go up for tenure, log citations, and tenure school, tenure year, and field fixed effects. The line of best fit using OLS is shown separately for men and women. The lines of best fit are estimated on the full sample and have slopes of β=0.062(s.e. = 0.091) for women andβ=0.109(s.e. = 0.056) for men. 36 FIGURE7: TIMING OFPUBLICATIONS Notes: This figure shows the average number of publications an individual has in the years surrounding his or her initial appointment as an assistant professor. Year 0 is the year that the individual begins working at his/her tenure institution (tenure institutions are defined in Section 2). The blue bars represent publications that are coauthored with men. The red bars represent all other publications (either solo-authored or coauthored with women). Panels A and B show the timing of publications for women and men who were denied tenure. Panels C and D show the timing of publications for women and men who received tenure. 37 Tables TABLE1: SUMMARYSTATISTICS FullMaleFemalep-value Panel A: Tenure0.680.730.520.001 (0.47)(0.44)(0.50) Years to tenure6.86.67.30.001 (1.6)(1.6)(1.8) Total papers8.38.48.00.262 (3.9)(4.1)(3.3) Solo-authored3.03.03.00.879 (2.4)(2.4)(2.3) Coauthored5.35.45.00.189 (3.6)(3.7)(3.1) Panel B: Top 5 Solo0.670.660.680.900 (0.98)(0.99)(0.92) Top 5 Coauthored1.31.31.20.570 (1.4)(1.4)(1.4) AER Equivalent: Solo Pubs.0.340.340.330.500 (0.24)(0.23)(0.25) Coauthored Pubs.0.330.340.300.039 (0.20)(0.21)(0.18) Panel C Number Unique CAs4.524.554.470.767 (2.79)(2.78)(2.83) Frac. coauthors who are: Full Professor0.460.470.410.052 (0.35)(0.33)(0.38) Associate Professor0.160.150.160.810 (0.24)(0.23)(0.28) Assistant Professor0.250.230.280.060 (0.24)(0.22)(0.30) Graduate Student0.0170.0150.0210.239 (0.067)(0.056)(0.095) Female0.130.0940.2700.001 (0.23)(0.179)(0.309) Observations644501143 This table displays the average tenure rate, pre-tenure productivity, and pre-tenure authorship patterns of men and women who went up for tenure at one of top 35 U.S. economics departments between 1985 and 2014. The top 35 institutions are taken determined according to the RePEc/IDEAS economics de- partment rankings. In Panel A, Tenure is an indicator that equals one if an individual was promoted to associate or full professor 6-8 years after his or her initial appointment. Years to tenure is the number of years between an individual’s PhD graduation year and the year s/he went up for tenure. All paper counts are measured as the number of papers an individual had published at the time of tenure. Top 5 Solo/Coauthored are the number of publications an individual had published in one of the top 5 eco- nomics journals: AER, QJE, Econometrica, JPE, and ReStud.AER Equivalentis a measure that converts an individual’s publications into the number of AER-equivalent publications they correspond to. For more details on this variable, see Section 2. Number Unique CAs is the number of different coauthors an individual had published with by the time s/he went up for tenure. Coauthor positions (full, asso- ciate, assistant, and graduate student) are the positions an individual’s coauthors had at the time that individual went up for tenure. 38 TABLE2: RELATIONSHIPBETWEENPAPERS& TENURE Outcome Variable: Tenure (1)(2)(3)(4)(5) Sample:FullFullFullFemaleMale Total papers0.142 ∗∗∗ (0.016) Fem x Papers-0.005 (0.012) Solo-authored0.094 ∗∗∗ 0.097 ∗∗∗ 0.196 ∗∗∗ 0.095 ∗∗∗ (0.013)(0.019)(0.055)(0.024) Fem x Solo0.048 ∗∗∗ 0.057 ∗∗∗ (0.018)(0.015) Coauthored0.085 ∗∗∗ 0.082 ∗∗∗ -0.0310.090 ∗∗∗ (0.016)(0.014)(0.054)(0.016) Fem x Coauthored-0.030 ∗ -0.026 ∗ (0.016)(0.015) Total coauthors-0.0050.0010.0030.025-0.001 (0.004)(0.005)(0.005)(0.016)(0.006) Total Papers Sq-0.004 ∗∗∗ (0.001) Solo Papers Sq-0.005 ∗∗∗ -0.005 ∗∗∗ -0.007-0.005 ∗∗ (0.001)(0.002)(0.005)(0.002) Coauthored Sq-0.003 ∗∗∗ -0.003 ∗∗∗ 0.000-0.003 ∗∗∗ (0.001)(0.001)(0.003)(0.001) Log Citations0.059 ∗∗∗ 0.031 ∗∗ 0.065 ∗∗∗ 0.098 ∗ 0.058 ∗∗∗ (0.012)(0.012)(0.013)(0.054)(0.016) AER Equiv. Ranking0.533 ∗∗∗ (0.116) AER Equiv. Solo0.139 ∗ 0.331 ∗∗∗ 0.3210.416 ∗∗∗ (0.071)(0.069)(0.206)(0.091) AER Equiv. CA0.201 ∗∗ 0.325 ∗∗∗ 0.486 ∗ 0.280 ∗∗∗ (0.099)(0.073)(0.248)(0.089) Female-0.135-0.166-0.205 ∗ (0.105)(0.121)(0.109) Tenure Inst. FEYNYYY Tenure Year FEYNYYY Field FEYNYYY Observations625629621139482 R-squared0.4170.2870.4250.5210.421 This table shows the relationship between publications and tenure. The dependent variable, Tenure, is binary and indicates whether an individual received tenure 6-8 years after being hired at the initial tenure institution. Total papers is the number of papers an individual published by the time s/he went up for tenure. Solo-authored and Coauthored are the number of solo or coauthored papers s/he had published at the time of tenure. AER Equiv. Ranking, AER Equiv. Solo, and AER Equiv. CA are journal quality measures described in Section 2. Total coauthors is the number of coauthors an individual had on the papers s/he had published by the time of tenure. Tenure length is the number of years it took the individual to go up for tenure. Citations are from Google Scholar and measured in 2017. The equations are estimated using a linear probability model. Bootstrapped standard errors are clustered by tenure institution and reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 39 TABLE3: COAUTHORGENDER (1) ×Female Solo-authored0.093 ∗∗∗ 0.049 ∗∗ (0.019)(0.015) CA with only fem CAs0.097 ∗∗∗ 0.019 (0.024)(0.020) CA with only male CAs0.087 ∗∗∗ -0.056 ∗∗∗ (0.015)(0.015) Pubs. with M and F CAs0.087 ∗∗ 0.033 (0.026)(0.042) Female-0.156 (0.101) Total coauthors-0.001 (0.004) Log Citations0.064 ∗∗∗ (0.014) AER Equiv. CA0.332 ∗∗∗ (0.073) AER Equiv. Solo0.328 ∗∗∗ (0.065) Tenure Inst. FEYes Tenure Year FEYes Field FEYes Observations621 This table presents the results of one regression where the variables that are in- teracted with Female (a dummy indicating that the researcher is a woman) are displayed in the right-hand column.Papers with only fem CAsis the number of publications an individual has in which all coauthors are female. Similarly, Papers with only male CAsandPapers with male and fem CAsare the number of publications with only male coauthors and with a mix of male and female coau- thors respectively. Controls for tenure length; quadratics in the number of papers; and tenure institution, year, and field fixed effects are also included. The equa- tions is estimated using a linear probability model. Bootstrapped standard errors are reported in parentheses and are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 40 T ABLE 4: R OBUSTNESS C HECKS Faculty Journal Rankings Publication Count Total AERs List Sample RePEc Over Time AER Bins Tenure +1 Tenure +2 (1) (2) (3) (4) (5) (6) (7) Solo-authored 0.115 ∗∗∗ 0.083 ∗∗∗ 0.078 ∗∗∗ 0.078 ∗∗∗ 0.058 ∗∗∗ 0.038 ∗∗∗ 0.060 ∗∗∗ (0.024) (0.018) (0.017) (0.019) (0.017) (0.013) (0.019) Fem x Solo 0.050 ∗∗ 0.055 ∗∗∗ 0.052 ∗∗∗ 0.045 ∗∗∗ 0.044 ∗∗∗ 0.055 ∗∗∗ 0.091 ∗ (0.019) (0.015) (0.015) (0.015) (0.014) (0.013) (0.024) Coauthored 0.092 ∗∗∗ 0.079 ∗∗∗ 0.081 ∗∗∗ 0.069 ∗∗∗ 0.038 ∗∗∗ 0.031 ∗∗∗ 0.089 ∗∗∗ (0.023) (0.015) (0.016) (0.020) (0.011) (0.008) (0.011) Fem x Coauthored -0.032 ∗ -0.026 ∗ -0.025 ∗ -0.029 ∗ -0.022 ∗ -0.013 -0.036 ∗ (0.019) (0.014) (0.014) (0.015) (0.013) (0.013) (0.019) Years to Tenure -0.046 ∗∗∗ -0.054 ∗∗∗ -0.055 ∗∗∗ -0.051 ∗∗∗ -0.046 ∗∗∗ -0.045 ∗∗∗ -0.050 ∗∗∗ (0.009) (0.008) (0.008) (0.009) (0.008) (0.008) (0.011) Total Coauthors -0.001 0.004 0.004 -0.004 0.008 ∗ 0.010 ∗∗ 0.003 (0.007) (0.005) (0.005) (0.006) (0.004) (0.004) (0.005) Log Citations 0.056 ∗∗∗ 0.070 ∗∗∗ 0.074 ∗∗∗ 0.079 ∗∗∗ 0.072 ∗∗∗ 0.074 ∗∗∗ 0.094 ∗∗∗ (0.017) (0.013) (0.013) (0.013) (0.014) (0.013) (0.016) CA Paper Rank 0.310 ∗∗∗ 0.003 ∗∗ 0.003 ∗∗∗ 0.345 ∗∗∗ 0.345 ∗∗∗ (0.084) (0.001) (0.001) (0.081) (0.081) Solo Paper Rank 0.457 ∗∗∗ 0.002 ∗ 0.004 ∗∗∗ 0.291 ∗∗∗ 0.299 ∗∗∗ (0.077) (0.001) (0.001) (0.069) (0.065) Female -0.158 -0.193 ∗ -0.197 ∗∗ -0.175 ∗ -0.197 ∗ -0.280 ∗∗∗ -0.199 ∗∗ (0.128) (0.102) (0.099) (0.100) (0.105) (0.110) (0.048) Observations 369 621 621 621 621 621 621 The dependent variable in all columns is an indicator for receiving tenure. Column 1 restricts the sample to those schools I received a historical faculty list from. Column 2 uses RePEc journal rankings as the paper quality measure. The ranking used can be found at https://ideas.repec.org/top/top.journals.all.html. Column3 uses historical journal rankings from RePEc to allow for rankings to change over time and to account for new journals entering. Column 4 controls for the numberof papers within each of 10 AER “bins”. For this analysis, the AER-equivalent measure is divided into deciles. For each individual, I then add up the number ofsolo and coauthored papers within each decile and include the number of papers in each bin as controls. In Columns 5 and 6, I include papers that were publishedone and two years after an individual went up for tenure in the paper counts. In Column 7, I use the AER Equivalent measure of journal ranking to calculate thetotal number of AER equivalents (solo and coauthored) an individual had at the time of tenure. I use this measure in place of the solo and coauthored paper counts(the main independent variables). All regressions control for a quadratic in the number of papers as well as tenure institution, tenure year, and field fixed effects.Bootstrapped standard errors are reported in parentheses and are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 41 TABLE5: FUTUREPRODUCTIVITY Outcome Var:Post TenureLog Citations Solo AER Equivalents (1)(2) PoissonOLS Fraction Coauthored-1.45 ∗∗∗ 0.533 (0.500)(0.390) Female-0.232-0.151 (0.380)(0.414) Female×Frac. Coauthored1.057 ∗ 0.742 (0.576)(0.660) Tenured0.1940.496 ∗ (0.352)(0.289) Tenured×Frac. Coauthored0.0020.408 (0.006)(0.486) Female×Tenured0.1850.210 (0.528)(0.509) Fem×Tenured×Frac. Coauthored-0.991-0.740 (1.131)(0.769) Top 5 Coauthored0.013 ∗∗ (0.007) Total papers0.071 ∗∗∗ (0.015) Tenure Inst. FENY Post-Tenure Inst. FEYN Tenure Year FEYY Field FEYY Observations621621 Column 1 shows the results from estimating equation 6 using a zero-inflated Poisson model, where the outcome vari- able is the number of solo-authored AER equivalents an individual published after the tenure decision (measured as of 2017). “Top 5 Coauthored” is the number of coauthored AER equivalents the individual published after tenure. Post-tenure institution is the institution the individual went to following the tenure decision. For people who re- ceived tenure, this the same as the tenure institution. Column 2 shows the results from estimating the same equation using OLS where log citations is the outcome variable. Citations are measured in 2015. Robust standard errors are reported in parentheses and are clustered at the tenure institution or post-tenure institution level. (*=p<0.10, **=p<0.05 ,***=p<0.01) 42 TABLE6: SURVEYRESULTS (1)(2)(3) Men Women p-value Panel A: Beliefs about Returns to Papers Coauthored AER12.112.20.939 Coauthored AER, Sr. Faculty9.18.80.528 Coauthored AER, Jr. Faculty13.313.40.796 Solo Top Field8.08.20.669 Coauthored Top Field6.36.80.223 Panel B: Frequency of Presenting Papers Times Presented3.12.20.07 Present More Freq. than CA0.370.440.20 Observations30089 This table presents the mean responses for men and women to the following survey questions: Panel A: "Suppose a solo authored AER increases your chance of receiving tenure by 15 percent. By how much do you think each of the following increases your change of receiving tenure?" Panel B: "How many times per year do you typically present your solo-authored papers? Are you more or less likely than your coauthors to present a joint paper?"Present More Freq. than CAis the fraction of respondents who reported that they are more likely than their coauthors to present a joint paper. The survey was conducted with a sample of academic economists currently working at a top 35 U.S. economics department. Respondents were anonymous. 43 TABLE7: ACCOUNTING FORSORTING Dep. Variable: Tenure (1)(2)(3) Solo-authored0.086 ∗∗∗ 0.084 ∗∗∗ 0.090 ∗∗∗ (0.017)(0.017)(0.017) Fem x Solo0.064 ∗∗∗ 0.067 ∗∗∗ 0.062 ∗∗∗ (0.017)(0.018)(0.017) Coauthored0.087 ∗∗∗ 0.089 ∗∗∗ 0.082 ∗∗∗ (0.014)(0.014)(0.015) Fem x Coauthored-0.032 ∗ -0.032 ∗ -0.033 ∗∗ (0.016)(0.016)(0.015) Female-0.220-0.348 ∗ -0.262 ∗ (0.121)(0.133)(0.136) Rank Difference0.001 (0.002) Fem×Rank Difference-0.001 (0.002) Avg. Coauthor Rank-0.002 (0.001) Fem×Avg. Coauthor Rank0.003 (0.002) Frac. Full Prof.-0.057 (0.071) Fem×Frac. Full Prof.0.194 (0.067) Observations595595595 The dependent variable in all columns is an indicator for receiving tenure. Column (1) shows the relationship between solo and coauthored papers and tenure when controlling for the difference between individuali’s institution rank and the average institution rank of his or her coauthors. Column (2) controls for the average institution rank of an individual’s coauthors, and column (3) controls for the fraction of an individual’s coauthors who are full professors. Only coauthors that an individual coauthored with up until tenure are included. All regressions control for tenure length, journal rank (AER equivalent measure), and log citations. They also include tenure institution, tenure year, and field fixed effects. The sample size is smaller in this analysis because individuals with no coauthors are excluded. (*=p<0.10, **=p<0.05 ,***=p<0.01) 44 TABLE8: PAPERSPLIT BYTOP5 Dep Var: Tenure (1) Top 5Non-Top 5 Solo0.067 ∗∗∗ 0.033 ∗∗∗ (0.019)(0.007) Coauthored0.086 ∗∗ 0.031 ∗∗∗ (0.016)(0.007) Fem x Solo0.0200.055 ∗∗ (0.037)(0.019) Fem x Coauthored-0.007-0.035 ∗∗ (0.031)(0.017) Female-0.171 (0.108) Total coauthors-0.002 (0.005) Years to tenure-0.049 ∗∗∗ (0.008) Log Citations0.079 ∗∗∗ (0.012) Tenure Inst. FEY Tenure Year FEY Field FEY Observations621 R-squared0.415 This table presents the results from estimating equation 9. The results in the able are from this single regression, but solo and coauthored papers are split into those published in the top 5 jour- nals (Column 1) and journals below the top 5 (Column 2). Top 5 papers are those published in the American Economic Review, Econometrica, the Journal of Political Economy, Quarterly Journal of Economics, or the Review of Economic Studies. The dependent variable is an indicator for receiving tenure. The regression in- cludes tenure institution, tenure year, and field fixed effects. Robust standard errors are clustered by tenure institution and reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 45 TABLE9: TIMING OFCOAUTHORSHIP WITHMEN Years to First Fraction of Papers Publicationwith Men (1)(2) Female0.0540.062 (0.186)(0.080) Tenure-0.0020.060 (0.130)(0.047) Female×Tenure-0.151-0.258 ∗∗ (0.237)(0.089) Years to 1st Pub.0.016 (0.015) Fem×Years to 1st Pub.-0.013 (0.032) Tenure×Years to 1st Pub.-0.030 (0.018) Fem×Tenure×Years to 1st Pub.0.017 (0.041) Total papers-0.119 ∗∗∗ 0.009 ∗ (0.015)(0.003) AER Equiv.-0.3800.251 ∗∗ (0.344)(0.081) School FEYY Tenure Year FEYY Primary Field FEYY Observations603594 This table tests whether there are gender differences in the timing of an individual’s first publication (Col- umn 1) and whether women who take a longer time to publish their first paper are more likely to coauthor with men (Column 2). The outcome variable in Column 1 is the number of years it takes an individual to publish his or her first paper after graduating, and is measured as the year of the individual’s first pub- lication minus the year of the individual’s initial faculty appointment. Articles published before the first appointment (i.e. during graduate school) are not counted. The outcome variable in Column 2 is the frac- tion of an individual’s papers published by tenure that are coauthored with men. The independent vari- able,Y earsto1stPubis the outcome variable in Column 1. Both regressions include tenure institution, tenure year, and field fixed effects. Robust standard errors are reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 46 TABLE10: EXPERIMENT: PREDICTEDSCORE BYQUIZTYPE Outcome: Predicted Quiz 2 Score MathGrammar (1)(2)(3)(4) Female-0.0960.1110.055-0.021 (0.097)(0.096)(0.103)(0.121) Gender Distribution0.0620.145-0.389 ∗∗∗ -0.241 ∗∗ (0.101)(0.096)(0.096)(0.112) Female×Gender Distribution-0.085-0.1080.424 ∗∗∗ 0.194 (0.133)(0.144)(0.144)(0.176) Female×Group Treatment-0.354 ∗∗ 0.124 (0.166)(0.179) Female×Group Treatment×Gender Distr.0.0010.545 ∗∗ (0.230)(0.255) Group Treatment×Gender Distr.-0.069-0.215 (0.165)(0.173) Group Treatment3.294 ∗∗∗ 2.957 ∗∗∗ (0.418)(0.474) Quiz 1 Score0.416 ∗∗∗ 0.735 ∗∗∗ 0.385 ∗∗∗ 0.725 ∗∗∗ (0.068)(0.072)(0.072)(0.089) Group Treatment×Quiz 1 Score-0.696 ∗∗∗ -0.693 ∗∗∗ (0.116)(0.131) Constant1.713 ∗∗∗ 0.1371.784 ∗∗∗ 0.289 (0.259)(0.261)(0.286)(0.327) Observations516516493493 R-squared0.0810.2990.0920.296 This table presents the results from an mTurk experiment in which participants predict how well an individual will do on a math or grammar quiz based on that individual’s performance on an earlier quiz. Columns 1 and 2 show the results for the sample of participants who predicted math quiz scores and Columns 3-4 show the results for the sample of participants who predicted grammar quiz scores. In the experiment, participants were randomized into one of two treatments: (1) a group that saw each individual’s score on a previous quiz, or (2) a group that saw the sum of two individuals’ scores. Group Treatment is a dummy variable indicating that participants saw the sum of two scores (Treatment 2) rather than individual scores (Treatment 1). Gender Distribution is a dummy indicating that participants were told the average quiz scores of all men and women. (*=p<0.10, **=p<0.05 ,***=p<0.01) 47 Appendix A Additional Tables TABLEA1: RESULTS BYINSTITUTION ANDYEAR Panel A: Tenure Institution Institution Rank:Top 10Top 20Top 35 (1)(2)(3) Solo-authored0.031 ∗∗∗ 0.053 ∗∗∗ 0.039 ∗∗ (0.006)(0.018)(0.018) Coauthored0.035 ∗∗ 0.052 ∗∗∗ 0.023 ∗∗∗ (0.013)(0.016)(0.007) Fem x Coauthored0.002-0.048 ∗∗ -0.048 ∗ (0.027)(0.020)(0.026) Fem x Solo0.074 ∗ 0.071 ∗ 0.104 ∗∗∗ (0.035)(0.037)(0.035) Female-0.471 ∗ -0.048-0.245 (0.247)(0.173)(0.243) Observations211157155 Panel B: Tenure Year Tenure Year:1985-19951996-20052006-2014 (1)(2)(3) Solo-authored0.034 ∗∗∗ 0.043 ∗∗ 0.033 ∗ (0.010)(0.021)(0.019) Coauthored0.018 ∗ 0.049 ∗∗∗ 0.047 ∗∗∗ (0.010)(0.011)(0.015) Fem x Coauthored0.011-0.047 ∗∗ -0.053 ∗ (0.041)(0.022)(0.027) Fem x Solo0.145 ∗∗∗ 0.079 ∗∗∗ 0.054 (0.037)(0.029)(0.042) Female-0.787 ∗∗∗ -0.219-0.003 (0.275)(0.160)(0.202) Observations141157215 Panel A shows the relationship between coauthoring and tenure by tenure institution rank. Schools are divided into the top 10, top 20, and top 35 departments, according to the RePEc rankings. All regressions include the following controls: time until tenure, number of coau- thors, log citations, solo and coauthored journal rankings, and tenure year and field fixed effects. Panel B shows the relationship splitting the sample by time period. The year groups are the years that an individual went up for tenure. All regressions include the following controls: time until tenure, number of coauthors, log citations, solo and coauthored journal rankings, and tenure rank and field fixed effects. (*=p<0.10, **=p<0.05 ,***=p<0.01) 48 Sociology Results The sociology sample consists of randomly sampled faculty at the top 20 sociology PhD- granting departments in the U.S 20 . There are 250 sociologists in the sample, 40% of whom are female. Summary statistics are presented in Table A2. There is no statistically signifi- cant difference between men and women’s tenure rates (with the mean tenure rate being 76%) although men seem to publish more solo-authored articles than women. TABLEA2: SOCIOLOGYSUMMARYSTATISTICS MenWomenp-value Tenure0.7520.7760.547 (0.433)(0.419) Total papers12.1510.180.033 (7.808)(5.726) Total coauthored6.4095.9590.567 (6.641)(4.999) Solo papers5.7454.2240.003 (4.451)(2.892) Time to tenure7.5847.5200.686 (1.607)(1.724) Books0.7790.5710.139 (1.185)(0.799) Observations150100 This table presents summary statistics for the full sample of sociologists and sepa- rately for men and women. All paper and book count variables (Total Papers,Solo- authored,Coauthored, andTop 5s) are the number of papers or books an individual had published at the time of tenure. To test whether men and women are treated differently, I reestimate equation 3 but include measures of the number of papers that researcheriis first author on. The results are presented in Table A3. I include the number and fraction of papers a researcher is first author on in Columns 1 and 2 respectively, along with female dummy interaction terms. 20 Ranking from U.S. News Education 49 TABLEA3: SOCIOLOGY: PAPERS ANDTENURE Dep Var: TenureProbitProbit (1)(2) Total first author0.050 ∗∗ (0.017) Fem x First Author0.026 (0.040) Fraction first author0.403 ∗∗∗ (0.043) Fem x Frac. First Author-0.042 (0.172) Solo papers0.0080.000 (0.006)(0.006) Fem x Total Solo0.0020.007 (0.011)(0.011) Total Coauthored-0.010 ∗ 0.009 (0.004)(0.007) Fem x Total CA-0.0200.001 (0.017)(0.015) Books0.063 ∗ 0.058 (0.032)(0.035) Book chapters0.0070.005 (0.013)(0.012) Female0.0260.010 (0.114)(0.163) School FEYesYes Tenure Year FEYesYes Observations237209 This table shows the relationship between the number and types of papers an individual publishes and tenure for a sample of sociologists. The de- pendent variable is a binary variable indicating whether the individual re- ceived tenure 6-7 years after being hired at the initial tenure institution.To- tal first authoris the number of papers an individual is first author on while Fraction first authoris the fraction of an individual’s papers that s/he was first author on. The equations are estimated using a probit model and the marginal probabilities calculated at the mean are displayed. Standard er- rors, reported in parentheses, are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 50 Appendix B Additional Figures FIGUREB1: EXPERIMENT: QUIZ1 RESULTS Notes: These bar graphs show the distribution of scores on first math and grammar quizzes. The lines mark the means score of men (dashed line) and women (solid lines). The experiment participants who predicted scores saw these distributions with or without the lines, depending on whether they were in the gender distribution treatment. 51 Appendix C Institutions List Received faculty list:Brown, Columbia, Cornell, Duke, Harvard, Michigan State Uni- versity, New York University, Northwestern, Ohio State University, Penn State, Rutgers, Stanford, UC Berkeley, UC Davis, UC San Diego, UCLA, University of Virginia, University of Maryland, University of Michigan, University of Minnesota, University of Pennsylva- nia, University of Wisconsin-Madison No faculty list:Boston College, Boston University, California Institute of Technology, Georgetown, MIT, Princeton,University of Southern California, University of Chicago, University of Texas - Austin, University of Rochester, Vanderbilt, Yale 52 Appendix D Experiment Information D.1 Quizzes Grammar Quiz 1 1. The storm prevented ....... on a picnic. (a) us to going (b) us going (c) us to go (d) us from going 2. A man’s concept of liberty is different from ........ . (a) a woman’s (b) womens (c) a woman (d) woman’s 3. ........ hour went by before we received ........ invitation (a) an/an (b) a/a (c) an/a (d) a/an 4. When a subordinate clause is followed by the main clause, what is required? (a) a dash (b) a semi-colon (c) a period (d) a comma 5. ........ are used around a relative clause that defines the noun it follows. (a) Only commas (b) No commas (c) Semi-colons (d) Quotation marks 53 Grammar Quiz 2 1. I am dizzy and need to ........ down (a) lie (b) lay (c) lye (d) go lay 2. Which of these is not an article? (a) The (b) A (c) It (d) An 3. His idea is ........ mine (a) different to (b) different from (c) different than (d) different then 4. Adverbs can modify which of the following? (a) nouns (b) adjectives (c) pronouns (d) none of the above 5. ........ did you bump into? (a) Who (b) Whose (c) Who’s (d) Whom 54 Math Quiz 1 1. Which of the following is a subset of {b,c,d}? (a) { } (b) {a} (c) {1,2,3} (d) {a,b,c} 2. A man’s regular pay is $3 per hour up to 40 hours. Overtime is twice the payment for regular time. If we was paid $168, how many hours overtime did he work? (a) 8 (b) 16 (c) 28 (d) 48 3. 3 4/5 expressed as a decimal is (a) 3.40 (b) 3.45 (c) 3.50 (d) 3.80 4. Which of the following is the highest common factor of 18, 24, and 36? (a) 6 (b) 18 (c) 36 (d) 72 5. Given thataandbare integers, which of the following is not necessarily an integer? (a)2a−5b (b)a 7 (c)b a (d)ab 55 Math Quiz 2 1. Items bought by a trader for80areoldfor100. The project expressed as a percentage of cost price is (a) 2.5% (b) 20% (c) 25% (d) 50% 2. A man bought a shirt at a sale. He saves30onthenormalpricewhenhepaid120 for the shirt. What was the percentage discount on the shirt? (a) 20 (b) 25 (c) 33.33 (d) 80 3. How many subsets does {a,b,c,d,e} have? (a) 2 (b) 4 (c) 10 (d) 32 4. What is the median of the given data: 13, 16, 12, 14, 19, 14, 13, 14 (a) 14 (b) 19 (c) 12 (d) 14.5 5. In coordinate geometry, what is the equation of the x-axis? (a)y=0 (b)x=y (c)x=0 (d)y=1 56 D.2 Experiment Instructions Screen 1 INSTRUCTIONS: Please read all the way through This project seeks to understand how well individuals can predict a person’s future performance on a task based on his/her past performance. We recruited a group of people to complete two math [grammar] quizzes. Each quiz had five questions. Participants had one minute to complete each quiz. In what follows, we will show two participants’ scores from the first quiz. We then ask you to predict each participant’s score on the second quiz. We will provide you with some basic information on each individual. You will be paid $0.50 for your participation but will also be paid a bonus of $0.10 if you correctly guess a participant’s score on the second quiz. Screen 2 We will first show you the distribution of scores on the first quiz. Each bar represents the fraction of people who obtained that score. For example, 30% of people scored 4/5 on the first quiz. Gender Distribution TreatmentParticipants also see: The average score of female par- ticipants (2.5/5) is shown by the solid line. The average score of male participants (2.8/5) is shown by the dashed line. Screen 3 Solo TreatmentBelow we are showing you two individual’s test scores on Quiz 1, as well as some basic demographic information about each of them. Please predict each individual’s score on Quiz 2. You can view each quiz by clicking on the link below. Group TreatmentBelow we are showing you a team’s score on Quiz 1. Recall that each team member worked on the questions independently. We then take the sum of the two scores and assign it to the team. For example, if Person A scored 3/5 and Person B scored 4/5, their team score would be 7/10. We provide you with some basic demographic in- formation about each team member. Based on the team’s performance, please predict each individual’s score on Quiz 2. You can view each quiz by clicking on the link below. 57
Changed text
Open file
Gender Differences in Recognition for Group Work Heather Sarsons, Klarita Gërxhani, Ernesto Reuben, and Arthur Schram ∗ September 15, 2019 Abstract Does gender influence how credit for group work is allocated? Using data from academic economists’ CVs, we test whether coauthored and solo-authored publica- tions matter differently for tenure for men and women. Because coauthors are listed alphabetically in economics, coauthored papers do not provide specific information about each contributor’s skills or ability. Solo-authored papers, on the other hand, provide a relatively clear signal of ability. We find that conditional on publication quality and other observables, men are tenured at roughly the same rate regardless of whether they coauthor or solo-author. Women, however, become less likely to receive tenure the more they coauthor. The result is most pronounced for women coauthoring with men and less pronounced among women who coauthor with other women. Two experiments suggest that both stereotypes surrounding a task as well as the evalua- tors’ gender affect who receives credit. Taken together, our results are best explained by gender and stereotypes having a noticeable influence on the allocation of credit for group work. ∗ Sarsons, University of Chicago Booth (heather.sarsons@chicagobooth.edu); Gërxhani, European Uni- versity Institute; Reuben, New York University Abu Dhabi and LISER; Schram, Amsterdam School of Eco- nomics and European University Institute. Sarsons especially thanks Roland Fryer, Claudia Goldin, Larry Katz, David Laibson, and Amanda Pallais for their guidance and encouragement. We also thank Mitra Akhtari, Amitabh Chandra, John Coglianese, Oren Danieli, Ellora Derenoncourt, Florian Ederer, Ben Enke, Raissa Fabregas, Nicole Fortin, Nickolas Gagnon, Peter Ganong, Edward Glaeser, Siri Isaksson, Emir Ka- menica, Sara Lowes, Rob McMillan, Eduardo Montero, Gautam Rao, Alex Segura, Nihar Shah, Peter Tu, Jeroen van de Ven, Justin Wolfers, and various conference and seminar participants for their helpful com- ments and suggestions. 1 1 Introduction Do employers use gender when allocating credit for group work, particularly when in- dividual contributions are unobserved? Organizations increasingly rely on group work for production (Lazear and Shaw, 2007), yet there is little empirical evidence document- ing how credit for group work is allocated. Unless employers can perfectly observe each worker’s contribution to the team’s output, they must decide how to allocate credit with- out having full information as to what each worker did. This could leave room for demo- graphic characteristics, such as gender, to influence the allocation of credit. In this paper, we test whether uncertainty over an individual’s contribution to a project leads to differential attribution of credit that contributes to the gender promotion gap. In many industries, women are not only hired at lower rates than men are, they are also promoted at lower rates. 1 This paper explores whether gender differences in credit for group work exist and whether they explain part of the promotion gap. We primarily look at the tenure decisions of academic economists to test whether gen- der influences the allocation of credit for coauthored papers. Economics is a relevant setting as there is a large tenure gap between men and women, and because the amount of coauthoring has risen dramatically in recent years (Ginther and Kahn, 2004; Hammer- mesh, 2013). Using data from economists’ CVs, we track individuals’ career trajectories and compare whether the trajectory is different for individuals who coauthor versus solo- author, and whether there is a difference by gender. Within economics, we find that men and women who solo-author most of their work have similar tenure rates conditional on a proxy for the quality of papers. However, an additional coauthored paper is correlated with an 8.2% increase in tenure probability for men but only a 5.6% increase for women. This gap is significantly less pronounced for women who coauthor with women, suggesting that the attribution of credit is related to the gender mix of coauthors. Furthermore, a man who coauthors is no less likely to receive tenure than a comparable man who solo-authors even though there is presumably more uncertainty as to how much work he did. A counterfactual exercise suggests that this difference in credit allocation explains 60% of the unconditional gender gap in tenure rates and 84% of the gap that remains after controlling for average paper quality, citations, tenure and PhD institution ranks, and field. To ensure that we are not picking up on ability differences between men and women, we control for the quality of papers using both journal rankings and citations, allowing 1 Blau and DeVaro (2007), for example, find that across jobs, women are less likely to be promoted than men conditional on worker’s performance and ability ratings. In the UK, female managers are nearly 40% less likely to be promoted than male managers (Elmins et al., 2016). 2 for a comparison of men and women with similar research portfolios. The results are also robust to including other individual-level controls such as length of time to tenure and the seniority of one’s coauthors, as well as tenure year, tenure institution, and primary field fixed effects. We argue that these results are most consistent with a story of women receiving less credit for their joint work with men because of bias. To show this, we first use current CV and citation data to compare the productivity of men and women who did and did not receive tenure at the institution where they initially went up for tenure. While the estimates are imprecise, we find suggestive evidence that women who coauthor and are denied tenure produce more solo-authored papers that publish in high-ranking journals than men who are denied tenure. Data on citations show a similar result. We then rule out several alternative explanations for the empirical patterns. For ex- ample, several papers have demonstrated that selection into coauthorship in economics is not random. 2 We test for selection into coauthorship and do not find any evidence that women coauthor with high ability or more senior men. We also look at the timing of coau- thorship and find no evidence that women begin coauthoring if they have a slower start to their careers. The empirical patterns are also inconsistent with taste-basted discrimina- tion. Because the CV data do not allow us to rule out the possibility that women actually contribute less to papers that are coauthored with men, we conduct two experiments de- signed to test whether real or perceived differences in contributions drive credit allocation. In the first experiment, we first hire individuals to complete quizzes on topics that are ei- ther male or female-stereotyped. We then hire participants who act as “predictors” and are randomized into an individual treatment or a joint treatment. Predictors in the indi- vidual treatment are shown two individual’s separate quiz scores while predictors in the joint treatment are shown the combined score of two individuals. They are then asked to predict the performance of each participant on future quizzes. In the joint treatment, women are predicted to perform worse than their male counter- parts for male-stereotyped quizzes, suggesting that predictors believe that women con- tributed less to the combined score (that is, they performed worse). However, if pairs performed a female-stereotyped quiz, women and men are given equal credit. To under- stand whether these results are driven by participants’ beliefs about the ability distribu- tions of men and women, we randomly provide some participants with the distribution of scores on the initial quiz by gender. Women appear to be given equal credit in the female- stereotyped quiz because participants view it as being gender-neutral. That is, they do not 2 See, for example, Boschini and Sjögren (2007), Garcia and Sherman (2015), and Bikard et al (2015). 3 realize that women tend to outperform men. Showing participants the gender distribution of scores corrects this belief and women are then predicted to have a better performance in future female-stereotyped quizzes but it does not affect the predicted performance gap for women and men performing male-stereotyped tasks. The second experiment is conducted in a more natural setting with human resources personnel. Following a similar design, we again test whether women are less likely than men to receive credit for good group performance. We additionally elicit the HR person- nels’ beliefs about male and female performance and find that differences in the allocation of credit are largely driven by differences in beliefs. We also find that male HR personnel are more likely to hire in favor of men, and women in favor of women. This paper replicates and builds off of the results in Sarsons (2017), which shows the basic correlational patterns between paper composition and tenure. In this paper, we replicate the results using more data and then use the C.V. data and two experiments to es- tablish a channel through which gender influences the allocation of credit. The paper also relates to a large literature seeking to understand difference in labor market outcomes be- tween men and women. Factors such as productivity, personality and behavioural differ- ences (such as competition aversion), and fertility preferences have been shown to explain some differences in career choice and progression. 3 In academia in particular, studies have pointed to both supply-side factors, including differences in subject matter interest (Dy- nan and Rouse, 1997) and the availability of role models (Hale and Regev, 2014; Carrell et al., 2010); demand-side factors, such as implicit bias (Milkman et al., 2015; Moss-Racusin et al., 2012); and institutional factors (Antecol et al., 2018). This paper directly tests whether the differential treatment of work output contributes to the gender gap. The remainder of the paper is organized as follows. Section 2 describes the data and shows that a tenure gap exists between male and female economists. In Section 3, we show that the tenure gap closes as women produce more solo-authored papers but does not close as they produce more coauthored papers. Women have a consistently lower probability of tenure for each additional coauthored paper than men. We show that the results are robust to accounting for attrition, and to using different journal rankings and definitions of tenure. In Section 4, we argue that the results are in line with a story in which women receive less credit for joint work with men. We test alternative explanations of the relationship between coauthorship and tenure and argue that none can fully explain the observed empirical patterns. Section 5 discusses the design and results of the experiments. Section 6 concludes. 3 There is a large literature documenting gender differences in productivity, attitudes toward different types of work, and family choices. See, for example, Niederle and Vesterlund (2007), Buser et al. (2014), Antecol et al. (2018), Ceci et al. (2014), Reuben et al. (2017), and Ginther and Kahn (2004). 4 2 Data To examine the relationship between paper composition and tenure, we construct a dataset using the CVs of economists who came up for tenure between 1985 and 2014 at one of the top 35 U.S. PhD-granting universities 4 . The academic progression documented in the CVs makes it possible to evaluate the relationship between an individual’s research output and career progression. We can then compare the degree of collaborative work and reward for that work, and compare these results for men versus women. 2.1 Sample Selection and Data Overview We include only PhD-granting institutions in the sample as tenure evaluation at these schools is primarily based on research output, of which we have a clear measure. Other institutions like liberal arts colleges place greater weight on teaching ability for tenure, something that we cannot measure. We exclude business and public policy schools for similar reasons. 5 It is reasonable to assume that the top 35 economics departments in the U.S. emphasize research which is measured by the number and quality of papers one produces. One problem in collecting tenure information is that the CVs of individuals who went up for tenure, were denied it, and left to industry or government are difficult to find, lead- ing to a sample selection problem. To deal with this issue, we collected historical faculty lists from 23 of the 35 schools and locate over 90% of faculty who had ever gone up for tenure at these 23 institutions. For the remaining 12 schools that did not have historical faculty lists available, we looked at the top 75 U.S. institutions, the top 5 Canadian institu- tions, and the top 5 European institutions to locate anyone who went up for tenure at a top 35 U.S. school and then moved to another institution. We also checked economists’ CVs at the major Federal Reserve Boards and other large research institutes, such as Mathemat- ica, in the U.S. While there might still be a sample selection problem, we show in Section 3.2.1 that the results are robust to using only the sample for which we have historical faculty lists. From individuals’ CVs, we code where and when they received their PhDs, their em- ployment and publication history, and their primary and secondary fields. When looking at the relationship between publications and tenure in the main analysis, we only include 4 The list of institutions are taken from the RePEc/IDEAS Economics Department rankings. The list of schools included can be found in Appendix C. 5 Business and policy schools might also value teaching differently and put weight on different types of journals. 5 papers that were published up to and including the year an individual goes up for tenure. Book chapters are not included in the paper count. In a robustness check, we include papers that were published one and two years after tenure. To control for the quality of a person’s publications, we primarily use the “AER equiv- alent” ranking measure developed by Kalaitzidakis et al. (2003). This measure converts journal publications into their equivalent number of American Economic Review papers. 6 Less than 10% of journal articles cannot be converted because the journal does not appear in the ranking. In these cases we give the publication a ranking of zero. 7 Using the AER-equivalent measure instead of a list journal rank allows for different distances between journal ranks and for multiple journals to hold the same rank. For example, the top field journals can all hold the same rank. Other journal rankings force a ranking among these even though the journals might count the same amount toward tenure depending on one’s field. For robustness, we replace this paper quality measure with the RePEc/IDEAS ranking of economics journals in Section 3.2.2. Finally, we include citations, measured in 2015, of pre-tenure papers as a control vari- able. These citations were scraped from Google Scholar. We supplement this dataset with results from a survey designed to measure individu- als’ beliefs about the returns to various types of papers. The survey also contains informa- tion on how frequently individuals present their papers. The exact questions and nature of the survey are discussed in greater detail in Section 4. 2.2 Construction of Tenure To determine whether someone received tenure, we follow the guidelines on each school’s website (as of 2015) as to when tenure decisions are made. The majority of schools require faculty to apply for tenure 7 years after their initial appointment. We therefore consider years 6-8 to be the “tenure window” in which someone applies for tenure to account for people who go up for tenure early or late (because of a leave of absence, for example). We assume that an individual is denied tenure if s/he moves to a university ranked 5 positions below the initial institution during the tenure window. Similarly, we assume that an individual is denied tenure if s/he moves from academia to industry during the tenure window. Defining tenure in this way accounts for the fact that some people switch institutions 2-3 years after their initial appointment, not because they were denied tenure 6 The American Economic Review is regarded as one of the top journals in economics. Most journal publications are therefore converted to be some fraction of an AER paper. 7 If someone does not have any solo or coauthored papers, we set the relevant journal ranking to zero and include a dummy variable indicating that the individual has no solo (or coauthored) papers. This enables us to keep using the full sample. 6 but for personal preferences, and that some people might choose to move to a comparable school around the time of tenure even though they were offered tenure at their original institution. For example, someone who moves from MIT to Harvard after 7 years was presumably offered tenure at MIT but chose to move to Harvard for other reasons. As mentioned, a person who moves 5 or fewer years after his or her initial appointment is not assumed to have been denied tenure since s/he moved before the tenure window starts. If someone moves before the tenure window, we use the second institution they were at to determine tenure. For example, if a person’s first job is at University A but s/he moves to University B after three years, we use University B as the tenure institution but do not start the tenure clock over. We do not restart the clock because the data shows that in over 80% of cases, the individual still appears to go up for tenure within 8 years of his or her appointment at the first institution. However, we do extend this tenure clock in a robustness check. Individuals who move from an academic institution into industry before the tenure window are excluded from the sample. 2.3 Summary Statistics Table 1 presents summary statistics of the data. Approximately 68% of the full sample received tenure, but this masks a stark difference between men and women. Only 52% of women received tenure while 73% of men did. Total Papers,Solo-authored, andCoauthoredare the number of papers in each group that an individual had published by the time of tenure. These publication counts do not in- clude books or book chapters. Papers published in non-economics journals (such as a political science journal) are included but receive a ranking of 0 (the lowest ranking). The results are robust to excluding publications in non-economics journals. There is no statistically significant difference in the number of papers that men and women produce. Panel B looks at differences in the quality of papers. Men are no more likely to publish their papers in “Top 5” journals (American Economic Review, Economet- rica, Journal of Political Economy, Quarterly Journal of Economics, and The Review of Economic Studies) than women. The only statistically significant productivity difference is that men tend to publish their coauthored papers in slightly higher-ranking journals. Specifically, men’s coauthored papers have an average ranking of 0.34 AER-equivalents while women’s coauthored papers have an average ranking of 0.30 AER-equivalents. We therefore control for the quality of papers, measured using the AER-equivalent ranking as well as average citations, throughout the analysis. 7 Panel C displays differences in coauthoring patterns between men and women.Num- ber Unique CAsis the number of unique coauthors an individual has had by tenure. Men and women have roughly the same number of coauthors but there are some differences in the types of people men and women coauthor with. For example, women are less likely to coauthor with senior faculty and more likely to coauthor with other assistant professors. This could in part be driven by the fact that they are also more likely to coauthor with other women, many of whom are also junior professors. For illustrative purposes, we plot the number of women and men who have various combinations of solo and coauthored papers in Figure 1, as well as the average probability of receiving tenure for each paper combination in Figure 2. Most men and women have a similar combination of solo and coauthored papers. Figure 2 illustrates that individu- als with a large number of either solo or coauthored papers are likely to receive tenure. However, Panel A suggests that women with a higher fraction of their papers that are solo-authored have a better chance of receiving tenure than women with a mix of solo and coauthored papers. We examine this claim formally in the next section. 3 Empirical Strategy and Results 3.1 Main Results We show three main results. We first establish that a significant tenure gap exists between men and women. We then show that the gap becomes more pronounced the more women coauthor, and that women who solo-author all of their papers have comparable tenure rates to men. Finally, we show that the gender of a woman’s coauthor matters. Women who coauthor with other women do not suffer a coauthor penalty. 3.1.1 The Tenure Gap Figure 3 plots the coefficient ˆ β 1 from estimating T if st =β 1 TotPapers i +β 2 TotPapers 2 i +γ ′ Z i +θ f +θ s +θ t + if st (1) separately for men and women using OLS. The dependent variable,T if st , is an indicator that individualiin fieldfat schoolsreceives tenure in yeart.TotPapers i is the number of papers (both coauthored and solo-authored) individualihas at the time he or she went up for tenure. A quadratic in the number of papers is included to capture non-linearities in how publications matter for tenure. The vector of individual-level controls,Z i , includes 8 average journal rank (measured as average AER-equivalents), the log of total citations, the number of years it tookito go up for tenure, and the total number of coauthors oni’s papers. Tenure institution (θ s ), tenure year (θ t ), and field fixed effects (θ f ) are also included as tenure standards likely vary over time and by field and department. The figure shows that a significant tenure gap exists between men and women even after controlling for productivity, primary field, tenure institution, and tenure year. While an additional paper is correlated with a 13-16 percentage point increase in tenure proba- bility for men and women, women are consistently 10-13 percentage points less likely to receive tenure than men conditional on having written the same number and quality of papers. The lower intercept for women could stem from tenure committees starting with a lower prior about women’s ability. However, if all papers were clear signals of ability and tenure committees are Bayesian, we would expect the slope of the relationship between papers and tenure to be steeper for women. Put differently, if men and women received equal credit for papers, the coefficient onTotPapers i should be significantly larger for women than for men. We provide a formal test for the difference in slopes for men and women in Column 1 of Table 2, where we present the estimates from T if st =β 1 TotPapers i +β 2 fem i +β 3 (TotPapers i ×fem i ) +β 4 TotPapers 2 i +γ ′ Z i +θ f +θ s +θ t + if st (2) This is similar to estimating equation 1 except that we interact total papers with a female dummy,fem i rather than splitting the sample. There is no significant difference in the marginal benefit of an additional paper to men and women. 3.1.2 The Tenure Gap and Paper Composition To test whether coauthored papers matter differently for men and women, we separate papers into those that are solo-authored and those that are coauthored and estimate T ifst =β 1 S i +β 2 (fem i ×S i ) +β 3 CA i +β 4 (fem i ×CA i ) +δ 1 fem i +γ ′ Z i +θ f +θ s +θ t + ifst (3) using OLS. Here,S i andCA i are the number of solo-authored and coauthored papers an individual has at the time of tenure. The results are presented in Table 2. An additional solo-authored paper is associated with a 9.7 percentage point increase in men’s tenure rates and a 15.4 percentage point increase in women’s tenure rates (who start from a lower base tenure rate). If the lower initial tenure rate for women is due to employers holding the belief that women are lower 9 ability, it seems that the signals from solo papers begin to outweigh the employer’s prior. This is consistent with a model in which employers start with a lower prior about women and update as they receive clear signals about a woman’s ability, giving women full credit for this solo work. This is further discussed in the next section. If coauthored papers are an unclear signal of ability, an employer must make a judg- ment call as to how much each coauthor contributed to the paper which could lead to differential attribution of credit. Indeed, we see that while an additional coauthored pa- per helps both men and women, men benefit much more than women. Men’s tenure rates increase by 8.2 percentage points when they produce a coauthored paper whereas women’s increase by 5.6 percentage points. However, the fact that men benefit nearly as much from a coauthored paper as they do from a solo-authored paper is at odds with the story that employers are dividing credit for projects among authors. If employers do divide credit, not all men can get 100% of the credit, particularly for those papers coauthored with other men. 8 This result could point to an alternative mechanism. For example, if employers exhibit taste-based discrimination, they could use joint projects as an excuse to promote men over women. We discuss and test several such alternative stories in Section 4. The relationship between paper composition and tenure is summarized in Figure 4. This figure plots the relationship between the fraction of an individual’s papers that are solo-authored and tenure, controlling for the total number of papers, citations, journal quality, number of coauthors, and tenure institution, year, and field fixed effects. For men, it does not matter if one coauthors or solo-authors: tenure rates are comparable conditional on the quality of papers. Women who write all of their papers alone have similar tenure rates to men. However, women who coauthor all of their papers have an approximately 37% tenure rate, substantially lower than that of men who coauthor all of their papers ( 72%). The slope for women is ˆ β=0.521and is statistically significant at the 1% level (s.e.=0.158). 3.1.3 Does Coauthor Gender Matter? The probability of receiving tenure is not lower for all women who coauthor. In Table 3, we categorize coauthored papers into those written with only men, only women, or a mix of men and women: 8 It could be the case that because tenure committees are evaluating one person, they always assume that the man they evaluate deserves full credit for the paper (and we do not see the amount of credit they would have given to the other man). It is impossible to evaluate such theories with these data. 10 T if st =β 1 S i +β 2 (fem i ×S i ) +β 3 CAmale i +β 4 (fem×CAmale i ) +β 5 CAmix i +β 6 (fem×CAmix i ) +β 7 CAfem i +β 8 (fem i ×CAfem i ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + if st (4) As before,S i is the number of solo-authored papers individualihas at the time of tenure. CAfem i is the number of coauthored papers individualihas in which all of the coauthors are female. Similarly,CAmale i is the number of papersihas in which all of the coauthors are male andCAmix i is the number of papersihas in which the coauthors consist of men and women. The estimated coefficients on the interaction terms show that the negative relationship between coauthoring and tenure for women is driven almost entirely by papers that are coauthored with men. While a coauthored paper with another man is associated with an 8.7 percentage point increase in tenure probability for a man, it is associated with a 3.1 percentage point increase in tenure probability for a woman. 9 An additional paper with a woman, however, is associated with an 11.6 percentage point increase in tenure probabil- ity for a woman. While this estimate is imprecise due to sample size, we can say that an additional coauthored paper with a woman has a more positive impact on tenure than an additional coauthored paper with a man. Any explanation as to why women have lower tenure rates than men when they coauthor must therefore be correlated with coauthor gender. The estimates are robust to including all of the control variables discussed earlier. 3.1.4 Counterfactual Analysis We conduct a counterfactual analysis to estimate how much of the gender gap in tenure rates can be explained by the different treatment of coauthored papers. We first estimate T if st =β 1 S i +β 2 CA i +δ 1 fem i +γ ′ Z i +θ f +θ s +θ t + if st (5) and use the estimates to predict the probability of tenure, ˆ T i , for everyone in the sample. We then let the female dummyfem i be 0 for everyone and predict tenure rates again (call this ̃ T i ). The difference ˆ T i − ̃ T i gives the gender gap in tenure rates conditional on all observable characteristics but not allowing for differences in the marginal impact of solo 9 These results again show the puzzling pattern that the amount of credit that is divided among male coauthors adds up to more than one. 11 and coauthored papers for men and women. 10 We then repeat this exercise using the estimates from equation 4, first letting the female dummy equal one and then predicting tenure rates again letting the female dummy (and therefore all of the interactions) equal zero. This second set of predicted tenure probabil- ities tells us what women’s predicted tenure rate would be if their papers were treated in the same way that men’s papers are treated. The unconditional gender gap in tenure rates is 22 percentage points. The conditional gap in tenure rates from equation 5 is approximately 16 percentage points. Thus, observ- able characteristics such as differences in time to tenure and paper quality account for about 27% of the gap. The results from using equation 4 to predict tenure probabilities suggest that the gap would close by a further 13.5 percentage points if men and women’s papers were treated similarly. The different assignment of credit thus accounts for ap- proximately 60% of the unconditional tenure gap and 84% of the conditional gap. 3.2 Robustness Checks One may be concerned that the results are a product of the types of productivity measures used or are affected by missing data. In this section, we show that the results are robust to using only the sample for which we have historical faculty lists, to using different journal rankings, to accounting for papers published shortly after tenure, and to using different measures of paper counts. 11 3.2.1 Attrition The results will be biased if the sample excludes individuals who are denied tenure and go into industry, government, or other institutions where we do not observe them. This would be particularly problematic if men who go to industry after being denied tenure disproportionately coauthored their papers. If this is true, we would be overestimating the benefit of coauthoring for men. We would have a similar problem if women who go to industry after being denied tenure typically wrote solo-authored papers. As discussed in Section 2.1, we attempted to find such individuals by searching insti- tutions outside of the top 35 U.S. schools, federal reserves, and other research institutes. 10 Interacting all variables except for the number of solo/coauthored papers with the female dummy does not substantially change the results. 11 In Appendix Table A1, we also test whether the results vary by school rank and over time. The esti- mates suggest that the coauthoring penalty is driven largely by schools outside of the top 10, although the estimates are imprecise. The coauthorship penalty is also stronger in later years but again the estimates are imprecise. 12 To further allay concerns about sample selection, we run the analysis on the sample for which we received historical faculty lists. These lists allow me to track who went up for tenure and find them even if they left academia. The results, presented in Column 1 of Ta- ble 4, do not change when run on the sample for which there should be very few missing observations. The coefficient on theFemale×Coauthoredinteraction is significant only at the 10% level due to the smaller sample, but the direction and magnitude do not change. 3.2.2 Journal Rankings In the main analysis, we use a flexible journal ranking that allows multiple journals to hold the same rank. However, while the economics profession largely agrees on what the “top” journals are, rankings of field journals or lower-tier journals have changed over time. In Columns 2-4 of Table 4, we show that the results are robust to using three alternative journal ranking metrics as controls. In Column 2, we use the current RePEc-IDEAS journal ranking. This ranking forces a linear relationship between journals and tenure but also contains a larger number of journals. The main results do not change when using this ranking. In Column 3, we allow journal rankings to change over time. We use historical rank- ings of economics journals (drawn from Laband and Piette, 1994, and combined with current rankings) and match each paper with its journal ranking at the time it was pub- lished. Using these rankings accounts for journals moving in rank over time as well as new journals being added. The coefficient on theFemale×Coauthoredinteraction is slightly smaller but the same pattern persists. An additional coauthored paper is associated with an 8.1 percentage point increase in tenure probability for men and a 5.6 percentage point increase for women. In section 4, we also separate papers into “Top 5s” and “non-Top 5s”. Finally, in Column 4, we divide the AER-equivalent measure into deciles and control for the number of solo and coauthored papers an individual has in each decile. For ex- ample, if an individual publishes one solo-authored paper in the AER and another in the lowest-rank journal, she will have one paper in the tenth bin, one in the first bin, and zero in the others. Thus, instead of having a single coauthored or solo-authored paper rank control, we include ten variables controlling for the quality of an individual’s solo- authored papers (the number of solo papers in each AER-equivalent bin) and ten variables controlling for the quality of an individual’s coauthored paper (the number of coauthored papers in each AER-equivalent bin). Again, the results hold. 13 3.2.3 Tenure Definition In the main analysis, we only consider papers that were published up to and including the year that an individual goes up for tenure. If an individual goes up for tenure in 1995, for example, papers published in 1996 are not included in the paper count even though they may have been “revise and resubmits” at the time of tenure. This could affect the results if men who coauthor have several promising unpublished papers at the time of tenure but women who coauthor do not, in which case we are not actually comparing people with similar publication records. In Columns 5 and 6 of Table 4, we include papers that are published one and two years after a person’s tenure year in the paper count variables. The magnitude of the coefficients are smaller but the results do not change: women continue to benefit less from coauthored papers than men do. 3.2.4 Paper Count Variable While we control for journal quality, the main independent variables (number of solo and coauthored papers) may not accurately reflect how tenure committees decide on tenure cases. For example, institutions might trade off the quantity and quality of papers in different ways. In Column 7 of Table 4, we use an alternative measure for the number of papers. Specifically, after converting each publication to its AER-equivalent, we add up the AER-equivalent measure to give the total number of “AERs” an individual has at the time of tenure. For example, if an individual published two solo-authored papers and one is worth 0.25 AERs and the other worth 0.8 AERs, the individual will have 1.05 solo-authored AERs at the time of tenure. Again, the patterns are the same. An additional coauthored “AER” paper is correlated with an 8.9 percentage point increase in a man’s tenure probability but a 5.3 percentage point increase in a woman’s tenure probability. 3.3 Testing Against Other Disciplines and Coauthoring Conventions Many disciplines use different coauthoring conventions, such as listing authors in order of contribution. However, these disciplines differ on several other dimensions, such as the fraction of women in the disciplines and what is most important for tenure (publications, grants, conference proceedings, etc.). In Appendix A, we conduct the same analysis for a sample of sociologists, a discipline that order authors by contribution. The sample and re- sults are discussed in more detail in the Appendix, but we do not find evidence of women being penalized for coauthoring. What matters is being first author on a paper: being first author is correlated with a 5% increase in tenure probability for both men and women. 14 Because sociology differs from economics in many ways, though, it is difficult to interpret whether these results suggest that ordering authors by contribution helps eliminate bias or whether the larger presence of women helps to eliminate it. 4 Channels The previous section established three facts: 1. For very few papers, women have a lower tenure probability than men; 2. As women produce more solo-authored papers, their tenure probability converges to that of comparable men; 3. Women benefit less than men from work coauthored with men. There are several explanations for these patterns. In this section, we argue that the results are most consistent with a story of women receiving less credit for their joint work with men rather than a story of women contributing less when they work with men. We assume that tenure committees begin with the prior that women are on average of lower ability than men, and that solo-authored papers provide a clear signal of one’s ability whereas coauthored papers provide an unclear signal. Employers then misattribute credit for work produced by a man and a woman as the man is assumed to be of higher ability. We first test the claim by comparing the productivity of men and women who were denied tenure. We then explore and rule out several threats to this story. Specifically, we test for ability and preference-based sorting, women receiving less exposure by presenting less, and taste-based discrimination. Finally, we present evidence from two experiments designed to shut down the possibility that women put in less effort when working with men, and find additional evidence that women receive less credit than men when they perform a stereotypically male task. Moreover, the gender of the person attributing credit matters in this context, and we also find that women receive at least as much credit as men when they perform a stereotypically female task. 4.1 Do Men Get the Credit or Do Women Contribute Less? If tenure committees hold the prior that women are lower ability than men and if solo- authored papers provide clear signals of ability, we will see differences in tenure rates for men and women with few publications. However, additional solo-authored publications of the same quality will have a larger marginal impact on a woman’s tenure probability than a man’s. As these clear signals begin to dominate the committee’s prior, tenure rates between men and women will converge. 15 If committees are biased toward giving men more credit for work coauthored with women, we would expect to see the following. Assuming that there is some fixed amount of credit that can be given for a paper, a man will benefit more than a woman from joint work between them. In addition, both men and women will benefit more from their coau- thored work with women than their coauthored work with men, as two men who coau- thor will be assumed to have contributed similarly while a woman will be assumed to have contributed less. These two claims largely play out in the data. Table 2 shows that the marginal solo- authored paper helps women more than it helps men as they start from a lower baseline tenure rate. Table 3 shows that men benefit the most from coauthoring with women (an increase in tenure probability of 9.7% when coauthoring with a woman vs. 8.7% when coauthoring with a man) although this difference is insignificant. Similarly, women ben- efit more from coauthoring with other women than with men. One result that it is in- consistent with a story of credit allocation is the fact that the total amount of credit that can be allocated, at least when all coauthors are men, seems to add up to more than one. Men benefit as much from a coauthored paper as they do from a solo-authored paper, suggesting that tenure committees are either making a mistake when dividing credit (for example, each committee assumes that the male author under consideration for tenure at its school did most of the work), or that there is an alternative mechanism behind the results. In Section 4.2, we test several potential mechanisms. We would see these same empirical patterns if women contribute less to projects that are joint with men. Comparing the productivity of men and women who were denied tenure helps to disentangle these two stories. If women who coauthor are given less credit, then women who coauthor and are denied tenure should on average be more productive than men who are denied tenure. If women who coauthor simply contribute less, we would not expect to see productivity differences between men and women who are denied tenure. We use two productivity measures to test whether women who coauthor and are de- nied tenure are more productive than men: the number of solo-authored AER-equivalents an individual publishes after the tenure decision and the log number of citations an indi- vidual has as of 2015. 12 Individuals who leave academia and do not publish after tenure are excluded from the AER-equivalent outcome sample, but including them and setting their number of post-tenure papers to zero does not change the results. 12 Citations were scraped from Google scholar in 2015. For the top 5 papers outcome, we do not compare coauthored papers as these can reflect the ability of one’s coauthors. Citation data includes both solo and coauthored papers as the data came in this structure. 16 Table 5 shows the results from estimating Y if st =β 1 fem i +β 2 FracCA it +β 3 T i +β 4 (fem i ×FracCA it ) +β 5 (fem i ×T i ) +β 6 (FracCA it ×T i ) +β 7 (FracCA it ×T i ×Fem i ) +γ ′ Z ′ i +θ f +θ t +θ p + if st (6) where the outcome variableY if st is one of the two productivity measures described above andT i is a tenure dummy. We include a post-tenure institution fixed effect,θ p , to account for the fact that individuals will have access to different resources depending on where they go after the initial tenure decision. Column 1 shows the results from estimating equation 6 with the number of solo- authored AER-equivalents as the outcome. Women who are denied tenure and coauthor have 0.4 more solo-authored AER-equivalents than men who are denied tenure and coau- thor ( ˆ β 2 + ˆ β 4 ). Column 2, which has log citations as the outcome variable, shows a similar pattern although the results are much noisier. Together, these results provide some sug- gestive evidence that these women receive less credit for joint projects. 4.2 Alternative Stories There are other possible explanations for the above findings, not all of which can be tested with these particular data. Here we shed light on four standard and testable channels: ability-based sorting, preference-based sorting, women not claiming credit for their work, and taste-based discrimination. The empirical patterns are inconsistent with all of the proposed explanations. 4.2.1 Ability-Based Sorting Employers might rationally deny women who coauthor tenure if individuals sort such that only lower ability women coauthor with men. This could arise for several reasons. For example, if coauthoring lowers the cost of producing a paper, but women know that they receive less credit for papers, high ability women might forego the cost savings and choose to work alone. They know they can produce high quality papers by themselves and send the employer a clearer signal of their ability. However, if low ability women can only produce high quality papers with the help of a high ability man, they might coauthor even if they receive less credit. High ability men will agree to coauthor with them if it reduces the cost of the paper without reducing the quality. Employers would then know that any woman coauthoring with a man is lower ability. 17 In what follows, we test whether women anticipate receiving less credit, whether high ability women sort out of coauthoring with men, and whether men coauthor with women whose careers begin more slowly. To do so, we first present survey evidence suggesting that women do not know that the returns to coauthoring are lower than solo-authoring. We then show that women do receive some credit for papers that publish well, suggesting that employers might believe that there is some assortative matching. We also provide evidence that even when women tend to work with men who are slightly higher ability than themselves this unequal match does not explain the gender gap in tenure. Survey Evidence on Knowledge of Returns to CoauthoringIf women know that their returns to coauthoring with men are low, it is plausible that high ability women would choose to solo-author or only work with other women. Here we test whether women anticipate receiving less credit for collaborative work using a survey conducted with economists currently working at the top 35 U.S. economics departments. The survey was sent to all professors, regardless of rank, at these institutions and received an 32% response rate. The gender composition of the sample is representative of the profession today, with 89 respondents being female and 300 being male. In the survey, economists were asked the following question: Suppose a solo-authored AER increases your chance of receiving tenure by 15%. For each of the following, please give an estimate of how much you think the described paper would increase your chance of receiving tenure. Respondents then go through five types of papers (coauthored AER, coauthored AER with senior faculty, coauthored AER with junior faculty, solo-authored top field, and coau- thored top field) and record their beliefs about the returns to these papers. 13 In Table 6, we test the difference in the mean beliefs of men and women. 14 There is no statistically significant difference in the beliefs of men and women for any type of paper. Men believe that a coauthored AER will increase their chance of receiving tenure by 12.1%, and women by 12.2%. Women believe that there are slightly lower returns to AER papers coauthored with senior faculty (8.8% versus 9.1% for men), but the difference is not statistically significant. These results suggest that, in this context, women are unaware of the true returns to coauthoring. 13 We did not ask respondents about paper coauthored with men/women so that they would not be primed to think about gender 14 Because the survey was anonymous, the answers can not be linked to the CV data. We can therefore only test for differences in means without controls. 18 Evidence on Sorting by Ability from CVsA second test of whether women know that they will receive less credit for papers and sort accordingly is to look at the correlation between propensity to coauthor and ability. We first test whether high ability women are less likely to coauthor than low ability women and then test for assortative matching among coauthors. We proxy for ability using the quality of journal that an individual’s job market paper was published in. We assume that the job market paper is the first solo- authored paper an individual publishes after he or she graduates. If women anticipate discrimination, ability and the fraction of one’s papers that are coauthored will be negatively correlated. High ability women should be less likely to coauthor. In Figure 5.A we plot the coefficients ˆ β 1 and ˆ β 2 from estimating FracCA if st =β 1 a i +β 2 (fem i ×a i ) +β 3 fem i +β 4 TotPapers i +θ f +θ s +θ t + if st (7) whereFracCA if st is the fraction of personi’s papers that are coauthored anda i is person i’s ability (job market paper rank). If high ability women anticipate receiving less credit, we expect ˆ β 2 <0. In Figure 5.A, however, we see that ability is uncorrelated with the fraction of papers that are coauthored for both men and women: both estimates are precise zeros. There is no evidence that women along the ability distribution act strategically in their choice to coauthor versus solo author. We also find no evidence that high ability women strategically coauthor with other women rather than men. Figure 5.B plots the results from equation 7 using the fraction of papers that are coauthored with women as the dependent variable. Women are more likely to coauthor with other women than men are but there is no sorting by ability. While women do not seem to be sorting according to ability, it is possible that women tend to work with higher-ability or more prominent coauthors who then receive more credit for a paper. We test for this by correlating a person’s ability with that of his or her coauthors. While we do not have the job market paper information for all coauthors in the dataset, we can see where the coauthors were working at the time the individual went up for tenure. As a measure of average coauthor ability, we take the average school rank of all of an individual’s pre-tenure coauthors. For example, ificoauthors withjandk andjworks at the 5th-ranked institution andkworks at the 15th-ranked institution, the average ability ofi’s coauthors is 10. We correlatei’s ability with the average ability of her coauthors in Figure 6. The line of best fit is plotted controlling for number of coauthored and solo-authored publications, time until tenure, and field, institution, and tenure year fixed effects. Men and women both sort positively on ability but women are more likely to collab- 19 orate with individuals at more highly-ranked institutions than men are. To see whether this explains the main results, we estimate T if st =β 1 S i +β 2 (fem i ×S i ) +β 3 CA i +β 4 (fem i ×CA i ) +β 5 rank iJ +β 6 (CA i ×rank iJ ) +β 7 (fem i ×CA i ×rank iJ ) +β 8 (fem i ×rank iJ ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + if st (8) whererank iJ is the average institution rank ofi’s coauthors and all other variables are defined as before. The results are reported in Table 7. If men receive more credit because they are coauthoring with lower ability women, ˆ β 7 should be negative. However, ˆ β 7 is close to zero, indicating that the ability or prominence of one’s coauthor is not driving the tenure gap for coauthoring women. Returns to Top PapersFor high ability women to receive no credit for their coauthored papers, employers would have to believe that there is no assortative matching by ability. Otherwise, employers would receive a signal that women who coauthor with high abil- ity men are also high ability, and be more likely to promote them. Figure 6 shows that assortative matching does occur, but it is possible that employers do not recognize this. We test for this by looking at how credit for top 5 publications is allocated. If employ- ers know that there is assortative matching, they should believe that women coauthoring with high-ability men are also likely to be high ability. Table 8 shows the results from estimating T ifst =β 1 TopS i +β 2 (fem i ×TopS i ) +β 3 TopCA i +β 4 (fem i ×TopCA i ) +β 5 NonTopS i +β 6 NonTopCA i +β 7 (fem i ×NonTopS i ) +β 8 (fem i ×NonTopCA i ) +β 9 fem i +γ ′ Z i +θ f +θ s +θ t + ifst (9) whereTopS i andTopCA i are the number of solo and coauthored papers that individuali has published in a top 5 journal. Similarly,NonTopS i andNonTopCA i are the number of solo and coauthored papers the individual has published in non-top 5 journals. In Table 8, the “nop-top 5” interaction terms are presented in the second column. Power becomes an issue as (1) there are relatively few people publishing in the top 5 journals, and (2) cutting by gender means that there are even fewer women in each category. Table 8 shows that coauthored papers published in a top 5 journal help women much more than those published in non-top 5 journals. Non-top 5 coauthored papers do not have any positive influence on women’s tenure probability. It seems that employers re- 20 ceive some signal when a woman publishes her coauthored papers in top journals which is at odds with the hypothesis that only low ability women coauthor with men. Overall, there is little evidence that ability-based sorting is driving the results. 15 If anything, employers seem to recognize that high ability men and women might work together and are therefore more likely to grant these women tenure. However, their tenure rate is still lower than that of high ability men. 4.2.2 Preference-Based Sorting If women prefer to coauthor with senior faculty, we could reasonably expect that women would have lower tenure rates. Assuming senior faculty are more likely to be credited for a paper, the fact that most senior faculty are men would drive the correlation between coauthoring with a man and tenure. That is, women receive less credit because they enjoy coauthoring with senior faculty and these senior faculty are predominantly male. The basic summary statistics showed that women were not more likely to coauthor with senior faculty than men. However, we conduct an additional test as to whether coauthorship with senior faculty could be driving the results. We reestimate equation 3 but control for the fraction of a person’s coauthors who are senior. The results are pre- sented in Column 3 of Table 7. The seniority of women’s coauthors does not explain the results. Controlling for seniority, an additional coauthored paper increases a man’s prob- ability of tenure by 8 percentage points but a woman’s by 5 percentage points. 4.2.3 Timing of Coauthorship It is possible that men offer to work with women who are struggling to publish. If this is the case, we should see women who have few publications in the early years of their ap- pointment being more likely to coauthor with men. We test for this possibility by looking at differences in early publications and by testing whether women with a longer time lag between their initial appointment and first publication are more likely to coauthor with men. Appendix Figure B1 descriptively shows the timing of publications for men and women, split by whether they received tenure at their initial tenure institution. More formally, we 15 Garcia and Serman (2015) show that there could be selection into coauthorship driven by a desire to be first author on a paper (that is, depending on where you are in the alphabet relative to your coauthors). This would be an issue in this setting if, for example, men are more likely to be strategic than woman and are therefore more likely to be first author on a paper (which is correlated with having more citations). We test whether men are more likely to be first author on their papers than women and whether men have a “higher” author position overall. We find that men in our sample are first author 57% of the time while women are first author 55% of the time (p=0.907). 21 test whether women have fewer publications early in their careers by estimating Y if st =β 1 Fem i +β 2 T is +β 3 (Fem i ×T is ) +β 4 Papers i +β 5 ̄q i +θ f +θ s +θ t + if st (10) whereY if st is the number of years between individuali’s initial appointment andi’s first post-appointment publication. 16 We test whether women who did not receive tenure had a longer publishing lag by interacting the female dummy term with an indicator for re- ceiving tenure at schools,T is . We control for the number of papers published pre-tenure (Papers i ) and the average quality of those papers ( ̄q i ). All other variables are defined as before. The results are presented in Table 9. Women who do not receive tenure do have a longer lag (approximately 0.5 years) between their first appointment and their first publi- cation although the result is noisily estimated. We test whether women with a longer lag are more likely to coauthor with men by estimating FracM if st =β 1 Fem i +β 2 T is +β 3 (Fem i ×T is ) +β 4 Y i +β 5 (Fem i ×Y i ) +β 6 (Fem i ×T is ×Y i ) +β 4 Papers i +β 5 ̄q i +θ f +θ s +θ t + if st (11) where the outcome variable,Y i in equation 10 is used as a regressor. If men bring women with a slow start to publishing onto their projects, we would expect to see ˆ β 5 >0. The results, presented in Column 2 of Table 9, do not support the hypothesis that women who struggle to publish initially are more likely to begin publishing with men. The coefficient onβ 5 is negative, suggesting that women with a longer publishing lag are less likely to coauthor with men although this result is again insignificant. 4.2.4 Women Not Claiming Credit for Papers Women might be given less credit for their work if they are less likely to claim it as their own. 17 For example, if women present less frequently than men, people might associate a paper with the male coauthor who presents it more. The survey discussed in Section 4.2.1 also asked individuals how many times per year they present their work and whether they are more or less likely to present their coauthored papers than their coauthor. Panel B of Table 6 shows that men and women report the same likelihood of presenting their joint papers relative to their coauthors. Interestingly, though, women present their solo- authored papers fewer times per year than men do. It is possible that women do not 16 We exclude papers that were published before the person’s first appointment. 17 Isaksson (2019) finds experimental evidence that women often claim less credit than men for their con- tributions to solving puzzles. 22 “advertise” their work as much as men do and this leads to women receiving less recog- nition for their work in general. If this were true, though, women who solo author should also be less likely to receive tenure. 4.2.5 Taste-Based Discrimination If some employers have a distaste for tenuring women, as in Becker (1971), we should see women who write solo-authored papers being denied tenure as well. If employers cannot plausibly deny a woman who solo-authored several well-published papers, however, they might be constrained to deny tenure only to those for whom they can make a reasonable case. If it can be argued that a woman who coauthors did little of the work, taste-based discrimination could help to explain the results as employers have an excuse for denying tenure to coauthoring women. However, as shown in Table 3, only women who coauthor with men have lower tenure rates. This would imply that employers have a particular distaste for tenuring women who coauthor with men, which seems unlikely. 5 Experimental Evidence In the previous section, we provided suggestive evidence that factors like sorting and taste-based discrimination do not explain why women who coauthor with men are less likely to receive tenure. We instead argue that the results are most consistent with women receiving less credit for joint work with men. Specifically, because coauthored papers are an unclear signal of ability, women receive less credit for their joint work with men if they are believed to be of lower ability (Correll and Ridgeway, 2003). We cannot rule out, though, that real or perceived differences in effort explain the results. For example, tenure committees might hold the belief that women contribute less or provide lower effort when they work with men, regardless of their beliefs about a woman’s ability. In addition, tenure committees might believe that low ability women choose to work with high ability men even if the empirical evidence suggests otherwise. To shed light on whether different contributions to group work (or perceptions of dif- ferential contributions) and sorting are driving the results, we run two experiments de- signed to shut down these channels. The experiments also allow us to assess the role of beliefs about ability more directly. The first experiment is an artefactual experiment run on mTurk. The second is a framed field experiment for which we recruited individuals who work in human resources and whose job is to recruit personnel. Although these settings are different from academia, they provide additional evidence that gender plays a role in 23 the allocation of credit due to differences in beliefs about the ability of men and women. The first does so in a relatively abstract setting with high control, while the second adds more context from the process of hiring candidates (see Harrison and List, 2004). Both experiments consist of two incentivized parts. In the first step, workers are re- cruited to complete tasks individually. In the second step, designed to test whether people misallocate credit for joint work, another set of individuals are recruited to either predict how well the workers will do on a second set of related tasks (Experiment I) or to choose a worker to hire to perform a second set of tasks (Experiment II). In both experiments, we vary whether the predictors/hirers see workers’ individual scores in the first task, or the sum of two or more individuals’ scores. 5.1 Experiment I The first experiment consists of two incentivized parts. In the first step, mTurk workers, henceforth referred to as “workers” are recruited to complete two related quizzes (Quiz 1 and Quiz 2). 18 We then recruit 506 mTurk participants, referred to as “predictors”, to pre- dict the Quiz 2 scores of a randomly-chosen man and a randomly-chosen woman on Quiz 2. Before making their predictions, the predictors are told that the workers completed the two quizzes on their own, and are shown information on Quiz 1. Specifically, they see the questions asked, the overall score distribution (not broken out by gender), and informa- tion about the Quiz 1 scores of the two workers they will be making predictions about. Predictors are then shown the Quiz 2 questions and are asked to estimate the score of both workers in Quiz 2. Predictors are paid a participation fee of $0.50 and receive $0.10 for each score they correctly predict. The instructions given to predictors are available in Appendix D. This experiment uses a 2x2x2 treatment design, described in detail below. 5.1.1 Treatments Individual vs. Joint ScoresPredictors are randomized into an Individual treatment or a Joint treatment. In the Individual treatment, predictors are shown the individual score of each of the two workers in Quiz 1. This treatment tests whether predictors correctly predict scores when they see a clear signal of each worker’s ability. This parallels the solo- author paper analysis: if predictors correctly assign credit when they see a clear signal of ability, there should be no difference in how men and women are evaluated conditional on Quiz 1 scores. 18 Workers receive a participation fee of $0.30 plus $0.05 for each question they answer correctly. The quizzes contain five questions each and are available in Appendix D. 24 In the Joint treatment, predictors are shown thesumof the scores of the two workers. For example, if worker A scored 3 out of 5 and worker B scored 4 out of 5, the predic- tor would see the score 7 out of 10 for that pair. Importantly, predictors are told that there was no interaction between workers: each worker completed the same quiz and was paid according to his or her individual score. Thus, predictors know that workers are randomly paired with a member of the opposite sex but worked independently and were individually incentivized. This treatment is designed to understand how predictors assign credit for performance when they cannot observe individual contributions, but in a setting where there is no selection into pairs (such as high-ability men working with low-ability women) or free-riding. Therefore, the predictors’ estimates should reflect only their beliefs about each worker’s score and ability since they know that workers com- pleted the quizzes individually and were individually incentivized. To draw a parallel between this treatment and the main analysis, the individual scores that make up the joint score can be thought of as each person’s “contribution” to a group project that, in this case, is unaffected by selection or effort. No-Information vs. Gender-InformationTo understand whether predictors’ estimates are driven by (possibly incorrect) beliefs about ability or taste-based animus, we provided some predictors with information about the performance of men and women. In the No-Information treatment, the only aggregate information predictors receive is the over- all score distribution. In the Gender-Information treatment, predictors are additionally shown the average score of male and female workers. 19 If predictors exhibit taste-based animus, providing them with information about men and women’s average performance will not change their predictions. In addition, compar- ing these treatments helps to understand whether differences in attribution are driven by incorrect beliefs about gender differences in performance. If participants hold mistaken beliefs about men and women’s average performances, the gender information treatment should correct those beliefs, and the predictors should adjust their estimates accordingly. Male-Stereotyped vs. Female-Stereotyped quizzesTo evaluate whether differences in credit for joint work depends on the type of task that is being performed, workers in the Male-Stereotyped treatment completed math quizzes while workers in the Female- Stereotyped treatment completed grammar quizzes. 19 The overall score distribution was presented as a histogram. In the Gender-Information treatment, the histogram contained lines indicating the mean performance of men and women. See Figure D1 in Appendix D. 25 5.1.2 Results The main experimental results are presented in Table 10, which shows how predictors’ guesses vary based on the quiz-taker’s gender and the treatment. Specifically, we estimate Q2 ij =β 1 fem i +β 2 D j +β 3 (fem i ×D j ) +β 4 Q1 i + ij (12) separately for the sample of individuals who took math quizzes (Columns 1 and 2) and grammar quizzes (Columns 3 and 4), and by joint/individual treatment. The outcome variable,Q2 ij , is predictorj’s estimate of quiz-takeri’s Quiz 2 score. An indicator for the quiz-taker being female,fem i , is interacted with an indicator for the predictor being in the Gender-Information treatment,D j . We also control fori’s Quiz 1 score (Q1 i ). In the Individual treatment, there is no significant difference in men’s and women’s es- timated performance in the math quiz. In this treatment, predictors base their estimations on the observed individual scores. 20 By contrast, in the Group treatment, where predictors see only the sum of a man’s and a woman’s Quiz 1 score, they predict that women scored less than men on the second math quiz. This mirrors the finding that women suffer a coau- thor penalty when their contribution to a paper is unobserved but are not discriminated against when their contributions are observed, as in solo-authored papers. Predicting that the woman will do worse than the man in the Joint treatment suggests that predictors be- lieve that the woman’s first score was lower; that is, she is worse at the task and therefore contributed less to the joint score. Puzzlingly, showing predictors the mean scores of male and female workers does not change the predictions for the second math quiz. Together, these results suggest that predictors hold a prior that men are better than women at the math quiz, and the evidence that men are only slightly better does not affect this belief. The results for the grammar quiz in Columns 3 and 4 suggest that the results are not driven by taste-based animus in which women are always penalized in collaborative situ- ations. Here, women are not predicted to perform differently than men on the second quiz in both the Individual and Joint treatments. In addition, seeing that the mean grammar score of female workers is higher than that of male workers creates a gender difference in predicted scores in favor of women. 20 Women had a lower average score than men on the math quizzes (2.51/5 vs. 2.72/5) and a higher average score on the grammar quizzes (2.41 vs. 2.17). The distribution of scores on Quiz 1 are shown in Appendix Figure D1. This is the same figure that predictors are shown. If predictors are in the Gender- Information treatment, they also see the two lines indicating the mean male and female scores. 26 5.2 Experiment II The second experiment was designed to study attribution of credit for joint work in a setting that more closely approximates a hiring scenario. In addition, this experiment allows us to test for gender differences based on the recruiter’s gender, and to test more directly whether beliefs affect credit attribution. Before conducting the experiment, we collected individual characteristics from univer- sity students (henceforth referred to as job candidates), along with their performance in two incentivized real-effort tasks. The experiment itself was conducted with 479 actual human resource workers whose job is to recruit personnel. The HR workers, henceforth “recruiters”, were asked to choose job candidates for a task based on short resumes. The “recruiters” complete an incentivized online survey. Each recruiter is sequentially shown three sets of four candidates’ resumes. Recruiters pick one candidate from each set and are paid according to the chosen candidate’s score in the real-effort task. 21 After their choices, the recruiters’ belief about relative gender differences in ability is elicited. Specifically, they are asked to indicate the degree to which they think men or women are better at the real-effort task. 22 Therefore, Experiment II does not try to induce beliefs by providing information about scores, as in Experiment I. Instead, it elicits beliefs to observe whether beliefs are biased and to evaluate the extent to which individual beliefs affect the recruiters’ choices. The experiment uses a 2x2 design. 5.2.1 Treatments Individual vs. Joint scoresAs in Experiment I, recruiters are randomized into an In- dividual or a Joint treatment. In the Individual treatment, recruiters see the individual scores of all four candidates in a set. In the Joint treatment, recruiters see two summed scores (the sum of candidate 1’s and 2’s score, and the sum of candidates 3’s and 4’s score). The sets are chosen such that one of the summed scores is obviously superior to the other to give recruiters a strong incentive to choose one of these two candidates. These superior candidate pairs always include a male and a female whose resumes are otherwise alike. 23 The pair of inferior candidates may vary on all characteristics but had much lower joint 21 Excluding the participation fee, recruiters earned an average of $6 to complete the ten-minute experi- ment. 22 Answers ranged from a difference in means of 4 or more points in favor of men to 4 or more points in favor of women. Choosing the correct answer is rewarded with $1.50. The correct answer was calculated based on the actual scores of candidates in the tasks. For more details, see Appendix D. 23 In addition to their scores, the resume of each candidate shows the candidate’s field of study, degree length (from three to five years), age, gender, and geographic region of origin. See Appendix D for more details and an example of a set of candidates. 27 scores. Search vs. Vocabulary tasksFor the real-effort tasks, recruiters were randomized to pick candidates who performed a vocabulary task (finding words using a set of provided let- ters) or a numeric-search task (finding the highest numbers in each of two 10x10 ma- trices and adding them up, as in Weber and Schram, 2017). The tasks are described in more detail in Appendix D. 24 Compared to Experiment I, these tasks are arguably less stereotypical and have been shown to exhibit little to no gender difference in performance (Schram et al., 2019; Shurchkov, 2012). Because men and women perform similarly on these tasks, using them provides us with a stronger test of whether incorrect beliefs about performance drive credit allocation. 5.2.2 Results When recruiters are informed about the individual scores, the candidate’s gender does not affect the recruiters’ choices, which are primarily determined by individual scores (for details, see Appendix D). This mirrors the result in the Individual treatment in Experiment I and the observation that women and men receive equal credit for solo-authored papers. To investigate credit attribution in the Joint treatment, we use McFadden’s random- utility model to explain the binary choice of whether or not to select a candidate under the restriction that only one out of four candidates can be chosen in a set (McFadden, 1974). More specifically, we assume that the utility of recruiterjfrom choosing candidate iin setkis given by u jik =β 1 fem ik +β 2 (fem ik ×Belief j ) +β 3 Score ik +γ ′ Z ik +θ jk + jik ,(13) wherefem ik is an indicator that candidateiin setkis female,Score ik is candidatei’s joint score in the task, andBelief j is recruiterj’s belief about the difference in mean scores between men and women (constructed such that zero implies a belief of no gender differ- ences in mean scores, and positive (negative) values imply a belief that men (women) are better). The vector of controls,Z ik , include all the other elements of candidatei’s resume, whileθ jk correspond to fixed effects for each recruiter-set combination. Recruiteripicks the candidatejthat gives the highest utility in setk. The random variable jik is assumed to have an extreme value distribution, which allows us to estimate the model using a con- ditional logistic regression. The estimation results are presented in Table 11 as odds ratios. 24 Recruiters received $0.06 for each point in the vocabulary task by the chosen candidate or $0.15 for each correct addition in the numerical search task. 28 Column 1 contains the results for the search task and Column 2 for the vocabulary task. The results for all recruiters show that they are much more likely to pick candidates in pairs with a high joint score (i.e., from the superior pair in the set). On average, the gender of the candidate does not have a significant impact on the likelihood of being chosen in either task. However, this is no longer the case once recruiters are divided according to their gender. Columns 3 to 6 show the estimation results for male recruiters and Columns 7 to 10 for female recruiters. Columns 3 and 7 reveal that male recruiters are less likely to pick female candidates, though the odds ratio is not significantly different from one in the numeric-search task. This aligns with the results of the male-stereotyped quiz in Experiment I and the data on co-authorship in Economics, where men are given more credit for joint work. By contrast, Columns 7 and 9 show that female recruiters are significantlymorelikely to pick a female than a male candidate. We will return to this surprising result below. For both male and female recruiters, the task does not appear to matter much as the odds ratios for the female indicator are quite similar across the search and vocabulary tasks. Columns 4, 6, 8, and 10 show the estimation results once beliefs about gender differences in scores are introduced in the regressions. In all cases, recruiters who believe men are better than women at the task are significantly less likely to pick a female candidate (and vice-versa). Correcting for beliefs brings all the odds ratios of the female indicator closer to 1 (a significant effect for gender remains only for male recruiters in the vocabulary task), which suggests that the observed biases in credit attribution for joint work are largely mediated by the recruiters’ beliefs about which gender is better. 25 5.3 Discussion While the experimental context is different from the academic context, the results provide evidence that, even after shutting down effort and selection channels, individuals make different inferences about men and women’s contributions to a joint project that are rooted in beliefs. Our two experiments differ along two dimensions: the stereotypical nature of the tasks used (higher in Experiment I) and the specificity concerning the hiring context (higher in Experiment II). In addition, we distinguish between the gender of the recruiter in Exper- iment II. Despite these differences, both experiments provide evidence of a bias against women when attributing credit to joint work. Experiment I shows that this can depend on the task under consideration, with women receiving less credit for stereotypically male 25 Our data show (see Appendix D) that the beliefs of male recruiters are not significantly biased towards either gender in either task. By contrast, on average, female recruiters expect female candidates to do better than male candidates in both tasks. 29 tasks and men receiving less credit for stereotypically female tasks. Experiment II, on the other hand, shows that the credit-attribution bias can depend on the gender of the re- cruiter. Male recruiters exhibit this bias against women. Female recruiters, however, show the opposite bias by attributing more credit to women than to men for joint work. In both experiments, the differences can be explained by beliefs about which gender is better at a task. 26 The patterns observed in our two experiments can be used to re-evaluate the re- sults on tenure decisions in economics. In fact, if we assume that such decisions are made primarily by men and that economics is seen as a stereotypically-male discipline, then our experimental results would predict the bias observed in attributing credit to co-authored papers. 27 Our experimental results further suggest that this bias is caused by (incorrect) beliefs about the male and female co-authors contributions to joint work. 6 Conclusion Women receive tenure at significantly lower rates than men in many academic fields. As discussed in the introduction, this phenomenon is not exclusive to academia. Several explanations have been put forward for the gap, but it persists even after accounting for observable characteristics such as fertility preferences and productivity. This paper proposes an alternative explanation. We argue that women receive less credit for group work when employers can not perfectly observe their contribution. When signals are noisy, employers have to infer each worker’s ability or productivity. Coau- thored papers provide employers with a noisy signal. The fact that women who work specifically with men receive tenure at lower rates than comparable women who work alone or with other women suggests that gender enters into the employer’s inference pro- cess. However, when employers receive clear signals, men and women are treated simi- larly. For example, men and women receive the same amount of credit for solo-authored papers, which provide a clear signal of ability. Evidence from two experiments suggests that these results are not explained by sorting or differences in effort to group work. The experiments further suggest that this phenomenon is not specific to women as men also suffer a penalty when working with women on a female-stereotyped task. Finally, the 26 However, even after we account for beliefs, by providing information about Quiz 1 scores in Experiment I or controlling for measured beliefs in Experiment II, we observe too little credit is attributed to women by predictors in the math quiz of Experiment I and male recruiters in the vocabulary task of Experiment II. In other words, we find suggestive evidence that both beliefs about ability and taste-based animus can play a role. 27 These results are also consistent with the lack of evidence of women being penalized for coauthoring in sociology (see Appendix A), a discipline with relatively more women and thus less likely stereotypically- male than economics. 30 gender of the person assigning credit also influences credit attribution. Being aware of this phenomenon is important in a world that is increasingly relying on group work for production. The tech industry, for example, prides itself on collabora- tion. In such male-dominated fields, however, group work could result in fewer women moving up the career ladder if credit is not properly attributed. The same could be true for men in female-dominated industries. The unequal attribution of credit would then contribute to and help maintain gender segregation in occupations. 31 References [1] Antecol, Heather, Kelly Bedard, and Jenna Stearns. 2018. "Equal but Inequitable: Who Benefits from Gender-Neutral Tenure Clock Stopping Policies?" American Economic Review, 108 (9): 2420-41. [2] Becker, Gary. 1971.The Economics of Discrimination.Chicago: University of Chicago Press. [3] Bikard, Michael, Fiona Murray, and Joshua Gans. 2015. "Exploring Trade-offs in the Organization of Scientific Work: Collaboration and Scientific Reward." Management Science, 61(7): 1473-1740. [4] Blau, Francine D. and Jed DeVaro. 2007. "New Evidence on Gender Differences in Pro- motion Rates: An Empirical Analysis of a Sample of New Hires." Industrial Relations, 46(3): 511-550. [5] Boschini, Anne and Ana Sjögren. 2007. "Is Team Formation Gender Neutral? Evidence from Coauthorship Patterns." Journal of Labor Economics, 25(2): 325-365. [6] Buser, Thomas, Muriel Niederle, and H Oosterbeek. 2014. "Gender, Competitiveness, and Career Choices." Quarterly Journal of Economics, 129 (3): 1409-47. [7] Carrell, Scott E., Marianne E. Page, and James E. West. 2010. "Sex and Science: How Professor Gender Perpetuates the Gender Gap." Quarterly Journal of Economics, 125(3): 1101-1144. [8] Ceci, Stephen J., Donna K. Ginther, Shulamit Kahn, and Wendy M. Williams. 2014. "Women in Academic Science: A Changing Landscape." Psychological Science in the Public Interest, 15(3): 1-67. [9] Correll, Shelley J. and Cecilia L. Ridgeway. 2003. "Expectation States Theory." Pages 29-51 inThe Handbook of Social Psychology, ed. John Delamater. New York: Kluwer Academic Press. [10] Dynan, Karen and Cecilia Rouse. 1997. "The Underrepresentation of Women in Eco- nomics: A Study of Undergraduate Economics Students." The Journal of Economic Education, 28(4): 350-368. [11] Elmins, William, Robert Joyce, and Monica Costa Dias. 2016. "The Gender Wage Gap." Institute for Fiscal Studies Briefing Note 186. [12] Garcia, Daniel and Joshua Sherman. 2015. "Norms and Team Formation: Evidence from Research Partnerships." Working paper. [13] Ginther, Donna K. and Shulamit Kahn. 2004. "Women in Economics: Moving Up or Falling Off the Academic Career Ladder?" Journal of Economic Perspectives, 18(3): 193-214. 32 [14] Hale, Galina and Tali Regev. 2014. "Gender Ratios at Top PhD Programs in Eco- nomics." Economics of Education Review, 41: 55-70. [15] Hammermesh, Daniel. 2013. "Six Decades of Top Economics Publishing: Who and How?" Journal of Economic Literature, 51(1): 162-172. [16] Harrison, Glenn W. and John List. 2004. "Field Experiments" Journal of Economic Literature, 42(4): 1009-1055. [17] Isaksson, Siri. 2019. "It Takes Two: Gender Differences in Group Work." Working paper. [18] Kalaitzidakis, Pantelis, Theofanis Mamuneas, and Thanasis Stengos. 2003. "Rankings of Academic Journals and Institutions in Economics." Journal of the European Eco- nomic Assocation, 1(6): 1346-1366. [19] Lazear, Edward and Kathryn Shaw. 2007. "Personnel Economics: The Economist’s View of Human Resources." Journal of Economic Perspectives, 21(4): 91-114. [20] McFadden, Daniel. 1973. "Conditional logit analysis of qualitative choice behavior." In P. Zarembka (ed.),Frontiers in Econometrics. New York: Academic Press. [21] Milkman, K.L., M. Akinola, and D. Chugh. 2015. "What Happens Before? A Field Experiment Exploring How Pay and Representation Differentially Shape Bias on the Pathway into Organizations." Journal of Applied Psychology, 100(6): 1678-1712. [22] Moss-Racusin, Corinne, John Dovidio, Victoria Brescoll, Mark Graham, and Jo Han- delsman. 2012. "Science Faculty’s Subtle Gender Biases Favor Male Students." PNAS, 109(41): 16474-16479. [23] Niederle, Muriel and Lise Vesterlund. 2007. "Do Women Shy Away From Competi- tion? Do Men Compete Too Much?" Quarterly Journal of Economics, 122(3): 1067- 1101. [24] Reuben, Ernesto, Matthew Wiswall, and Basit Zafar. 2017. "Preferences and Biases in Educational Choices and Labour Market Expectations: Shrinking the Black Box of Gender." The Economic Journal, 127 (604): 2153-86. [25] Sarsons, Heather. 2017. "Recognition for Group Work:Gender Differences in Academia." AER Papers & Proceedings, 107(5): 141-145. [26] Schram, Arthur, Jordi Brandts, and Klarita Gërxhani. 2019. "Social-status ranking: a hidden channel to gender inequality under competition." Experimental Economics, 22(2): 396-418 [27] Shurchkov, Olga. 2012. "Under Pressure: Gender Differences in Output Quality and Quantity Under Competition and Time Constraints." Journal of the European Eco- nomic Association, 10: 1189-1213. [28] Weber, Matthias and Arthur Schram. 2017. "The Non-Equivalence of Labour Market Taxes: A Real-Effort Experiment." The Economic Journal, 127(604): 2187-2215. 33 Figures FIGURE1: DISTRIBUTION OFPAPERCOMBINATIONS Notes: This figure shows the number of women (Panel A) and men (Panel B) who had various combinations of solo and coauthored papers at the time of tenure. Each dot represents a specific combination of papers with the number of coauthored papers measured on the x-axis and the number of solo-authored papers measured on the y-axis. The shading of the dots represents how many individuals had that combination of papers at the time they went up for tenure, with darker shades indicating a larger number of individuals with that combination. In the legend, “n” is the minimum and maximum number of individuals who have a specific paper combination. Panel A is constructed using the full sample of women (N=143) and Panel B is constructed using the full sample of men (N=501). 34 FIGURE2: TENUREPROBABILITIES BYPAPERCOMBINATIONS Notes: This figure plots the unconditional tenure probability for women (Panel A) and men (Panel B) who have various combinations of papers at the time they go up for tenure. Coauthored papers are counted along the x-axis and solo-authored papers are counted along the y-axis. A darker shade indicates a higher probability of receiving tenure. For example, if a dot is the darkest shade, it indicates that individuals with that combination of solo and coauthored papers receives tenure with probability one. Panel A is constructed using the full sample of women (N=143) and Panel B is constructed using the full sample of men (N=501). 35 FIGURE3: TOTALPAPERS ANDTENURE Notes: This binned scatterplot shows the correlation between the total number of publications an individual has at the time they go up for tenure and the probability of receiving tenure. The y-variable, tenure, is a binary variable that equals one if an individual received tenure at their initial institution of employment. For more details on how the tenure variable is constructed, see Section 2. To construct the plot, tenure is first residualized with respect to the following controls: number of years it took to go up for tenure, average journal rank of pre-tenure publications, log citations, total coauthors, and tenure school, tenure year, and field fixed effects. The x-variable, number of publications, is then divided into twenty equal-sized groups. Within each of these groups, we plot the mean of the y-variable (tenure) residuals against the mean of the x-variable (also within each bin). We then add back the unconditional mean of Tenure to help with the interpretation of the line of best fit. The lines of best fit are estimated using the full sample (N=621) and have slopes ofβ=0.132(s.e. = 0.016) for men andβ=0.165(s.e. = 0.043) for women. 36 FIGURE4: RELATIONSHIPBETWEENPAPERCOMPOSITION ANDTENURE Notes: This figure is a binned scatterplot of the correlation between tenure and the fraction of an individual’s papers that are solo-authored, split by gender. The y-variable is a binary variable indicating whether an individual received tenure. To construct the plot, tenure is first residualized with respect to the following controls: total number of papers an individual published by the time of tenure, number of years it took to go up for tenure, average journal rank of pre-tenure publications, log citations, total coauthors, and tenure school, tenure year, and field fixed effects. The x-variable, fraction of papers that are solo-authored, is then divided into twenty equal-sized groups. Within each of these groups, we plot the mean of the y-variable (tenure) residuals against the mean of the x-variable (also within each bin). We then add back the unconditional mean of Tenure to help with the interpretation of the line of best fit. The line of best fit using OLS is shown separately for men and women. The lines of best fit are estimated using the full sample (N=621) and have slopes ofβ=0.521(s.e. = 0.158) for women andβ=0.023(s.e. = 0.748) for men. 37 FIGURE5: ABILITY ANDSORTING Notes: This binned scatterplot shows the correlation between an individual’s ability and the propensity to coauthor (Panel A) or the propensity to coauthor with women (Panel B). The outcome variable in Panel A is the fraction of an individual’s papers that were published by tenure that are coauthored. The outcome variable in Panel B is the fraction of an individual’s pre-tenure papers that are coauthored with only women. We proxy for an individual’s ability with the rank of the journal in which the individual’s job market paper was published. The plot is constructed as described in Figure 3 with the y-variable residualized on the following controls before plotting: total solo and coauthored papers, the number of years it took to go up for tenure, log citations, and tenure school, tenure year, and field fixed effects. The lines of best fit using OLS are shown separately for men and women. The estimates for Fig. 5A areβ=−0.0001(s.e. = 0.0003) for women andβ=0.0002(s.e. = 0.0002) for men. The estimates for Fig. 5B areβ=−0.00004(s.e. = 0.0008) for women andβ=0.0002(s.e. = 0.0003) for men. 38 FIGURE6: ASSORTATIVEMATCHING Notes: This binned scatterplot shows the correlation between an individual’s ability, proxied by the journal in which their job market paper is published, and their coauthor’s ability, proxied by the average school rank of their coauthors. The school rank of coauthors are measured at the time that individualiwent up for tenure. School rankings are taken from IDEAS/RePEc. The plot is constructed as described in Figure 3 with the y-variable residualized on the following controls before plotting: total solo and coauthored papers, the number of years it took to go up for tenure, log citations, and tenure school, tenure year, and field fixed effects. The line of best fit using OLS is shown separately for men and women. The lines of best fit are estimated on the full sample and have slopes of β=0.062(s.e. = 0.091) for women andβ=0.109(s.e. = 0.056) for men. 39 Tables TABLE1: SUMMARYSTATISTICS FullMaleFemalep-value Panel A: Tenure0.680.730.520.001 (0.47)(0.44)(0.50) Years to tenure6.86.67.30.001 (1.6)(1.6)(1.8) Total papers8.38.48.00.262 (3.9)(4.1)(3.3) Solo-authored3.03.03.00.879 (2.4)(2.4)(2.3) Coauthored5.35.45.00.189 (3.6)(3.7)(3.1) Panel B: Top 5 Solo0.670.660.680.900 (0.98)(0.99)(0.92) Top 5 Coauthored1.31.31.20.570 (1.4)(1.4)(1.4) AER Equivalent: Solo Pubs.0.340.340.330.500 (0.24)(0.23)(0.25) Coauthored Pubs.0.330.340.300.039 (0.20)(0.21)(0.18) Panel C Number Unique CAs4.524.554.470.767 (2.79)(2.78)(2.83) Frac. coauthors who are: Full Professor0.460.470.410.052 (0.35)(0.33)(0.38) Associate Professor0.160.150.160.810 (0.24)(0.23)(0.28) Assistant Professor0.250.230.280.060 (0.24)(0.22)(0.30) Graduate Student0.0170.0150.0210.239 (0.067)(0.056)(0.095) Female0.130.0940.2700.001 (0.23)(0.179)(0.309) Observations644501143 This table displays the average tenure rate, pre-tenure productivity, and pre-tenure authorship patterns of men and women who went up for tenure at one of top 35 U.S. economics departments between 1985 and 2014. The top 35 institutions are taken determined according to the RePEc/IDEAS economics de- partment rankings. In Panel A, Tenure is an indicator that equals one if an individual was promoted to associate or full professor 6-8 years after his or her initial appointment. Years to tenure is the number of years between an individual’s PhD graduation year and the year s/he went up for tenure. All paper counts are measured as the number of papers an individual had published at the time of tenure. Top 5 Solo/Coauthored are the number of publications an individual had published in one of the top 5 eco- nomics journals: AER, QJE, Econometrica, JPE, and ReStud.AER Equivalentis a measure that converts an individual’s publications into the number of AER-equivalent publications they correspond to. For more details on this variable, see Section 2. Number Unique CAs is the number of different coauthors an individual had published with by the time s/he went up for tenure. Coauthor positions (full, asso- ciate, assistant, and graduate student) are the positions an individual’s coauthors had at the time that individual went up for tenure. 40 TABLE2: RELATIONSHIPBETWEENPAPERS& TENURE Outcome Variable: Tenure (1)(2)(3)(4)(5) Sample:FullFullFullFemaleMale Total papers0.142 ∗∗∗ (0.016) Fem x Papers-0.005 (0.012) Solo-authored0.094 ∗∗∗ 0.097 ∗∗∗ 0.196 ∗∗∗ 0.095 ∗∗∗ (0.013)(0.019)(0.055)(0.024) Fem x Solo0.048 ∗∗∗ 0.057 ∗∗∗ (0.018)(0.015) Coauthored0.085 ∗∗∗ 0.082 ∗∗∗ -0.0310.090 ∗∗∗ (0.016)(0.014)(0.054)(0.016) Fem x Coauthored-0.030 ∗ -0.026 ∗ (0.016)(0.015) Total coauthors-0.0050.0010.0030.025-0.001 (0.004)(0.005)(0.005)(0.016)(0.006) Total Papers Sq-0.004 ∗∗∗ (0.001) Solo Papers Sq-0.005 ∗∗∗ -0.005 ∗∗∗ -0.007-0.005 ∗∗ (0.001)(0.002)(0.005)(0.002) Coauthored Sq-0.003 ∗∗∗ -0.003 ∗∗∗ 0.000-0.003 ∗∗∗ (0.001)(0.001)(0.003)(0.001) Log Citations0.059 ∗∗∗ 0.031 ∗∗ 0.065 ∗∗∗ 0.098 ∗ 0.058 ∗∗∗ (0.012)(0.012)(0.013)(0.054)(0.016) AER Equiv. Ranking0.533 ∗∗∗ (0.116) AER Equiv. Solo0.139 ∗ 0.331 ∗∗∗ 0.3210.416 ∗∗∗ (0.071)(0.069)(0.206)(0.091) AER Equiv. CA0.201 ∗∗ 0.325 ∗∗∗ 0.486 ∗ 0.280 ∗∗∗ (0.099)(0.073)(0.248)(0.089) Female-0.135-0.166-0.205 ∗ (0.105)(0.121)(0.109) Tenure Inst. FEYNYYY Tenure Year FEYNYYY Field FEYNYYY Observations625629621139482 R-squared0.4170.2870.4250.5210.421 This table shows the relationship between publications and tenure. The dependent variable, Tenure, is binary and indicates whether an individual received tenure 6-8 years after being hired at the initial tenure institution. Total papers is the number of papers an individual published by the time s/he went up for tenure. Solo-authored and Coauthored are the number of solo or coauthored papers s/he had published at the time of tenure. AER Equiv. Ranking, AER Equiv. Solo, and AER Equiv. CA are journal quality measures described in Section 2. Total coauthors is the number of coauthors an individual had on the papers s/he had published by the time of tenure. Tenure length is the number of years it took the individual to go up for tenure. Citations are from Google Scholar and measured in 2017. The equations are estimated using a linear probability model. Bootstrapped standard errors are clustered by tenure institution and reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 41 TABLE3: COAUTHORGENDER (1) ×Female Solo-authored0.093 ∗∗∗ 0.049 ∗∗ (0.019)(0.015) CA with only fem CAs0.097 ∗∗∗ 0.019 (0.024)(0.020) CA with only male CAs0.087 ∗∗∗ -0.056 ∗∗∗ (0.015)(0.015) Pubs. with M and F CAs0.087 ∗∗ 0.033 (0.026)(0.042) Female-0.156 (0.101) Total coauthors-0.001 (0.004) Log Citations0.064 ∗∗∗ (0.014) AER Equiv. CA0.332 ∗∗∗ (0.073) AER Equiv. Solo0.328 ∗∗∗ (0.065) Tenure Inst. FEYes Tenure Year FEYes Field FEYes Observations621 This table presents the results of one regression where the variables that are in- teracted with Female (a dummy indicating that the researcher is a woman) are displayed in the right-hand column.Papers with only fem CAsis the number of publications an individual has in which all coauthors are female. Similarly, Papers with only male CAsandPapers with male and fem CAsare the number of publications with only male coauthors and with a mix of male and female coau- thors respectively. Controls for tenure length; quadratics in the number of papers; and tenure institution, year, and field fixed effects are also included. The equa- tions is estimated using a linear probability model. Bootstrapped standard errors are reported in parentheses and are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 42 T ABLE 4: R OBUSTNESS C HECKS Faculty Journal Rankings Publication Count Total AERs List Sample RePEc Over Time AER Bins Tenure +1 Tenure +2 (1) (2) (3) (4) (5) (6) (7) Solo-authored 0.115 ∗∗∗ 0.083 ∗∗∗ 0.078 ∗∗∗ 0.078 ∗∗∗ 0.058 ∗∗∗ 0.038 ∗∗∗ 0.060 ∗∗∗ (0.024) (0.018) (0.017) (0.019) (0.017) (0.013) (0.019) Fem x Solo 0.050 ∗∗ 0.055 ∗∗∗ 0.052 ∗∗∗ 0.045 ∗∗∗ 0.044 ∗∗∗ 0.055 ∗∗∗ 0.091 ∗ (0.019) (0.015) (0.015) (0.015) (0.014) (0.013) (0.024) Coauthored 0.092 ∗∗∗ 0.079 ∗∗∗ 0.081 ∗∗∗ 0.069 ∗∗∗ 0.038 ∗∗∗ 0.031 ∗∗∗ 0.089 ∗∗∗ (0.023) (0.015) (0.016) (0.020) (0.011) (0.008) (0.011) Fem x Coauthored -0.032 ∗ -0.026 ∗ -0.025 ∗ -0.029 ∗ -0.022 ∗ -0.013 -0.036 ∗ (0.019) (0.014) (0.014) (0.015) (0.013) (0.013) (0.019) Years to Tenure -0.046 ∗∗∗ -0.054 ∗∗∗ -0.055 ∗∗∗ -0.051 ∗∗∗ -0.046 ∗∗∗ -0.045 ∗∗∗ -0.050 ∗∗∗ (0.009) (0.008) (0.008) (0.009) (0.008) (0.008) (0.011) Total Coauthors -0.001 0.004 0.004 -0.004 0.008 ∗ 0.010 ∗∗ 0.003 (0.007) (0.005) (0.005) (0.006) (0.004) (0.004) (0.005) Log Citations 0.056 ∗∗∗ 0.070 ∗∗∗ 0.074 ∗∗∗ 0.079 ∗∗∗ 0.072 ∗∗∗ 0.074 ∗∗∗ 0.094 ∗∗∗ (0.017) (0.013) (0.013) (0.013) (0.014) (0.013) (0.016) CA Paper Rank 0.310 ∗∗∗ 0.003 ∗∗ 0.003 ∗∗∗ 0.345 ∗∗∗ 0.345 ∗∗∗ (0.084) (0.001) (0.001) (0.081) (0.081) Solo Paper Rank 0.457 ∗∗∗ 0.002 ∗ 0.004 ∗∗∗ 0.291 ∗∗∗ 0.299 ∗∗∗ (0.077) (0.001) (0.001) (0.069) (0.065) Female -0.158 -0.193 ∗ -0.197 ∗∗ -0.175 ∗ -0.197 ∗ -0.280 ∗∗∗ -0.199 ∗∗ (0.128) (0.102) (0.099) (0.100) (0.105) (0.110) (0.048) Observations 369 621 621 621 621 621 621 The dependent variable in all columns is an indicator for receiving tenure. Column 1 restricts the sample to those schools we received a historical faculty list from. Column 2 uses RePEc journal rankings as the paper quality measure. The ranking used can be found at https://ideas.repec.org/top/top.journals.all.html.Column 3 uses historical journal rankings from RePEc to allow for rankings to change over time and to account for new journals entering. Column 4 controls forthe number of papers within each of 10 AER “bins”. For this analysis, the AER-equivalent measure is divided into deciles. For each individual, we then add upthe number of solo and coauthored papers within each decile and include the number of papers in each bin as controls. In Columns 5 and 6, we include papersthat were published one and two years after an individual went up for tenure in the paper counts. In Column 7, we use the AER Equivalent measure of journalranking to calculate the total number of AER equivalents (solo and coauthored) an individual had at the time of tenure. We use this measure in place of the soloand coauthored paper counts (the main independent variables). All regressions control for a quadratic in the number of papers as well as tenure institution, tenureyear, and field fixed effects. Bootstrapped standard errors are reported in parentheses and are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 43 TABLE5: FUTUREPRODUCTIVITY Outcome Var:Post TenureLog Citations Solo AER Equivalents (1)(2) PoissonOLS Fraction Coauthored-1.45 ∗∗∗ 0.533 (0.500)(0.390) Female-0.232-0.151 (0.380)(0.414) Female×Frac. Coauthored1.057 ∗ 0.742 (0.576)(0.660) Tenured0.1940.496 ∗ (0.352)(0.289) Tenured×Frac. Coauthored0.0020.408 (0.006)(0.486) Female×Tenured0.1850.210 (0.528)(0.509) Fem×Tenured×Frac. Coauthored-0.991-0.740 (1.131)(0.769) Top 5 Coauthored0.013 ∗∗ (0.007) Total papers0.071 ∗∗∗ (0.015) Tenure Inst. FENY Post-Tenure Inst. FEYN Tenure Year FEYY Field FEYY Observations621621 Column 1 shows the results from estimating equation 6 using a zero-inflated Poisson model, where the outcome vari- able is the number of solo-authored AER equivalents an individual published after the tenure decision (measured as of 2017). “Top 5 Coauthored” is the number of coauthored AER equivalents the individual published after tenure. Post-tenure institution is the institution the individual went to following the tenure decision. For people who re- ceived tenure, this the same as the tenure institution. Column 2 shows the results from estimating the same equation using OLS where log citations is the outcome variable. Citations are measured in 2015. Robust standard errors are reported in parentheses and are clustered at the tenure institution or post-tenure institution level. (*=p<0.10, **=p<0.05 ,***=p<0.01) 44 TABLE6: SURVEYRESULTS (1)(2)(3) Men Women p-value Panel A: Beliefs about Returns to Papers Coauthored AER12.112.20.939 Coauthored AER, Sr. Faculty9.18.80.528 Coauthored AER, Jr. Faculty13.313.40.796 Solo Top Field8.08.20.669 Coauthored Top Field6.36.80.223 Panel B: Frequency of Presenting Papers Times Presented3.12.20.071 Present More Freq. than CA0.370.440.203 Observations30089 This table presents the mean responses for men and women to the following survey questions: Panel A: “Suppose a solo authored AER increases your chance of receiving tenure by 15 percent. By how much do you think each of the following increases your change of receiving tenure?” Panel B: “How many times per year do you typically present your solo-authored papers? Are you more or less likely than your coauthors to present a joint paper?”Present More Freq. than CAis the fraction of respondents who reported that they are more likely than their coauthors to present a joint paper. The survey was conducted with a sample of academic economists currently working at a top 35 U.S. economics department. Respondents were anonymous. 45 TABLE7: ACCOUNTING FORSORTING Dep. Variable: Tenure (1)(2)(3) Solo-authored0.086 ∗∗∗ 0.084 ∗∗∗ 0.090 ∗∗∗ (0.017)(0.017)(0.017) Fem x Solo0.064 ∗∗∗ 0.067 ∗∗∗ 0.062 ∗∗∗ (0.017)(0.018)(0.017) Coauthored0.087 ∗∗∗ 0.089 ∗∗∗ 0.082 ∗∗∗ (0.014)(0.014)(0.015) Fem x Coauthored-0.032 ∗ -0.032 ∗ -0.033 ∗∗ (0.016)(0.016)(0.015) Female-0.220-0.348 ∗ -0.262 ∗ (0.121)(0.133)(0.136) Rank Difference0.001 (0.002) Fem×Rank Difference-0.001 (0.002) Avg. Coauthor Rank-0.002 (0.001) Fem×Avg. Coauthor Rank0.003 (0.002) Frac. Full Prof.-0.057 (0.071) Fem×Frac. Full Prof.0.194 (0.067) Observations595595595 The dependent variable in all columns is an indicator for receiving tenure. Column (1) shows the relationship between solo and coauthored papers and tenure when controlling for the difference between individuali’s institution rank and the average institution rank of his or her coauthors. Column (2) controls for the average institution rank of an individual’s coauthors, and column (3) controls for the fraction of an individual’s coauthors who are full professors. Only coauthors that an individual coauthored with up until tenure are included. All regressions control for tenure length, journal rank (AER equivalent measure), and log citations. They also include tenure institution, tenure year, and field fixed effects. The sample size is smaller in this analysis because individuals with no coauthors are excluded. (*=p<0.10, **=p<0.05 ,***=p<0.01) 46 TABLE8: PAPERSPLIT BYTOP5 Dep Var: Tenure (1) Top 5Non-Top 5 Solo0.067 ∗∗∗ 0.033 ∗∗∗ (0.019)(0.007) Coauthored0.086 ∗∗ 0.031 ∗∗∗ (0.016)(0.007) Fem x Solo0.0200.055 ∗∗ (0.037)(0.019) Fem x Coauthored-0.007-0.035 ∗∗ (0.031)(0.017) Female-0.171 (0.108) Total coauthors-0.002 (0.005) Years to tenure-0.049 ∗∗∗ (0.008) Log Citations0.079 ∗∗∗ (0.012) Tenure Inst. FEY Tenure Year FEY Field FEY Observations621 R-squared0.415 This table presents the results from estimating equation 9. The results in the able are from this single regression, but solo and coauthored papers are split into those published in the top 5 jour- nals (Column 1) and journals below the top 5 (Column 2). Top 5 papers are those published in the American Economic Review, Econometrica, the Journal of Political Economy, Quarterly Journal of Economics, or the Review of Economic Studies. The dependent variable is an indicator for receiving tenure. The regression in- cludes tenure institution, tenure year, and field fixed effects. Robust standard errors are clustered by tenure institution and reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 47 TABLE9: TIMING OFCOAUTHORSHIP WITHMEN Years to First Fraction of Papers Publicationwith Men (1)(2) Female0.0540.062 (0.186)(0.080) Tenure-0.0020.060 (0.130)(0.047) Female×Tenure-0.151-0.258 ∗∗ (0.237)(0.089) Years to 1st Pub.0.016 (0.015) Fem×Years to 1st Pub.-0.013 (0.032) Tenure×Years to 1st Pub.-0.030 (0.018) Fem×Tenure×Years to 1st Pub.0.017 (0.041) Total papers-0.119 ∗∗∗ 0.009 ∗ (0.015)(0.003) AER Equiv.-0.3800.251 ∗∗ (0.344)(0.081) School FEYY Tenure Year FEYY Primary Field FEYY Observations603594 This table tests whether there are gender differences in the timing of an individual’s first publication (Col- umn 1) and whether women who take a longer time to publish their first paper are more likely to coauthor with men (Column 2). The outcome variable in Column 1 is the number of years it takes an individual to publish his or her first paper after graduating, and is measured as the year of the individual’s first pub- lication minus the year of the individual’s initial faculty appointment. Articles published before the first appointment (i.e. during graduate school) are not counted. The outcome variable in Column 2 is the frac- tion of an individual’s papers published by tenure that are coauthored with men. The independent vari- able,Y earsto1stPubis the outcome variable in Column 1. Both regressions include tenure institution, tenure year, and field fixed effects. Robust standard errors are reported in parentheses. (*=p<0.10, **=p<0.05 ,***=p<0.01) 48 TABLE10: EXPERIMENTI PREDICTEDSCORE BYQUIZTYPE Dep. Var.: Predicted Quiz 2 ScoreMathGrammar Ind.JointInd.Joint (1)(2)(3)(4) Female0.111-0.243 ∗∗ -0.0210.103 (0.081)(0.118)(0.071)(0.114) Gender-Information0.1450.076-0.241 ∗∗ -0.456 ∗∗∗ (0.097)(0.134)(0.112)(0.130) Female×Gender-Information-0.108-0.1060.1940.738 ∗∗∗ (0.115)(0.154)(0.131)(0.153) Quiz 1 Score0.735 ∗∗∗ 0.0200.725 ∗∗∗ 0.016 (0.057)(0.051)(0.066)(0.053) Constant0.1373.432 ∗∗∗ 0.2893.246 ∗∗∗ (0.212)(0.361)(0.248)(0.386) Observations250266231262 Predictors125133116131 R-squared0.2980.0410.2390.139 This table presents the results from Experiment I in which participants predict how well an individual did on a math or grammar quiz based on that individual’s performance on an earlier quiz. Columns 1 and 2 show the results for the math quiz and Columns 3 and 4 show the results from the grammar quiz. In the experiment, participants were randomized into the Individual treatment, where participants saw each individual’s score on a previous quiz (Columns 1 and 3), or the Joint treatment, where participants saw the sum of two individuals’ scores (Columns 2 and 4). Gender-Information is a dummy indicating that participants were told the average quiz scores of all men and women. (*=p<0.10, **=p<0.05 ,***=p<0.01) 49 T ABLE 11: E XPERIMENT II O DDS R ATIOS OF B EING P ICKED BY T ASK AND R ECRUITER G ENDER Dep. Var.: All recruiters Male recruiters Female recruiters Picked by recruiter Search Vocab. Search Vocabulary Search Vocabulary (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Female 1.120 1.041 0.890 0.922 0.750 ∗∗ 0.723 ∗∗ 1.341 ∗∗ 1.157 1.390 ∗∗ 1.237 (0.108) (0.099) (0.136) (0.142) (0.104) (0.098) (0.167) (0.152) (0.181) (0.178) Female × Belief 0.694 ∗∗∗ 0.724 ∗∗∗ 0.760 ∗∗∗ 0.844* (0.072) (0.072) (0.081) (0.086) Joint Score 1.173 ∗∗∗ 1.032 ∗∗∗ 1.151 ∗∗∗ 1.149 ∗∗∗ 1.036 ∗∗∗ 1.036 ∗∗∗ 1.190 ∗∗∗ 1.189 ∗∗∗ 1.032 ∗∗∗ 1.032 ∗∗∗ (0.026) (0.003) (0.036) (0.037) (0.005) (0.005) (0.036) (0.037) (0.005) (0.005) Candidate resumé controls Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Set × recruiter fixed effects Yes Yes Yes Yes Yes Yes Yes Yes Yes Yes Observations 3144 2172 1368 1368 996 996 1776 1776 1176 1176 Recruiters 262 181 114 114 83 83 148 148 98 98 This table presents the results from Experiment II in which human resource recruiters pick one candidate out of four for a search or vocabulary task based on short resumes. Columns 1, 3-4, and 7-8 show the results for the search task and Columns 2, 5-6, and 9-10 show the results for the vocabulary task. Results are shown separately depending on the recruiter’s gender: all recruiters inColumns 1-2, male recruiters in Columns 3-6, and female recruiters in Columns 7-10. All regressions include fixed effects for each set-recruiter combination and controls for other variables in thecandidates’ resumes. Results are presented as odds ratios. Standard errors are clustered at the recruiter level. (*=p<0.10, **=p<0.05 ,***=p<0.01) 50 Appendix A Additional Tables TABLEA1: RESULTS BYINSTITUTION ANDYEAR Panel A: Tenure Institution Institution Rank:Top 10Top 20Top 35 (1)(2)(3) Solo-authored0.031 ∗∗∗ 0.053 ∗∗∗ 0.039 ∗∗ (0.006)(0.018)(0.018) Coauthored0.035 ∗∗ 0.052 ∗∗∗ 0.023 ∗∗∗ (0.013)(0.016)(0.007) Fem x Coauthored0.002-0.048 ∗∗ -0.048 ∗ (0.027)(0.020)(0.026) Fem x Solo0.074 ∗ 0.071 ∗ 0.104 ∗∗∗ (0.035)(0.037)(0.035) Female-0.471 ∗ -0.048-0.245 (0.247)(0.173)(0.243) Observations211157155 Panel B: Tenure Year Tenure Year:1985-19951996-20052006-2014 (1)(2)(3) Solo-authored0.034 ∗∗∗ 0.043 ∗∗ 0.033 ∗ (0.010)(0.021)(0.019) Coauthored0.018 ∗ 0.049 ∗∗∗ 0.047 ∗∗∗ (0.010)(0.011)(0.015) Fem x Coauthored0.011-0.047 ∗∗ -0.053 ∗ (0.041)(0.022)(0.027) Fem x Solo0.145 ∗∗∗ 0.079 ∗∗∗ 0.054 (0.037)(0.029)(0.042) Female-0.787 ∗∗∗ -0.219-0.003 (0.275)(0.160)(0.202) Observations141157215 Panel A shows the relationship between coauthoring and tenure by tenure institution rank. Schools are divided into the top 10, top 20, and top 35 departments, according to the RePEc rankings. All regressions include the following controls: time until tenure, number of coau- thors, log citations, solo and coauthored journal rankings, and tenure year and field fixed effects. Panel B shows the relationship splitting the sample by time period. The year groups are the years that an individual went up for tenure. All regressions include the following controls: time until tenure, number of coauthors, log citations, solo and coauthored journal rankings, and tenure rank and field fixed effects. (*=p<0.10, **=p<0.05 ,***=p<0.01) 51 Sociology Results The sociology sample consists of randomly sampled faculty at the top 20 sociology PhD- granting departments in the U.S. 28 There are 250 sociologists in the sample, 40% of whom are female. Summary statistics are presented in Table A2. There is no statistically signifi- cant difference between men and women’s tenure rates (with the mean tenure rate being 76%) although men seem to publish more solo-authored articles than women. TABLEA2: SOCIOLOGYSUMMARYSTATISTICS MenWomenp-value Tenure0.7520.7760.547 (0.433)(0.419) Total papers12.1510.180.033 (7.808)(5.726) Total coauthored6.4095.9590.567 (6.641)(4.999) Solo papers5.7454.2240.003 (4.451)(2.892) Time to tenure7.5847.5200.686 (1.607)(1.724) Books0.7790.5710.139 (1.185)(0.799) Observations150100 This table presents summary statistics for the full sample of sociologists and separately for men and women. All paper and book count variables (Total Pa- pers,Solo-authored,Coauthored, andTop 5s) are the number of papers or books an individual had published at the time of tenure. To test whether men and women are treated differently, we reestimate equation 3 using a probit model but include measures of the number of papers that researcheriis first author on. The results are presented in Table A3. We include the number and fraction of papers a researcher is first author on in Columns 1 and 2 respectively, along with female dummy interaction terms. 28 Ranking from U.S. News Education 52 TABLEA3:SOCIOLOGY:PAPERSAND TENURE Dep Var: TenureProbitProbit (1)(2) Total first author0.050 ∗∗ (0.017) Fem x First Author0.026 (0.040) Fraction first author0.403 ∗∗∗ (0.043) Fem x Frac. First Author-0.042 (0.172) Solo papers0.0080.000 (0.006)(0.006) Fem x Total Solo0.0020.007 (0.011)(0.011) Total Coauthored-0.010 ∗ 0.009 (0.004)(0.007) Fem x Total CA-0.0200.001 (0.017)(0.015) Books0.063 ∗ 0.058 (0.032)(0.035) Book chapters0.0070.005 (0.013)(0.012) Female0.0260.010 (0.114)(0.163) School FEYesYes Tenure Year FEYesYes Observations237209 This table shows the relationship between the number and types of papers an individual publishes and tenure for a sample of sociolo- gists. The dependent variable is a binary variable indicating whether the individual received tenure 6-7 years after being hired at the initial tenure institution.Total first authoris the number of papers an individ- ual is first author on whileFraction first authoris the fraction of an indi- vidual’s papers that s/he was first author on. The equations are esti- mated using a probit model and the marginal probabilities calculated at the mean are displayed. Standard errors, reported in parentheses, are clustered by tenure institution. (*=p<0.10, **=p<0.05 ,***=p<0.01) 53 Appendix B Additional Figures FIGUREB1: TIMING OFPUBLICATIONS Notes: This figure shows the average number of publications an individual has in the years surrounding his or her initial appointment as an assistant professor. Year 0 is the year that the individual begins working at his/her tenure institution (tenure institutions are defined in Section 2). The blue bars represent publications that are coauthored with men. The red bars represent all other publications (either solo-authored or coauthored with women). Panels A and B show the timing of publications for women and men who were denied tenure. Panels C and D show the timing of publications for women and men who received tenure. 54 Appendix C Institutions List Received faculty list:Brown, Columbia, Cornell, Duke, Harvard, Michigan State Uni- versity, New York University, Northwestern, Ohio State University, Penn State, Rutgers, Stanford, UC Berkeley, UC Davis, UC San Diego, UCLA, University of Virginia, University of Maryland, University of Michigan, University of Minnesota, University of Pennsylva- nia, University of Wisconsin-Madison No faculty list:Boston College, Boston University, California Institute of Technology, Georgetown, MIT, Princeton,University of Southern California, University of Chicago, University of Texas - Austin, University of Rochester, Vanderbilt, Yale 55 Appendix D Experiment I This section provides additional information for Experiment I. As mentioned in the main body of the paper, the first experiment was conducted with participants from the mTurk online platform. First, 80 participants were recruited to play the role of “workers” and perform two five-question quizzes (21 men and 19 women completed the math quizzes while 23 men and 17 women completed the grammar quizzes). Workers received a par- ticipation fee of $0.30 plus $0.05 for each question they answer correctly. On average, workers earned $0.55. The quizzes used are provided below. For the main part of the experiment, 505 participants were recruited to predict the scores of one randomly-chosen male worker and a randomly-chosen female worker in a task. Predictors were paid a participation fee of $0.50 and received $0.10 for each score they correctly predicted. The number of predictors in each treatment was as follows: 242 recruiters were assigned to the Individual treatment, of which 120 were assigned to the No-Information treatment (62 for math quizzes and 58 for grammar quizzes) and 122 to the Gender-Information treatment (63 for math quizzes and 59 for grammar quizzes), and 264 recruiters were assigned to the Joint treatment, of which 138 were assigned to the No- Information treatment (70 for math quizzes and 68 for grammar quizzes) and 126 to the Gender-Information treatment (63 for math quizzes and 63 for grammar quizzes). D.1 Quizzes used Grammar Quiz 1 1. The storm prevented ....... on a picnic. (a) us to going (b) us going (c) us to go (d) us from going 2. A man’s concept of liberty is different from ........ . (a) a woman’s (b) womens (c) a woman (d) woman’s 3. ........ hour went by before we received ........ invitation (a) an/an (b) a/a (c) an/a (d) a/an 4. When a subordinate clause is followed by the main clause, what is required? (a) a dash (b) a semi-colon (c) a period (d) a comma 5. ........ are used around a relative clause that defines the noun it follows. (a) Only commas (b) No commas (c) Semi-colons (d) Quotation marks 56 Grammar Quiz 2 1. I am dizzy and need to ........ down (a) lie (b) lay (c) lye (d) go lay 2. Which of these is not an article? (a) The (b) A (c) It (d) An 3. His idea is ........ mine (a) different to (b) different from (c) different than (d) different then 4. Adverbs can modify which of the following? (a) nouns (b) adjectives (c) pronouns (d) none of the above 5. ........ did you bump into? (a) Who (b) Whose (c) Who’s (d) Whom Math Quiz 1 1. Which of the following is a subset of {b,c,d}? (a) { } (b) {a} (c) {1,2,3} (d) {a,b,c} 2. A man’s regular pay is $3 per hour up to 40 hours. Overtime is twice the payment for regular time. If we was paid $168, how many hours overtime did he work? (a) 8 (b) 16 (c) 28 (d) 48 3. 3 4/5 expressed as a decimal is (a) 3.40 (b) 3.45 (c) 3.50 (d) 3.80 4. Which of the following is the highest common factor of 18, 24, and 36? (a) 6 (b) 18 (c) 36 (d) 72 5. Given thataandbare integers, which of the following is not necessarily an integer? (a)2a−5b(b)a 7 (c)b a (d)ab 57 Math Quiz 2 1. Items bought by a trader for $80 are sold for $100. The project expressed as a per- centage of cost price is (a) 2.5% (b) 20% (c) 25% (d) 50% 2. A man bought a shirt at a sale. He saves $30 on the normal price when he paid $120 for the shirt. What was the percentage discount on the shirt? (a) 20 (b) 25 (c) 33.33 (d) 80 3. How many subsets does {a,b,c,d,e} have? (a) 2 (b) 4 (c) 10 (d) 32 4. What is the median of the given data: 13, 16, 12, 14, 19, 14, 13, 14 (a) 14 (b) 19 (c) 12 (d) 14.5 5. In coordinate geometry, what is the equation of the x-axis? (a)y=0(b)x=y(c)x=0(d)y=1 D.2 Instructions Below are the instructions for the Joint and Gender-Information treatments. Instructions for the Individual and No-Information treatments are almost identical and are available upon request. Instructions screen 1 INSTRUCTIONS: Please read all the way through. This project seeks to understand how well individuals can predict a person’s future performance on a task based on his/her past performance. We recruited a group of people to complete two math [grammar] quizzes. Each quiz had five questions. Participants had one minute to complete each quiz. In what follows, we will show two participants’ scores from the first quiz. We then ask you to predict each participant’s score on the second quiz. We will provide you with some basic information on each individual. You will be paid $0.50 for your participation but will also be paid a bonus of $0.10 if you correctly guess a participant’s score on the second quiz. 58 Instructions screen 2 We will first show you the distribution of scores on the first quiz. Each bar represents the fraction of people who obtained that score. For example, 30% of people scored 4/5 on the first quiz. The average score of female participants (2.5/5) is shown by the solid line. The average score of male participants (2.8/5) is shown by the dashed line. Instructions screen 3 Below we are showing you a team’s score on Quiz 1. Recall that each team member worked on the questions independently. We then take the sum of the two scores and assign it to the team. For example, if Person A scored 3/5 and Person B scored 4/5, their team score would be 7/10. We provide you with some basic demographic information about each team member. Based on the team’s performance, please predict each individual’s score on Quiz 2. You can view each quiz by clicking on the link below. Histograms The histograms seen by recruiters containing the distribution of scores are seen below in Figure D1. 59 FIGURED1: DISTRIBUTION OF SCORES INQUIZ1 Notes: These bar graphs show the distribution of scores on first math and grammar quizzes. The lines mark the means score of men (dashed line) and women (solid lines). The experiment participants who predicted scores saw these distributions with or without the lines, depending on whether they were in the Gender-Information treatment. 60 Appendix E Experiment II This section provides additional information and analysis for Experiment II. E.1 Candidates Before running Experiment II, the sets of candidates are constructing using data from students who took part in laboratory experiments run in Bologna and Abu Dhabi. 29 In Bologna, 68 students completed one of the two tasks (16 men and 20 women completed the search task while 12 men and 20 women completed the vocabulary task), while in Abu Dhabi, 90 students completed both tasks (42 men and 48 women). Students were paid according to their performance in the tasks. Vocabulary Task Students are asked to solve Word-in-a-Word puzzles. They are given ‘large’ words, one at a time. The task is to find smaller Italian (Bologna) or English (Abu Dhabi) words that can be formed out of the letters of the large word. The task lasts 15 minutes. There is a maximum of 24 large words and participants can freely move to the next word at any time, but cannot return to previous words. The following rules apply: (i) words must consist of four letters or more, (ii) each letter of the large word can only be used once, (iii) proper nouns (names, etc.) are not allowed, and (iv) plurals and verb conjugations are allowed. Points are awarded to submitted words ofnletters according to the following rules: (i) each word found in a dictionary adds(n−3)points to the score, (ii) words not found in a dictionary subtract(n−3)points from the score, (iii) words that are too short subtract 1 point from the score; and (iv) words submitted more than once have no impact on the score. Points were converted to cash at an exchange rate of 0.10 euros per point in Bologna (around $0.11 per point) and 1 Emirati dirham per point in Abu Dhabi (around $0.27 per point). Search Task Students are shown two 10x10 matrices. Each cell is filled with a two-digit number. The task is to find the highest number in each matrix, add these up, and enter the sum. Each correct answer increases the score by one point. After entering a number, a new pair of 29 We thank BLESS for allowing us to use their facilities in Bologna. The experimental software used in Bologna was developed in PHP-MySQL with the help of Ailko van Veen and Joep Sonnemans, and was later adapted for the use in Qualtrics by Manu Muñoz. In Abu Dhabi, the experiment was run using zTree. 61 matrices appear, irrespective of whether the sum is correct. The task lasts 15 minutes. Points were converted to cash at an exchange rate of 0.50 euros for every point in Bologna (around $0.55 per point) and 4 Emirati dirhams per point in Abu Dhabi (around $1.09 per point). Resumes In addition to performing the tasks, students in Bologna and Abu Dhabi answered a few questions about their demographics and studies. This information and their scores are used to construct eight sets of “candidates” for each task. Each set consists of the resumes of four candidates. The resume of each candidate includes information about their score in the real effort task as well as their field of study, degree length (from three to five years), age, gender, and geographic region of origin. The score is shown for each candidate in the Individual treatment or as sums of two pairs of candidates in the Joint treatment. An example from the Vocabulary task treatment is provided in Figure E1. The other sets of this treatment and those of the Search task treatment are available upon request. FIGUREE1: EXAMPLE OF ONE SET OF CANDIDATE RESUMES Individual treatment Joint treatment 62 Note that the sets are constructed such that the summed score of one pair of candidates in the Joint Treatment is obviously better to that of the other pair (e.g., candidates 1 and 2 in Figure E1). The candidate pairs with the high score always include a male and a female whose resumes are otherwise alike. Specifically, the field of study, degree length, and geographic region of origin is always identical while age is allowed to vary but within a narrow range. The characteristics of the pair of candidates with lower joint scores are permitted to vary. This design is used to mask the purpose of the study to recruiters by giving them multiple characteristics to base their decision on, while at the same time keep these characteristics constant within the relevant pair of candidates. E.2 Procedures For Experiment II, human resource workers from the United States and India were re- cruited from Qualtrics’ panel of participants to complete an incentivized online experi- ment. 30 Only respondents who are involved in their firm’s hiring decisions and those that passed a set of attention checks are considered. Respondents who complete the experi- ment receive a participation fee set by Qualtrics plus additional incentives based on their choices. In total, 479 human resource workers (212 in the U.S. and 267 in India) took part in the experiment as “predictors”. Predictors are randomly assigned to the Vocabulary task treatment or the Search task treatment, and subsequently, to the Individual treatment (top example in Figure E1) or the Joint treatment (bottom example in Figure E1). The number of predictors in each treatment was as follows: 281 predictors were assigned to the Search task treatment, of which 19 were assigned to the Individual treatment (10 men and 9 women) and 262 to the Joint treatment (114 men and 148 women), and 198 predictors were assigned to the Vocabulary task treatment, of which 17 were assigned to the Individual treatment (6 men and 11 women) and 181 to the Joint treatment (83 men and 98 women). More predictors were assigned to the Joint treatment because that is the treatment of interest. Predictors first complete a simplified version of the task they are assigned to and earn $0.06 per point in the vocabulary task or $0.15 per point in the Search task. Thereafter, in the main part of the experiment, each predictor sees three sets of four candidates and is required to pick one student from each set. The sets are shown sequentially and are picked at random from the eight constructed sets. The picked students’ scores are paid 30 Throughout the paper, we pool the data from the U.S. and India. However, our results are unaffected by further controlling for the recruiter’s country. Running regressions like the ones in Table 11 including an interaction between the gender dummyfem ik and a country indicator results in insignificant coefficients for the interaction term. 63 out to the predictor at a rate of $0.06 per point in the Vocabulary task or $0.15 per point in the Search task. Finally, predictors are asked whether they think that men or women are better at the task they have participated in. Responses are in five categories and choosing the correct answer (based on the students’ actual scores in the task) is rewarded with $1.50. Instructions for the experiment are provided below. E.3 Instructions Below are the instructions for the Joint treatment with the Search task. Instructions for the Individual treatment and the Vocabulary task are very similar and are available upon request. Instructions welcome screen Thank you for taking part in this survey! The survey will take around 20 minutes to com- plete. We would like to see how people make choices when they have to select someone based on task performance. We will explain this in much more detail later. You will be compensated for participating in this survey in the usual way. In addition, you may makeextra earnings, depending on the answers you give and choices you make. How you can make extra earnings will be made clear in subsequent instructions. All extra earnings you make will be calculated in US dollars. Your total earnings in dollars will be paid to you as panel points in the usual manner. Once again, these extra earnings come on top of your compensation for participating. Your decisions in the study are private and anonymous. They will not be linked to your name in any way. We are interested in your own decisions. We kindly request that you do not communicate with other people while taking part in the study. The study consists of three parts. Part 2 will be explained after you have finished Part 1 and Part 3 will be explained after you have finished Part 2. Next, we will explain Part 1. Instructions part 1 screen In this first part, we ask you to do a simple addition task with which you can earn money. When you start, you will see two matrices on the screen. Each matrix has 6 rows and 6 columns and is filled with randomly generated numbers. Your task is to find the largest number in each of the two matrices and then to add them up. We will give you an example below. For each correct addition, you will receive $0.15. You will have five minutes to do this task. Irrespective of whether your answer is correct or incorrect, a new pair of matrices 64 will appear after you enter your answer. This means that, for each pair, you have only one attempt to provide the correct answer. At the top of the screen you can see how many correct answers you have so far. As mentioned, you will have five minutes in total. You will see the time that remains in the upper right corner of the screen. You will be allowed at most 40 addition attempts. This is much more than anyone can actually add up. After you have finished reading these instructions, you will see a link. Click on this link to complete the addition task. Note that the addition task will open a new window in your browser. Once you have completed the task, you will be given a code. You will need this code to complete the study and receive your payment. Please write it down. If you accidentally close the window, you can click on the link again and it will show you the code. Perform the addition task:Below is a10-digit number. Please write it down and then click on the link to perform the addition task. When you click on the link, anew window will appear where you will have to enter your 10-digit code. Note that if you enter the wrong code, we won’t be able to pay you for your performance in Part 1. Once you are done with the task, you will receive a password. You will have to come back to this page and enter the password below. This will confirm that you have com- pleted the addition task. Instructions part 2 screen Before we instruct you about Part 2, we would like to inform you of the following: Between 2016 and 2017, a large number of university students from all over the world per- formed an addition task like the one you have just performed. There are two differences between your addition task and the one performed by the university students: students faced larger matrices (10x10 instead of 6x6) and were given more time to perform the task (15 min instead of 5 min). These changes were made for you to be able to experience the same task without taking too much of your time. However, despite these changes, the nature of the task remains the same. This means that your experience with the task should give you a sense of what is needed to do well. Your choices in Part 2:We will present to you three different sets of profiles describ- ing some of the characteristics of students who did this previous task. Each set contains profiles of four different students. For each set, we would like you tochoose one student. 65 Your choice gives you money. More precisely, you will receive$0.15 for each correct addition performed by the student you choosewhen he or she did the task. Because we will give you three sets of profiles to choose a student from, you need to make a choice three times. This means you will earn money three times. Note that once you have made a choice you won’t be able to go back and change it. The profiles we give you will contain background information about the students. Specifically, their age, field of study, gender, type of university degree they purse, and the region of the world they come from. We will also give you an indication of the score obtained by the students when they did the task. However, you will not be told each stu- dent’s own score. Instead, we have grouped the students in pairs. Below is an example of how a set of four students will be presented. [Here the instructions included an example similar to the ones in Figure E1] Please continue to make your three choices. Instructions decision screen Examine the profiles closely and choose one student. Remember, you will receive $0.15 for each correct addition performed by the student you choose when he or she did the task. The profiles of four different students are below. Instructions part 3 screen In Part 3, we ask you to estimate whether female students or male students were better in the previously-described task. More precisely, we calculated the average score of all female students and the average score of all male students who participated in the task across all the regions of the world. We ask you to estimate whether females or males scored better on average by answering the question below. If you estimate correctly, we you will earn an additional $1.50. I estimate that: •Female students are much better (the average score of female students is 4 more than that of male students) •Female students are slightly better (the average score of female students is between 1 and 3.99 more than that of male students) •Male and female students are about the same (the average score of male and female students differs by less than 1) •Male students are slightly better (the average score of male students is between 1 and 3.99 more than that of female students) 66 •Male students are much better (the average score of male students is 4 more than that of female students) E.4 Additional analysis This subsection contains the additional analysis of Experiment II that could not be in- cluded in the main body of the paper due to space constraints. Individual Treatment Table E1 shows results from analyzing recruiters’ choices in the Individual treatment. Like in the main body of the paper, we use McFadden’s random-utility model to explain the choice of whether or not to select one candidate out of four in each set. Columns 1 to 4 contain the results for the Individual treatment (Columns 1 and 2 for male recruiters and Columns 3 and 4 for female recruiters) and Columns 5 to 8 for the Joint treatment for comparison (Columns 5 and 6 for male recruiters and Columns 7 and 8 for female recruiters). The regressions include data form the search and vocabulary tasks to have enough independent observations. The only difference between this specification and that in the paper is that instead of the candidates’ joint score, we use an indicator for having the highest individual or joint score in a set. The estimation results are presented as odds ratios. In contrast to the Joint treatment, the results show that in the Individual treatment, the gender of the candidate does not have a significant impact on the likelihood of being chosen irrespective of the gender of the recruiter. Moreover, the higher odds ratio for the indicator of the highest score shows that, compared to the Joint treatment, recruiters focus relatively more on scores when making a decision in the Individual treatment. Finally, one can also see that including the recruiters’ beliefs concerning the mean scores of men and women has a smaller effect in the Individual treatment vis-à-vis the Joint treatment, implying that Joint evaluation makes these beliefs a more important part of the decision. Beliefs The recruiters were asked to report their belief about the difference in mean scores of men and women in either the search or the vocabulary task. Answers were given in a five categories, which we code as: (-2) women much better (women’s mean score is more than 4 points larger), (-1) women slightly better (women’s mean score is between 1 and 3.99 points larger), (0) about the same (mean scores differ by less than 1 point), (1) men slightly 67 TABLEE1: EXPERIMENTII ODDSRATIOS OFBEINGPICKED Dep. Var.:Individual treatmentJoint treatment Picked byMale recruitersFemale recruitersMale recruitersFemale recruiters recruiter(1)(2)(3)(4)(5)(6)(7)(8) Female1.4451.5971.0451.0360.8560.8571.276***1.105 (0.465)(0.537)(0.474)(0.457)(0.082)(0.081)(0.109)(0.104) Female×Belief0.504*0.9650.717***0.793*** (0.190)(0.238)(0.053)(0.060) Highest score0.016***0.014***0.080***0.080***0.256***0.254***0.139***0.138*** (0.014)(0.014)(0.030)(0.030)(0.034)(0.034)(0.020)(0.020) Observations1921922402402364236429522952 Recruiters16162020197197246246 This table presents results from Experiment II. Columns 1-4 show the results for the Individual treatment and Columns 5-8 for the Joint treatment. Results are shown separately depending on the recruiter’s gender: male recruiters in Columns 1-2 and 5-6, and female recruiters in Columns 3-4 and 7-8. All regressions include fixed effects for each set-recruiter combination and controls for other variables in the candidates’ resumes. Results are presented as odds ratios. Standard errors clustered on recruiters. (*=p<0.10, **=p<0.05 ,***=p<0.01) better (men’s mean score is between 1 and 3.99 points larger), and (2) men much better (men’s mean score is more than 4 points larger). Figure E2 shows the distribution of the recruiters’ beliefs depending on the task and the gender of the recruiter. The modal belief of male recruiters is that the performance of men and women is about the same in both tasks. Moreover, the remaining answers are more or less evenly distributed among the remaining options, implying that the beliefs of male recruiters are not systematically biased in favor of either male or female candidates. This is confirmed by sign tests evaluating whether the median of the distribution is zero (p= 0.348for the search task andp=0.597for the vocabulary task). By contrast, the modal answer for female recruiters is that the performance of women is slightly better than that of men, reflecting a slight bias by female recruiters in favor of female candidates (sign testsp <0.001in both tasks). ). Finally, there are no significant differences in the beliefs distributions depending on the task (Fisher’s exact tests:p=0.283for male recruiters and p=0.726for female recruiters), which confirms that neither task is perceived as more stereotypically male (female) than the other. 68 FIGUREE2: DISTRIBUTION OF RECRUITERS’BELIEF OF GENDER DIFFERENCES IN PER- FORMANCE INEXPERIMENTII 69
Find difference