Abstract

On of the Internet’s recurring memes is a world where families have children until they have a son, then stop. There seems to be a fairly common misapprehension that this will lead to an asymmetry of sexes in the next generation.

This short note explores this question: we quickly see that symmetry is preserved, but then spend time exploring what’s going on in more detail.Finally we introduce a new element to the model which, on its own, favours neither boys nor girls, but allows social engineering to skew the sex distribution.

Introduction

As is usually the case with this sort of probability puzzle, the main problem is to find the right way of looking at the problem. Once you’ve found that, the answer’s obvious.

In this problem we’re asked to think imagine families who have children until they have a son, then stop. As posed it’s natural to think about what happens on a family-by-family basis, but that’s a mistake!

Rather, enlightenment comes when you think about each birth individually: although families might be indulging in social engineering, nothing changes the statistics of the birth itself. Every baby is equally likely to be a boy or a girl—we’ll ignore the observed fact that actually about 51% of babies are male.

The key insight is that the next generation is just the total of all the births: if half of the births are male, then the next generation will be half-male too. Similarly even at the family level, we’d expect to see the same number of boys and girls in each family.

Moreover, we know that every family has exactly one son in it and so on average it must have one daughter too. However, there’s no symmetry which relates the distributions of sons and daughters, so the number of daughters will vary.

Simulation

Given how easy it is to mislead oneself with this sort of problem, I think it’s usually sensible to simulate it. Often it’s easy to mechanically generate a single sample of what’s happening, and the computer is good at doing this many times then averaging the results. Happily modern computers are fast enough to simulate simple situations in a fraction of a second, even when the program is written in a slow language like Perl.

I think there are three advantages to doing the simulation:

Of course, there are potential problems too. One is that it may not be clear how many samples one needs to take. A simple check is to simply run the simulation three times and check that the outcomes are roughly the same: if they’re not something’s wrong. However the converse sadly doesn’t apply: if we were simulating the lottery (say a-million-to-one shot) with three 100,000 sample runs there’s a reasonable chance we’d conclude that nobody ever won and you shouldn’t play. Perhaps that error would be a good thing!

Here, for each family we simply simulate the family by picking sexes at random—each equally likely—until we produce a son. Then, we forget about that family and do another one. All we have to do is keep track of the number of sons and daughters, and perhaps how often we see a particular shape of family.

Explicitly, the heart of the program looks something like this::

n_boys  = 0;				
n_girls = 0;				
do {					
  if (rand() < 0.5) { n_boys++;  }	
  else              { n_girls++; }	
} while (n_boys == 0);

I hope that by making it quite explicit that every birth has an equal chance of being a son or a daughter, it’s clear that the expected numbers of sons and daughters in the next generation are the same. In fact, once you’ve written this bit of code, it’s debatable whether you actually need to run it!

On the other hand, it’s but a single command to run it, so I simulated 100,000 random families (which took all of about 0.3s on my laptop). Here are the results after running the program three times. I think it’s clear that there’s no asymmetry between the sexes here!

 Run 1Run 2Run 3
Mean no. of sons1.0001.0001.000
Mean no. of daughters0.9941.0041.003
Mean no. of children1.9942.0042.003
Fraction of sons50.2%49.9%49.9%

Family imbalances

As we said above, although the total number of sons in the next generation will be about the same as the total number of daughters, this doesn’t mean that the distribution of boys will match the distribution of girls. For example, about half the families will have no daughters (because the first-child was a son); on the other hand all the families will have exactly one son.

The figure above shows how families grow. We assume that every family starts out with no children and represent this by the \([0,0]\) node at the top of the diagram.

Every time a new child is born, we move down a row in the diagram. If the child’s male then we move down and left along the S arrow; if the child’s female we move down and right along the D arrow.

Each node is labelled with a pair of numbers \([s,d]\) which shows the numbers of sons and daughters. Given our rule that families stop growing when the first son is born, none of the \([1,d]\) nodes have arrows leading from them: these nodes correspond to final states of the family. These are shown as rectangular boxes. Conversely, all the \([0,d]\) states have two arrows leading from them which correspond to the birth of a new child.

Returning to our one-son-per-family model, we can immediately see several things:

It is easy to check these conclusions by simulating 100,000 families:

Number of daughtersRun 1Run 2Run 3
049.9%49.8%50.1%
125.1%25.1%25.0%
212.5%12.4%12.5%
36.3%6.3%6.2%
43.1%3.1%3.1%
51.6%1.6%1.6%
5 or more1.5%1.7%1.5%

More generally we can use the diagram to help us calculate the chances of any particular pattern of sons and daughters. To do this, note that every time we make a choice we choose the S or D arrow with equal chance. So, to find out the chance of getting to a particular state we just count the number of arrows we need to follow to get from the start to that node. Each step has a 50% chance of being taken, so we just raise ½ to the relevant power to get the probability.

For example, we have to follow three arrows to get from [0,0] to [1,2], so the chance of getting one son and two daughters is ½ × ½ × ½ i.e. ⅛.

Although we’ve already reasoned that the expected number of daughters in each family must be one, the we can verify this in three different ways. These verifications aren’t really meant to convince you that the answer really is one, rather they illustrate how the apparently asymmetric situation cunningly manages to be consistent with the symmetric result.

By recursion

Suppose \(d\) is the expected number of daughters. After the first child is born, we know there are two possibilities:

Hence,

\[ \begin{align} d &= ½ \times 0 + ½ \times (d + 1), \\ d &= 1. \end{align} \]

By summation over final states

Consider all the nodes where a son has just been born: these are the rectangular nodes which correspond to the final state of the family.

If we sum over all these states then,

\[ \begin{align} d &= ½ \times 0 + ¼ \times 1 + ⅛ \times 2 + \ldots,\\ d &= ½ \sum_{i=0}^{\infty}i\,\left(½\right)^i,\\ d &= 1. \end{align} \]

The sum is a standard one you can find in tables or ask Mathematica about. However, ignoring issues like convergence, there’s a cute trick to sum it:

\[ \begin{align} \sum_{i=0}^{\infty} i \theta^i &= \sum_{i=0}^{\infty} \theta \frac{d}{d\theta} \theta^i, \\ &= \theta \frac{d}{d\theta} \sum_{i=0}^{\infty} \theta^i, \\ &= \theta \frac{d}{d\theta} \left(\frac{1}{1 - \theta}\right), \\ &= \frac{\theta}{\left(1 - \theta\right)^2}. \end{align} \]

This is just like the old trick of ‘differentiating-under-the-integral-sign’ which Feynman talks about in ‘Surely You’re Joking, Mr Feynman!’.

By summation over births

If we consider each row of the tree, then we can ask how much it contributes to the expected number of daughters.

The probability of getting to row \(i\) is just \( (½)^i \), and half the time when we move down a row we’ll welcome another daughter to the family. So, moving from row 0 to row 1 adds half a daughter to the expected number, from row 1 to 2 adds a quarter, and so on:

\[ \begin{align} d &= ½ + ¼ + ⅛ + \ldots,\\ d &= 1. \end{align} \]

One could calculate the expected number of sons in exactly the same way, and get the same result.

Incidentally for a quick way to do this sum, just write the number in binary: \(0.11111\dot{1}\).

Finite families

The diagram above also helps us understand what happens if we limit the total number of children. The tree no longer goes on forever, but stops after a fixed number of rows:

I hope it’s clear that there’s still no asymmetry in the expected number of sons and daughters but if you’re not sure, let’s enumerate the three possibilities.

Final StateProbabilitySonsDaughters
[1,0]½10
[1,1]¼11
[0,2]¼02
Expected value ¾¾

If we wanted to extend our simulation program to handle this case, it would be easy:

n_boys  = 0;						
n_girls = 0;						
do {							
  if (rand() < 0.5) { n_boys++;  }			
  else              { n_girls++; }			
} while (n_boys == 0 && (n_boys + n_girls <= 2));	

Again it’s easy to see that the central assumption—on average we produce as many sons as daughters—hasn’t changed.

Breaking the symmetry

Having shown that the basic model won’t produce a sex-asymmetry, it’s natural to ask what will. Obviously we could just change the proportion of sons born, but that’s rather crude and not really in the spirit of the original model. It would be nice if we could find a model which by itself produces equal numbers of sons and daughters, but which generates an asymmetry when coupled with the ‘one-son-per-family’ rule.

Happily we can do this! Simply suppose that some families are predisposed to have daughters and others sons—I should say that this idea is wholly hypothetical and any agreement with real biology is entirely accidental.

In particular assume that half the families have a probability \(½ + \Delta\) of having sons, while the other half have probability \(½ - \Delta\). In other words \(\Delta\) just sets the scale of the effect: \(\Delta = 0\) makes the effect vanish, \(\Delta = ½\) makes families either have all sons, or all daughters. However, the two biases cancel each other out, and we’ll be back to the 50:50 split in the next generation.

It’s important to see that the effects will only balance if size of the boy-biased families is the same as the size of the girl-biased families. This requirement is broken when social engineering rears its ugly head. Since families stop growing as soon as they have a son, families which are biased in favour of having sons will typically be smaller than those biased in favour of daughters. So the next generation will have more girl-biased babies in it, which implies that overall the fraction of sons will fall below 50%.

Simulation

It’s straightforward to extend our simulation to handle this model—and one of the advantages of doing the simulation is that we this change is so small that we can be fairly confident of doing it correctly.

x = (rand() < 0.5) ? 0.5 + delta : 0.5 - delta;			
n_boys  = 0;							
n_girls = 0;							
do {								
  if (rand() < x) { n_boys++;  }				
  else            { n_girls++; }				
} while (n_boys == 0);

The following table shows some typical results from these simulations. We consider three different cases which correspond to different values of \(\Delta\). Happily when \(\Delta = 0\) we recover the results from the last section, and as \(\Delta\) grows the fraction of males in the next generation falls.

Δ0¼
{ \(\textrm{p}_S\), \(\textrm{p}_D\) }{ ½, ½ }{⅜, ⅝}{¼, ¾}
Run 10.5000.4690.377
Run 20.4970.4690.376
Run 30.4980.4700.375
Average0.4990.4690.376

The analytic result

Having worked out what the ‘right’ answer is, it would be nice to find an analytic result.

For a moment, suppose the bias were fixed and calculate how many daughters we’d expect. Formally suppose the probability of getting a daughter is \(\textrm{p}_D\). Then the chance of getting exactly \(i\) daughters (and one son) is just,

\[ \textrm{p}(d = i, s = 1) = \textrm{p}_D^i \, (1-\textrm{p}_D), \]

and so the expected number of daughters \(\mathbb{E}(d\,|\,\textrm{p}_D)\) is just the expected value of \(d\) given a particular value for \(\textrm{p}_D\):

\[ \begin{align} \mathbb{E}(d\,|\,\textrm{p}_D) &= \sum_{i = 0}^\infty i \times \textrm{p}(d = i, s = 1),\\ &= \sum_{i = 0}^\infty i \, \textrm{p}_D^i(1-\textrm{p}_D),\\ &= (1 - \textrm{p}_D) \frac{\textrm{p}_D}{(1-\textrm{p}_D)^2},\\ &= \frac{\textrm{p}_D}{1-\textrm{p}_D}. \end{align} \]

Having solved for a particular bias, now we have to average over the two possible biases:

\[ \begin{align} \mathbb{E}(d) &= ½ \left( \mathbb{E}(d\,|\,\textrm{p}_D = (½ + \Delta)) + \mathbb{E}(d\,|\,\textrm{p}_D = (½ - \Delta)) \right),\\ &= ½\left( \frac{½ + \Delta}{½ - \Delta} + \frac{½ - \Delta}{½ + \Delta}\right),\\ &= \frac{1 + 4 \Delta^2}{1 - 4 \Delta^2}. \end{align} \]

Or, in terms of the fraction of boys \(\beta\),

\[ \beta = ½\left(1 - 4\Delta^2\right). \]

It’s easy to verify that these results are consistent with our simulations:

ΔSimulationAnalytic Result
00.4991/2 = 0.5000
0.46915/16 ≅ 0.4688
¼0.3763/8 = 0.375

Other effects

We claimed before that without the one-son-per-family rule, there wouldn’t be any overall effect of this model. That’s true if you look at the fraction of sons in the next generation, but there is an effect on the distribution of sons and daughters in a particular family.

Let’s calculate the probabilities of different families, given a particular \(\Delta\). In every case, we’ll need to average over two cases:

\[ \begin{align} \textrm{p}_S &= \left\{ (½ - \Delta), (½ + \Delta) \right\}, \\ \textrm{p}_D &= \left\{ (½ + \Delta), (½ - \Delta) \right\} \end{align} \]

Consider first familes with only child: there are only two possibilities, a son or a daughter. Quantitatively:

\[ \begin{align} \textrm{p}_S &= ½\left( (½ - \Delta) + (½ + \Delta)\right), \\ &= ½, \\ \textrm{p}_D &= ½\left( (½ + \Delta) + (½ - \Delta)\right), \\ &= ½. \end{align} \]

As expected perhaps, there’s nothing to see there. However, now consider two children. This time we have four cases:

\[ \begin{align} \textrm{p}_{SS} &= ½\left( (½ - \Delta)^2 + (½ + \Delta)^2\right), \\ &= ¼ + \Delta^2, \\ \textrm{p}_{SD} &= ½\left( (½ - \Delta)(½ + \Delta) + (½ + \Delta)(½ - \Delta)\right), \\ &= ¼ - \Delta^2,\\ \textrm{p}_{DS} &= ½\left( (½ + \Delta)(½ - \Delta) + (½ - \Delta)(½ + \Delta)\right), \\ &= ¼ - \Delta^2,\\ \textrm{p}_{DD} &= ½\left( (½ + \Delta)^2 + (½ - \Delta)^2\right), \\ &= ¼ + \Delta^2. \end{align} \]

Now there is an effect! Not in the difference between sons and daughters though: observe that swapping \(S\) and \(D\) in the equations above doesn’t change the probability. Rather in this model as \(\Delta\) increases single-sex families become more probable—this really isn’t a surprise.

Although we won’t do it here, if you considered bigger families then the effect would become more pronounced.

In principle these results provide a signature which you could test empirically. However, to match these results against real-world data you’d need to model the non-uniform distribution of sexes, and think about twins and other multiple-births. For example, about one-in-a-thousand deliveries are identical twins, and they’re almost always of the same sex.

One would either have to control for that, or perhaps regard twins as a special case of the general phenomenon this model captures. It’s not quite the same though, because it’s hard to have only one child when twins are born!

Conclusions

I hope it’s clear that choosing to stop having children as soon as you’ve had a son won’t affect the fraction of men in the next generation. Indeed I hope it’s clear that no choice which only affects how many children people have will change the sex ratio assuming that we can treat each birth as an independent random event.

However, logically that assumption might not be true. We considered a simple way in which nature might be different, and saw that it couples the choice of when to stop having children to the sex of the children produced.

Although there’s nothing very deep in any of this, I still think it’s a nice problem.

Finally, you can download the code and source for the figures from Github.1