A dataset containing almost 100,000 first names and the proportion of people with that first name that are female and male.

first_names_gender

Format

A data frame with 99,444 rows and 4 variables:

name

The person's first name

probability_male

Probability that the first is male

probability_female

Probability that the first name is female

likely_gender

The most likely gender based on the probability of each gender

...

Source

https://www.ssa.gov/oact/babynames/limits.html