Search results

Showing 1 – 46 of 46 results.
Self-published

The Census Tree, 1850-1860 (ICPSR 193225)

Released/updated on: 2023-08-10
Time period: 1850-01-01--1860-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1870 (ICPSR 193224)

Released/updated on: 2023-08-10
Time period: 1850-01-01--1870-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1880 (ICPSR 193223)

Released/updated on: 2023-08-10
Time period: 1850-01-01--1880-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1900 (ICPSR 193222)

Released/updated on: 2023-08-10
Time period: 1850-01-01--1900-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1910 (ICPSR 193221)

Released/updated on: 2023-08-10
Time period: 1850-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1920 (ICPSR 193205)

Released/updated on: 2023-08-09
Time period: 1850-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1930 (ICPSR 193203)

Released/updated on: 2023-08-09
Time period: 1850-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1850-1940 (ICPSR 193187)

Released/updated on: 2023-08-09
Time period: 1850-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1870 (ICPSR 193235)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1870-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1880 (ICPSR 193234)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1880-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1900 (ICPSR 193233)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1900-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1910 (ICPSR 193230)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1920 (ICPSR 193229)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1930 (ICPSR 193228)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1860-1940 (ICPSR 193227)

Released/updated on: 2023-08-10
Time period: 1860-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1880 (ICPSR 193246)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1880-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1900 (ICPSR 193245)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1900-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1910 (ICPSR 193244)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1920 (ICPSR 193243)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1930 (ICPSR 193237)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1870-1940 (ICPSR 193236)

Released/updated on: 2023-08-10
Time period: 1870-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1880-1900 (ICPSR 193251)

Released/updated on: 2023-08-11
Time period: 1880-01-01--1900-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1880-1910 (ICPSR 193250)

Released/updated on: 2023-08-10
Time period: 1880-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1880-1920 (ICPSR 193249)

Released/updated on: 2023-08-10
Time period: 1880-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1880-1930 (ICPSR 193248)

Released/updated on: 2023-08-10
Time period: 1880-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1880-1940 (ICPSR 193247)

Released/updated on: 2023-08-10
Time period: 1880-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1900-1910 (ICPSR 193253)

Released/updated on: 2023-09-14
Time period: 1900-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1900-1910, by Sex (ICPSR 193302)

Released/updated on: 2023-08-13
Time period: 1900-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1900-1920 (ICPSR 193264)

Released/updated on: 2023-08-11
Time period: 1900-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1900-1930 (ICPSR 193263)

Released/updated on: 2023-08-11
Time period: 1900-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1900-1940 (ICPSR 193262)

Released/updated on: 2023-08-11
Time period: 1900-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1910-1920 (ICPSR 193252)

Released/updated on: 2023-09-14
Time period: 1910-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1910-1920, by Sex (ICPSR 193305)

Released/updated on: 2023-08-13
Time period: 1910-01-01--1920-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1910-1930 (ICPSR 193266)

Released/updated on: 2023-08-11
Time period: 1910-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1910-1940 (ICPSR 193265)

Released/updated on: 2023-08-11
Time period: 1910-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1920-1930 (ICPSR 193239)

Released/updated on: 2023-09-14
Time period: 1920-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1920-1930, by Sex (ICPSR 193306)

Released/updated on: 2023-08-13
Time period: 1920-01-01--1930-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1920-1940 (ICPSR 193238)

Released/updated on: 2023-09-14
Time period: 1920-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1920-1940, by Sex (ICPSR 193307)

Released/updated on: 2023-08-13
Time period: 1920-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1930-1940 (ICPSR 193232)

Released/updated on: 2023-09-14
Time period: 1930-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

The Census Tree, 1930-1940, by Sex (ICPSR 193308)

Released/updated on: 2023-08-13
Time period: 1930-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. Each .csv file consists of a crosswalk between the two years indicated in the filename, using the IPUMS histids. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

Census Tree Links (ICPSR 144904)

Released/updated on: 2021-07-13
Time period: 1900-01-01--1920-01-01
The data sets in this repository allow users to link people among the U.S. decennial censuses, using the "histid" identifier. The census data sets users will need are indexed by Ancestry.com and are hosted by IPUMS at https://usa.ipums.org/usa-action/samples. Users will need to download the full-count census for each year and be sure to select the "histid" variable that is available under the Person/Historical Technical drop-down menu.
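
As a rough illustration of the linking step described above, the following sketch joins a crosswalk of histid pairs to person records from two census years using pandas. The column names `histid_1900` and `histid_1910`, the `age` variable, the toy records, and the helper `link_censuses` are all hypothetical and are not taken from the repository; check the Read Me file for the actual file layout before adapting it.

```python
import pandas as pd


def link_censuses(crosswalk, early, late, early_id, late_id):
    """Attach person records from two census years to each crosswalk row.

    Assumes each census extract has a column named "histid"; the crosswalk
    column names (early_id, late_id) are hypothetical placeholders.
    """
    # Rename the key to match the crosswalk and suffix every other variable
    # so columns from the two census years stay distinguishable.
    early = early.rename(columns=lambda c: early_id if c == "histid" else f"{c}_early")
    late = late.rename(columns=lambda c: late_id if c == "histid" else f"{c}_late")
    # validate= raises an error if a histid appears more than once in an extract.
    linked = crosswalk.merge(early, on=early_id, how="inner", validate="many_to_one")
    linked = linked.merge(late, on=late_id, how="inner", validate="many_to_one")
    return linked


# Toy data standing in for a crosswalk .csv and two IPUMS full-count extracts.
crosswalk = pd.DataFrame(
    {"histid_1900": ["A1", "A2", "A3"], "histid_1910": ["B1", "B2", "B9"]}
)
census_1900 = pd.DataFrame({"histid": ["A1", "A2", "A3"], "age": [10, 34, 51]})
census_1910 = pd.DataFrame({"histid": ["B1", "B2"], "age": [20, 44]})

linked = link_censuses(crosswalk, census_1900, census_1910, "histid_1900", "histid_1910")
print(linked)
```

The inner joins keep only links whose histid appears in both extracts, so the unmatched pair in the toy crosswalk is dropped. With real full-count files, the extracts would be read with `pd.read_csv`, probably in chunks, given their size.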

As of 7/12/21, links are available between the 1900-1910, 1910-1920, and 1900-1920 censuses.

A detailed account of how these links are created and a description of the data and its characteristics are available in the following article:

Price, J., Buckles, K., Van Leeuwen, J., & Riley, I. (2021). Combining family history and machine learning to link historical records: The Census Tree data set. Explorations in Economic History, 80, 101391.
https://www.sciencedirect.com/science/article/pii/S0014498321000024
Self-published

The Census Tree: Machine Learning Models (ICPSR 193324)

Released/updated on: 2023-08-14
Time period: 1850-01-01--1940-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. 

This folder includes all 36 machine learning models trained on Family Tree links for U.S. Census records from 1850 to 1940 (see Buckles, Haws, Price, and Wilbert (2023), available at https://censustree.org). We also include code to create features and obtain predicted match scores from a set of potential links.

Self-published

The Census Tree: Replication Files for 1900-1910 (ICPSR 193323)

Released/updated on: 2023-10-06
Time period: 1900-01-01--1910-01-01
The Census Tree is the largest-ever database of record links among the historical U.S. censuses, with over 700 million links for people living in the United States between 1850 and 1940. These links allow researchers to construct a longitudinal dataset that is highly representative of the population, and that includes women, Black Americans, and other under-represented populations at unprecedented rates. 

This project contains the files necessary to closely replicate the links between the 1900 and 1910 censuses. For more information, consult the included Read Me file, and visit https://censustree.org.
Self-published

Data and code from Price, Buckles, Van Leeuwen, & Riley (Explorations in Economic History) (ICPSR 130961)

Released/updated on: 2021-07-12
NOTE: Be sure you are using the most recent version of this repository.

These files include data and code used to create tables and figures for "Combining Family History and Machine Learning to Link Historical Records: The Census Tree Data Set" (citation below). We have included the truth set that we use to train our model along with the machine learning code we use to link records using this truth set. Our truth set includes histid pairs between the different census records, which can be combined with the restricted versions of the full-count census records that are distributed through the University of Minnesota.

For more details, see the "Read Me" file and the published paper at:

Price, J., Buckles, K., Van Leeuwen, J., & Riley, I. (2021). Combining family history and machine learning to link historical records: The Census Tree data set. Explorations in Economic History, 80, 101391.
https://www.sciencedirect.com/science/article/pii/S0014498321000024