Data from the Minnesota Breast Cancer Family Study. This contains
extended pedigrees from 426 families, each identified by a single
proband in 1945-52, with follow up for incident breast cancer.
Usage
data(minnbreast)
Format
minnbreast: A data frame with 29114 observations, one line per subject,
on the following 14 variables.
minnbreast2: Further data on oral contraceptive use and related
risk factors in 6150 female subjects.
id
subject identifier
proband
if 1, this subject is one of the original 426 probands
fatherid
identifier of the father, if the father is
part of the data set; zero otherwise
motherid
identifier of the mother, if the mother is
part of the data set; zero otherwise
famid
family identifier
endage
age at last follow-up or incident cancer
cancer
1= breast cancer (females) or prostate cancer
(males), 0=censored
yob
year of birth
education
amount of education: 1-8 years,
9-12 years, high school graduate, vocational education beyond high
school, some college but did not graduate, college graduate,
post-graduate education, refused to answer on the questionairre
marstat
marital status: married, living with someone
in a marriage-like relationship, separated or divorced, widowed,
never married, refused to answer the questionaiire
everpreg
ever pregnant: never pregnant at the time of
baseline survey, ever pregnant at the time of baseline survey
parity
number of births
sex
M or F
bcpc
part of one of the families in the breast/prostate
cancer substudy: 0=no, 1=yes. Note that subjects who were recruited
to the overall study after the date of the BP substudy are coded as zero.
id
subject identifier
startage
starting age for follow-up
endage
age at last follow-up or incident breast
cancer
cancer
1=incident breast cancer, 0=censored
oralc
current oral contraceptive use: none, <=4 years,
> 4 years
degree
familial relationship to the proband: first
degree, second degree, or marry-in
par.brth
a combination of parity and childbearing age:
nullparous (no children), 2 children with the first by age 20, 1-2
children after age 20, 3+ children with the first by age 20, or 3+
children after age 20.
agemen
age at first menarche
meno
age of menopause: currently pre-menopausal,
before age 45, age 45-50, after age 50
ooph
oophorectomy: 0=no, 1=yes
packyr
pack years of smoking: never smoker, 20 or
less, more than 20
Details
The original study was conducted by Dr. Elving Anderson at the Dight
Institute for Human Genetics at the University of Minnesota. From 1944
to 1952, 544 sequential breast cancer cases seen at the University
Hospital were enrolled, and information gathered on parents, siblings,
offspring, aunts/uncles, and grandparents with the goal of
understanding possible familial aspects of brest cancer. In 1991 the
study was resurrected by Dr. Tom Sellers. Of the original 544 he
excluded 58 prevalent cases, along with another 19 who had less than 2
living relatives at the time of Dr Anderson's survey. Of the
remaining 462 families 10 had no living members, 23 could not be
located and 8 refused, leaving 426 families on whom updated pedigrees
were obtained. This gave a study with 13351 males and 12699 females
(5183 marry-ins). Primary questions were the relationship of early
life exposures, breast density, and pharmacogenomics on incident
breast cancer risk.
For a subset of the families data was gathered on prostate cancer risk
for male subjects via questionairres sent to men over 40. Other than
this, data items other than parentage are limited to the female subjects.
In ___ a second phase of the study was instituted. The pedigrees were
further extended to the numbers found in this data set, and further
data gathered by questionairre.
The second data set contains time dependent covariates and so is coded
in a (start, stop) style. It is also more refined with respect to
marry-ins to the study, whose observed breast cancer risk does
not start until they join the family.
Source
Authors of the study
References
Epidemiologic and genetic follow-up study of 544 Minnesota breast
cancer families: design and methods. Sellers TA, Anderson VE, Potter
JD, Bartow SA, Chen PL, Everson L, King RA, Kuni CC, Kushi LH,
McGovern PG, et al. Genetic Epidemiology, 1995; 12(4):417-29.
Evaluation of familial clustering of breast and prostate cancer in the
Minnesota Breast Cancer Family Study.
Grabrick DM, Cerhan JR, Vierkant RA, Therneau TM, Cheville JC, Tindall
DJ, Sellers TA. Cancer Detect Prev. 2003; 27(1):30-6.
Risk of breast cancer with oral contraceptive use in women with a
family history of breast cancer. Grabrick DM, Hartmann LC, Cerhan JR,
Vierkant RA, Therneau TM, Vachon CM, Olson JE, Couch FJ, Anderson KE,
Pankratz VS, Sellers TA. JAMA. 2000; 284(14):1791-8.
Examples
data(minnbreast)
breastped <- with(minnbreast, pedigree(id, fatherid, motherid, sex,
status=(cancer& !is.na(cancer)), affected=proband,
famid=famid))
plot(breastped["8"]) #plot family 8, proband is solid, slash for cancers
#Note that breastped[8] is a different family, since ids are not 1,2,3,...