<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "http://jats.nlm.nih.gov/publishing/1.1/JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xml:lang="en" article-type="review-article" dtd-version="1.1">
<front>
<journal-meta>
<journal-id journal-id-type="pmc">BIOCELL</journal-id>
<journal-id journal-id-type="nlm-ta">BIOCELL</journal-id>
<journal-id journal-id-type="publisher-id">BIOCELL</journal-id>
<journal-title-group>
<journal-title>BIOCELL</journal-title>
</journal-title-group>
<issn pub-type="epub">1667-5746</issn>
<issn pub-type="ppub">0327-9545</issn>
<publisher>
<publisher-name>Tech Science Press</publisher-name>
<publisher-loc>USA</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">27884</article-id>
<article-id pub-id-type="doi">10.32604/biocell.2023.027884</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Review</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>A broad overview of genotype imputation: Standard guidelines, approaches, and future investigations in genomic association studies</article-title><alt-title alt-title-type="left-running-head">A broad overview of genotype imputation</alt-title><alt-title alt-title-type="right-running-head">A broad overview of genotype imputation</alt-title>
</title-group>
<contrib-group>
<contrib id="author-1" contrib-type="author" corresp="yes">
<name name-style="western"><surname>TRECCANI</surname><given-names>MIRKO</given-names></name><email>mirko.treccani@univr.it</email>
</contrib>
<contrib id="author-2" contrib-type="author">
<name name-style="western"><surname>LOCATELLI</surname><given-names>ELENA</given-names></name>
</contrib>
<contrib id="author-3" contrib-type="author">
<name name-style="western"><surname>PATUZZO</surname><given-names>CRISTINA</given-names></name>
</contrib>
<contrib id="author-4" contrib-type="author" corresp="yes">
<name name-style="western"><surname>MALERBA</surname><given-names>GIOVANNI</given-names></name><email>giovanni.malerba@univr.it</email>
</contrib><aff><institution>GMLab, Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona</institution>, <addr-line>Verona, 37134</addr-line>, <country>Italy</country></aff>
</contrib-group><author-notes><corresp id="cor1"><label>&#x002A;</label>Address correspondence to: Giovanni Malerba, <email>giovanni.malerba@univr.it</email>; Mirko Treccani, <email>mirko.treccani@univr.it</email></corresp></author-notes>
<pub-date date-type="collection" publication-format="electronic"><year>2023</year></pub-date>
<pub-date date-type="pub" publication-format="electronic"><day>22</day><month>5</month><year>2023</year></pub-date>
<volume>47</volume>
<issue>6</issue>
<fpage>1225</fpage>
<lpage>1241</lpage>
<history>
<date date-type="received"><day>19</day><month>11</month><year>2022</year></date>
<date date-type="accepted"><day>16</day><month>2</month><year>2023</year></date>
</history>
<permissions>
<copyright-statement>&#x00A9; 2023 Treccani et al.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Treccani et al.</copyright-holder>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This work is licensed under a <ext-link ext-link-type="uri" xlink:type="simple" xlink:href="https://creativecommons.org/licenses/by/4.0/">Creative Commons Attribution 4.0 International License</ext-link>, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
</license>
</permissions>
<self-uri content-type="pdf" xlink:href="TSP_BIOCELL_27884.pdf"></self-uri>
<abstract>
<p>The advent of genomic big data and the statistical need for reaching significant results have led genome-wide association studies to be ravenous of a huge number of genetic markers scattered along the whole genome. Since its very beginning, the so-called genotype imputation served this purpose; this statistical and inferential procedure based on a known reference panel opened the theoretical possibility to extend association analyses to a greater number of polymorphic sites which have not been previously assayed by the used technology. In this review, we present a broad overview of the genotype imputation process, showing the most known methods and presenting the main areas of interest, with a closer look to the most up-to-date approaches and a deeper understanding of its usage in the present-day genomic landscape, shedding a light on its future developments and investigation areas.</p>
</abstract>
<kwd-group kwd-group-type="author">
<kwd>Haplotypes</kwd>
<kwd>Variant Calling</kwd>
<kwd>Inference</kwd>
<kwd>Linkage Analysis</kwd>
<kwd>NGS</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Genotype imputation is a process to statistically infer missing genotypes in target samples using local linkage disequilibrium patterns from a reference panel of phased haplotypes. It is used in modern genome-wide association studies (GWAS) to extend the number of genetic variants from a set of genotyping microarrays.</p>
<p>Imputation was first used on genetically isolated human populations to identify ancestral haplotypes shared among samples (<xref ref-type="bibr" rid="ref-95">Pilia <italic>et al</italic>., 2006</xref>; <xref ref-type="bibr" rid="ref-111">Scuteri <italic>et al</italic>., 2007</xref>). Following the improvement of sequencing technologies, genotype imputation was then applied to several research studies, focused on human genetic diseases and complex traits (<xref ref-type="bibr" rid="ref-30">Everest <italic>et al</italic>., 2022</xref>; <xref ref-type="bibr" rid="ref-57">Kember <italic>et al</italic>., 2022</xref>).</p>
<p>Genotype imputation consists of the application of the homonym statistical technique for statistically measuring the genotype in terms of allele dosage or genotype likelihood probabilities. It reconstructs missing genotypes from a sample of genotyped individuals at many markers, leveraging on a large supportive dataset of fully characterized haplotypes, known as a reference panel. Imputation algorithms have been greatly improved over time and now they can handle reference panels containing millions of individuals who underwent whole genome sequencing.</p>
<p>Regarding the human genome, it is noteworthy that more than 715&#x2032;081&#x2032;156 short sequence variants and 7&#x2032;097&#x2032;115 structural variants (<ext-link ext-link-type="uri" xlink:href="https://www.ensembl.org/Homo_sapiens/Info/Annotation">https://www.ensembl.org/Homo_sapiens/Info/Annotation</ext-link>) (<xref ref-type="bibr" rid="ref-18">Cunningham <italic>et al</italic>., 2022</xref>) have been reported so far. These represent most of the so-called genetic variability of human populations, and a part of them is likely to be associated with the variability of rare and common traits. Therefore, it would be very helpful to have tools able to predict the genotype at these polymorphic sites. This task could be accomplished using the proper reference panels and efficient imputation algorithms.</p>
<p>Imputation is routinely applied to samples that underwent genotyping microarray methodologies. Genotyping arrays carry probes for hundreds of thousands (or millions) of genetic loci widespread throughout the human genome; therefore, genotyping array&#x2013;based studies cannot assay the whole set of polymorphic sites scattered across the whole genome: for this reason, the reconstruction of missing genetic loci turns out to be fundamental to enrich downstream analyses and increase the chance to detect true associations.</p>
<p>Thus, since its first applications, genotype imputation increased the number of significant results on either Mendelian or complex diseases (<xref ref-type="bibr" rid="ref-64">Li <italic>et al</italic>., 2006</xref>; <xref ref-type="bibr" rid="ref-110">Scott <italic>et al</italic>., 2007</xref>; <xref ref-type="bibr" rid="ref-87">Mijatovic <italic>et al</italic>., 2012</xref>). Genotype imputation was not only restricted to empower GWAS, but also to confirm or correct genotyped markers based on their computed probabilities (<xref ref-type="bibr" rid="ref-79">Marchini and Howie, 2010</xref>), to fine-map variants and confirm low evidence of association (<xref ref-type="bibr" rid="ref-92">Orho-Melander <italic>et al</italic>., 2008</xref>) and combine multiple studies into meta-analyses (<xref ref-type="bibr" rid="ref-130">Zeggini <italic>et al</italic>., 2008</xref>). With the improvements in genotyping technologies and the advent of the sequencing era, genotype imputation was effective also in the context of the genotyping of highly polymorphic regions, such as the major histocompatibility complex (MHC) region, the human leukocyte antigen (HLA, chromosome 6p) system (<xref ref-type="bibr" rid="ref-86">Meyer and Nunes, 2017</xref>; <xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>) and the genomic regions coding for immunoglobulin chains (on chromosome 2 for kappa light chain, chromosome 22 for lambda light chain and chromosome 14 for the heavy chain) (<xref ref-type="bibr" rid="ref-83">McBride <italic>et al</italic>., 1982a</xref>, <xref ref-type="bibr" rid="ref-84">1982b</xref>), to study low-quality sources, such as ancient DNA (<xref ref-type="bibr" rid="ref-100">Razali <italic>et al</italic>., 2021</xref>), and to investigate extremely rare variants (<xref ref-type="bibr" rid="ref-109">Sazonovs and Barrett, 2018</xref>).</p>
<p>In this review, we present an overview of genotype imputation, focusing on its application in human genomics, illustrating the major players of this inferential process, showing the main methods and approaches to obtain good-quality and reliable imputed data, and deeply exploring the main areas of investigation and research. Finally, we preview the future of genotype imputation, pointing out its major limitations and the challenges that have been overcome and that imputation is facing in the genomic big data and next-generation sequencing era.</p>
</sec>
<sec id="s2">
<title>A Brief History of Imputation</title>
<p>At present, imputation is becoming a common habit in several research designs, representing a standard procedure in different pipelines for genomic analyses. However, imputation has significantly got far from its original conception, broadening its scope and applications.</p>
<sec id="s2_1">
<title>First traces of imputation</title>
<p>The possibility to predict haplotypes and, therefore, alleles of ungenotyped variant loci arose from the fundamental concept in genetics that individuals of a given population share the haplotype stretches that have descended from common ancestors. This is evident when investigating the transmission of alleles into families (<xref ref-type="bibr" rid="ref-77">Malerba <italic>et al</italic>., 2000</xref>). Siblings share a higher number of identical stretches of DNA, deriving from parents, than a randomly chosen group of individuals. Therefore, individuals having a common ancestor are expected to share haplotype stretches whose extent depends on the temporal distance (number of recombination events throughout generations) between the individuals and the common ancestor.</p>
<p>The idea that individuals share identical genetic stretches (defined as haplotypes) led researchers to explore and implement imputation in genetic studies: genetic similarities among individuals suggested they could share the same ancestral haplotype and hence this turned out to be effective in genotype reconstruction of missing genotypes in related individuals. Since imputation is based on the linkage disequilibrium structure of a given genomic region, it is important to phase the alleles at different close variant loci which is the process of addressing marker alleles to either maternal or paternal haplotypes. This task can be challenging in unrelated individuals whereas it could be easier when performing family-based studies to infer genotype distribution in pedigrees and test genetic models of inheritance for traits and diseases (<xref ref-type="bibr" rid="ref-41">George and Elston, 1987</xref>; <xref ref-type="bibr" rid="ref-31">Fulker <italic>et al</italic>., 1999</xref>).</p>
</sec>
<sec id="s2_2">
<title>The theoretical imputation process</title>
<p>The general procedure behind imputation, originally developed on pedigrees, was based on three main steps: (1) some samples belonging to the same family were genotyped on several thousands (or hundreds of thousands) of markers distributed across the genome, (2) information on the haplotype to which these markers belonged were arranged and collected and (3) untyped markers of the remaining samples (of the same family) were inferred, recognizing the shared haplotypes with the fully characterized individuals (<xref ref-type="fig" rid="fig-1">Fig. 1</xref>). The whole process was entitled &#x201C;<italic>in silico</italic> genotyping&#x201D; (<xref ref-type="bibr" rid="ref-10">Burdick <italic>et al</italic>., 2006</xref>), most likely because of its ability to predict genotypes from missing genetic information (<xref ref-type="fig" rid="fig-1">Fig. 1</xref>).</p>
<fig id="fig-1">
<label>Figure 1</label>
<caption>
<title>Genotype imputation process. On top (A), two distinct phased haplotypes (Phased Haplotype 1 and Phased Haplotype 2) from a single individual are reported with missing genetic data. The middle section (B) shows the reference panel, made up of several haplotypes (colored lines). The phased haplotypes (see A) are compared with each haplotype of the reference panel (see B); genotype imputation infers the missing regions from the matching haplotypes of the reference panel (see B). Results (C) show the inferred haplotypes. The first haplotype has been inferred as the product of recombination of the yellow and blue haplotypes of the reference panel; the second haplotype has been inferred as the product of recombination of the pink, green and red haplotypes of the reference panel. For both haplotypes, the missing information has been fulfilled with different levels of certainty.</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Biocell-47-27884-f001.tif"/>
</fig>
</sec>
<sec id="s2_3">
<title>Moving towards unrelated individuals</title>
<p>Sooner, research interest in imputation moved to samples of unrelated individuals, which mostly build up the core of case-control studies. Compared to pedigrees, unrelated individuals did not share long haplotypes (as in the case of family members), but shorter regions that, in any case, could still be identical-by-descent, because of the presence of common ancestors, albeit far distant in time. Because a direct, informative comparison was not possible, as in the case of consanguineous individuals, the imputation process needed to be slightly revised from its original design. Specifically, (1) a catalog of detailed haplotypes for the genotyping of many individuals was set (reference panel); (2) then, phasing of close markers and comparison of phased haplotypes with the detailed haplotypes of the reference panel was performed, and (3) finally missing genotypes were predicted based on the haplotypes of the reference panel matching the phased haplotypes of the sample set.</p>
</sec>
<sec id="s2_4">
<title>The modern imputation</title>
<p>The analysis of unrelated samples started moving the focus of imputation from close little groups of related samples, at most made of tens of individuals (<xref ref-type="bibr" rid="ref-78">Marchani <italic>et al</italic>., 2012</xref>; <xref ref-type="bibr" rid="ref-13">Chen <italic>et al</italic>., 2013</xref>), to higher amounts of individuals, not necessarily related, and sharing at a certain point similar genetic information, the so-called reference panel (<xref ref-type="bibr" rid="ref-7">Browning and Browning, 2016</xref>). The modern imputation finally began. Today, in the big data and next-generation sequencing era, genotype imputation relies on enormous reference panels, made of hundreds of thousands of samples and millions of genome-wide markers (<xref ref-type="bibr" rid="ref-85">McCarthy <italic>et al</italic>., 2016</xref>). Since the reference panels derive from individuals belonging to many ethnic groups, imputation may be conducted using the proper reference panels, even though it is not still clear what is their most effective structure (for example, haplotypes from the world-wide population or from the population having the same ethnicity of the tested samples) (<xref ref-type="bibr" rid="ref-103">Roshyara <italic>et al</italic>., 2016</xref>; <xref ref-type="bibr" rid="ref-22">Degenhardt <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-61">Kowalski <italic>et al</italic>., 2019</xref>). To handle these data, fast and efficient algorithms, as well as powerful and performant computational resources, are constantly improved, to keep up with the pace of present-day needs.</p>
</sec>
</sec>
<sec id="s3">
<title>The Imputation Workflow</title>
<p>Genotype imputation can be summarized as a comparative process between a sample set and a known target set aimed at filling missing gaps in the sample set using the information retrieved in the target set. Here, we define as &#x201C;sample set&#x201D; the input dataset, made of individuals who have been sparsely genotyped along the genome and as &#x201C;reference panel&#x201D; the set of individuals (haplotypes) who have been assayed at the genome level and are used as the template to impute retrieved information in the sample set.</p>
<sec id="s3_1">
<title>General workflow</title>
<p>The sample set is commonly made of tens or hundreds of individuals who have been genotyped for some thousands of markers along the whole genome. The reference panel, instead, is a set of several tens or hundreds of thousands of individuals who have been genotyped at the genomic level for several hundreds of thousands or millions of loci. In the reference panel, the genetic information of all the markers is well characterized and, for this reason, it acts as the template for comparisons with the sample set. Indeed, the typed markers in the sample set are matched with the markers stored in the reference panel; the aim of this comparison is to find matches between groups of markers, the haplotypes. Hence, the matching markers are used as seeds to reconstruct in every sample of the sample set the genetic information (or genotypes) of the surrounding markers.</p>
</sec>
<sec id="s3_2">
<title>A new perspective: Probabilistic genotypes</title>
<p>Following the proposed workflow, imputation can infer intervals of imputed variants surrounded by stretches of genotyped markers; imputed variants are generated from the comparison between the sample set and the reference panel. However, this inference does not generate results having absolute certainty: imputed markers show a variable level of uncertainty, depending on several factors (see section <italic>Factors affecting imputation</italic>), and for this reason, cannot be represented as standard genotypes. Indeed, each imputed marker is reported not as a discrete genotype but as a genotype probability (GP), that is to say giving two alleles 0 and 1, the probability of an individual in the sample set of being homozygous for the reference allele (0/0), heterozygous (0/1), and homozygous for the alternative allele (1/1). To summarize these probabilistic values, imputed data are mostly represented in terms of allelic dosage (DS), a value between 0 and 2; allelic dosage, or dosage, reports for every marker the number of alternative (or risk) alleles carried by a single individual in the sample set. Usually, a dosage equal to 0 corresponds to a probability of 1 of being homozygous for the reference allele, a dosage equal to 1 to a probability of 1 of being heterozygous and a dosage of 2 of being homozygous for the alternative alleles. Because the allelic dosage is a continuous value, all the possible values between 0 and 2 represent a probability of being closer to one of the three playing genotypes.</p>
</sec>
</sec>
<sec id="s4">
<title>Tools for Imputation</title>
<p>The advent of genotype imputation boosted an immediate development of a great variety of tools and methodologies, to facilitate the management if the increasing computational load. Imputation software was initially clustered into two main categories (<xref ref-type="bibr" rid="ref-66">Li <italic>et al</italic>., 2009</xref>): computationally intensive, like IMPUTE (<xref ref-type="bibr" rid="ref-80">Marchini <italic>et al</italic>., 2007</xref>; <xref ref-type="bibr" rid="ref-47">Howie <italic>et al</italic>., 2009</xref>) and MACH (<xref ref-type="bibr" rid="ref-110">Scott <italic>et al</italic>., 2007</xref>), which took into account every genotyped marker when imputing each missing genotypes and computationally efficient, like PLINK (<xref ref-type="bibr" rid="ref-96">Purcell <italic>et al</italic>., 2007</xref>; <xref ref-type="bibr" rid="ref-12">Chang <italic>et al</italic>., 2015</xref>) and BEAGLE (<xref ref-type="bibr" rid="ref-6">Browning and Browning, 2007</xref>), which focused on neighboring markers when imputing each missing genotypes.</p>
<p>So far, among the many methods which have been developed to achieve good imputation results, the implementation based on Hidden Markov Models (HMMs, <xref ref-type="bibr" rid="ref-117">Stephens and Donnelly, 2000</xref>) outperforms all the other approaches. Most, if not all, of current imputation methods, rely on HMMs: despite the different algorithmic implementations, most imputation software shows similar performances (<xref ref-type="bibr" rid="ref-79">Marchini and Howie, 2010</xref>). The main features which mostly differentiate imputation methods regard the way to handle the reference panels and data storage compression, the input data and the computed output values, and method-specific implementations, like the ability to perform phasing and imputation in one or two steps and the possibility to tweak parameters and set exclusion criteria (<xref ref-type="bibr" rid="ref-114">Spencer <italic>et al</italic>., 2009</xref>).</p>
<sec id="s4_1">
<title>Present-day imputation approaches</title>
<p>The first software implementations were extremely intensive and required a huge amount of computational power, memory, and data storage. During the past years, the methods underwent considerable improvements, becoming faster, more efficient, and more effective in every step of the genotype imputation procedure. Today, software implementation allows to impute not only bi-allelic SNPs but also multi-allelic variants (either SNPs or small INDELs); recent studies pointed out that all previously developed tools performed well on bi-allelic variants, but only new generation tools can handle multi-allelic variants, producing reliable results (<xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>). Indeed, current genotype imputation software is mostly categorized into two significantly different methods: population-based imputation (PBI) and family-based imputation (FBI) (<xref ref-type="bibr" rid="ref-70">Liu <italic>et al</italic>., 2019</xref>). The PBI methods are the most used and well-established approaches for genotype imputation; they usually rely on a reference panel of unrelated samples and make use of linkage disequilibrium and correlation between close SNPs of a specific population to predict ungenotyped markers. On the other hand, FBI methods are used to infer genotypes of related individuals; they rely on familial information, such as pedigrees and identity by descent, to reconstruct allelic phases and infer unobserved genotypes (<xref ref-type="bibr" rid="ref-106">Saad and Wijsman, 2014a</xref>). Different studies investigated the strengths and weaknesses of these methods (<xref ref-type="bibr" rid="ref-107">Saad and Wijsman, 2014b</xref>; <xref ref-type="bibr" rid="ref-70">Liu <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-122">Ullah <italic>et al</italic>., 2019</xref>). PBI showed to be particularly suitable for the investigation of common variants (frequency greater than 5%), having a higher imputation accuracy when the investigated sample set is closely related to the population of the reference panel; moreover, the size of the reference panel showed to increase imputation accuracy for rarer variants (frequency lower than 5%), probably due to their higher frequencies in a larger dataset. On the contrary, FBI methods are widely used for the imputation of rare variants, due to their increased frequencies in small and inbred populations, such as in families, but seemed to perform poorly on common variants; however, the use of study-specific reference panel seems to increase imputation accuracy for common variants, probably due to higher precision in the haplotypes of local reference samples (<xref ref-type="bibr" rid="ref-70">Liu <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-127">Whalen <italic>et al</italic>., 2019</xref>). For all these reasons, hybrid approaches have been explored (<xref ref-type="bibr" rid="ref-71">Liu <italic>et al</italic>., 2013</xref>; <xref ref-type="bibr" rid="ref-62">Kreiner-M&#x00F8;ller <italic>et al</italic>., 2015</xref>; <xref ref-type="bibr" rid="ref-63">Lent <italic>et al</italic>., 2016</xref>), to make good use of the advantages of the two strategies, showing a considerable increase in genotype imputation performance (<xref ref-type="bibr" rid="ref-122">Ullah <italic>et al</italic>., 2019</xref>). Of note, nowadays most used and standard software are Beagle 5.4 (<xref ref-type="bibr" rid="ref-9">Browning <italic>et al</italic>., 2018</xref>, <xref ref-type="bibr" rid="ref-8">2021</xref>), Eagle 2.4.1 (<xref ref-type="bibr" rid="ref-72">Loh <italic>et al</italic>., 2016</xref>, <xref ref-type="bibr" rid="ref-73">2016</xref>) and SHAPEIT4 (<xref ref-type="bibr" rid="ref-23">Delaneau <italic>et al</italic>., 2019</xref>) for phasing and Beagle 5.4 (<xref ref-type="bibr" rid="ref-9">Browning <italic>et al</italic>., 2018</xref>, <xref ref-type="bibr" rid="ref-8">2021</xref>), IMPUTE5 (<xref ref-type="bibr" rid="ref-105">Rubinacci <italic>et al</italic>., 2020</xref>) and Minimac4 (<xref ref-type="bibr" rid="ref-20">Das <italic>et al</italic>., 2016</xref>) for population-based imputation, and GIGI (<xref ref-type="bibr" rid="ref-14">Cheung <italic>et al</italic>., 2013</xref>), Merlin (<xref ref-type="bibr" rid="ref-10">Burdick <italic>et al</italic>., 2006</xref>) and cnF2freq (<xref ref-type="bibr" rid="ref-90">Nettelblad, 2012</xref>) for family-based imputation. In the end, the most promising hybrid approaches, taking advantage of both PBI and FBI methods, are Fimpute (<xref ref-type="bibr" rid="ref-108">Sargolzaei <italic>et al</italic>., 2014</xref>), and the combination of IMPUTE2 with Merlin (<xref ref-type="bibr" rid="ref-70">Liu <italic>et al</italic>., 2019</xref>) and GIGI with Beagle (<xref ref-type="bibr" rid="ref-106">Saad and Wijsman, 2014a</xref>).</p>
</sec>
<sec id="s4_2">
<title>Online platform for imputation</title>
<p>The constant increase of computational load, mostly related to the number of input markers and samples and the size of the reference panels, pointed out the need for powerful infrastructure to perform heavy computational tasks. Two free next-generation genotype imputation servers have been developed, respectively the Michigan Imputation Server and the TOPMed Imputation Server (<xref ref-type="bibr" rid="ref-20">Das <italic>et al</italic>., 2016</xref>). The platforms consist of a user-friendly interface in which the user, upon registration, can input their sample data (as a gzipped variant call format, one for each chromosome), setting the desired parameters like choosing standard reference panels (such as the 1000 Genomes Project phase 3, the Haplotype Reference Consortium panel and the TOPMed r2 panel; see section <italic>Reference panels</italic>) and an optional subpopulation, quality filtering to apply on the imputed data (see section <italic>Quality filtering for good imputation results</italic>), and finally choosing to perform phasing, imputation or both the two procedures sequentially. Both Michigan and the TOPMed Imputation Servers use Eagle 2.4 and Minimac4 to perform phasing and imputation, respectively. Although the combination of Eagle 2.4 and Minimac4 was assayed as one of the slowest approaches when compared to other software (such as Beagle 5), it turned out to be the most efficient in terms of computational resources for every size of the input sample set (<xref ref-type="bibr" rid="ref-9">Browning <italic>et al</italic>., 2018</xref>), representing the most suitable solution for online queued platforms.</p>
</sec>
<sec id="s4_3">
<title>The computational load of imputation</title>
<p>In general, genotype imputation requires high amounts of time and resources. State-of-the-art tools and algorithms have boosted performances, not only in managing big data but also in lowering the running time and computational load. Moreover, computation time and sample size are indirectly related: per-sample running time to perform imputation increases together with the decrease in sample size. This counterintuitive procedure is related to the fixed computational cost of reading the reference panel, which is shared among all the samples (<xref ref-type="bibr" rid="ref-9">Browning <italic>et al</italic>., 2018</xref>). Thus, the imputation procedure focuses on groups rather than on single individuals; this overview is confirmed not only from a constitutive perspective (that is, the need to have a group of individuals in both the sample and the reference panels, to clarify as much as possible the frequencies of the haplotypes in the working dataset), but also from a computational perspective.</p>
<p>In the last couple of years, with the advent of cloud computing services (such as Amazon Web Services, Microsoft Azure, or Google Cloud), imputation pipelines started to be optimized on pre-defined and suited virtual machines, in order to dramatically reduce the cost of imputation: a recent study pointed out the possibility to perform imputation at a cost lower than 1 U.S. cent per sample (<xref ref-type="bibr" rid="ref-9">Browning <italic>et al</italic>., 2018</xref>). The fact that imputation has been implemented in the context of a global computation indicates how much it is a fundamental tool in the most current frontier research.</p>
</sec>
</sec>
<sec id="s5">
<title>Reference Panels</title>
<p>Beyond the imputation software used, another key player is the reference panel. The reference panel was originally defined as a collection of individuals typed at a dense set of SNPs (<xref ref-type="bibr" rid="ref-66">Li <italic>et al</italic>., 2009</xref>). With the advancements in genotyping and sequencing technologies, it has been possible to type several hundreds of thousands of individuals for hundreds of thousands or millions of genetic loci at the genome-wide level that can be included in the reference panels (<xref ref-type="fig" rid="fig-2">Fig. 2</xref>).</p>
<fig id="fig-2">
<label>Figure 2</label>
<caption>
<title>The most relevant reference panels for the imputation of human populations. The picture shows the relevant features (number of markers, samples, and ethnicity), and the year of release of the most important reference panels for genome-wide imputation of human samples. The blue box reports the worldwide human population samples included into the different panels.</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Biocell-47-27884-f002.tif"/>
</fig>
<sec id="s5_1">
<title>A brief history of reference panels</title>
<p>Historically, the first reference panels for genotype imputation came from international consortia: the International HapMap project and the 1000 Genomes Project. Both projects aimed to characterize the genetic variability of human populations.</p>
<p>The main differences among them were the number of samples and genetic markers available as well as the number of represented human populations and ethnicities. In chronological order, the first two panels were the HapMap phase I (<xref ref-type="bibr" rid="ref-49">International HapMap Consortium, 2005</xref>) and phase II (<xref ref-type="bibr" rid="ref-50">International HapMap Consortium <italic>et al</italic>., 2007</xref>) which comprised 269 samples across four populations (Yoruba in Ibadan, Japanese in Tokyo, Han Chinese in Beijing and U.S. Utah residence with northern and western Europe ancestry) on 1 and 3 million of SNPs, respectively. With the spreading of first- and second-generation sequencing technologies, the 1000 Genomes Project (1000 <xref ref-type="bibr" rid="ref-38">Genomes Project Consortium <italic>et al</italic>., 2010</xref>) came to light with its phase I (1000 <xref ref-type="bibr" rid="ref-39">Genomes Project Consortium <italic>et al</italic>., 2012</xref>) and phase III (1000 <xref ref-type="bibr" rid="ref-40">Genomes Project Consortium <italic>et al</italic>., 2015</xref>) panels, comprising 2504 individuals from 26 populations for over 88 million variants. In 2016, the 1000 Genomes Project established itself as the standard resource for genomic analysis, superseding the HapMap panels. However, in the last five years, alternative and more powerful resources have been developed: a clear example is the Trans-Omics for Precision Medicine (TOPMed) reference panel (<xref ref-type="bibr" rid="ref-20">Das <italic>et al</italic>., 2016</xref>), made of around 180 thousand sequenced individuals (60% of non-European ancestry) from more than 85 different studies for a total of 308 million variants (version r2).</p>
</sec>
<sec id="s5_2">
<title>Populations in the reference panels</title>
<p>One of the main features and concerns regarding the reference panels is the structure of populations and/or subpopulations included in the reference panel. The presence of a reference panel equipped with a reliable reference population, having a high number of markers and individuals (and hence, a good picture of the most common haplotypes across the population), is a fundamental step for good quality imputation results (see section <italic>Factors affecting imputation</italic>). During the comparison of samples of both unphased sample set and reference panel, genotype imputation can lead to stronger results when the two sets are highly similar (i.e., having a similar haplotype structure); on the contrary, when the genetic background of the sample and reference sets is quite distant, imputation is committed to a dramatical increasing of false and/or weak signals. As a rule, in order to increase the accuracy of imputation calls, a consensus between the ancestries of the sample and reference sets is fundamental (<xref ref-type="bibr" rid="ref-21">de Marino <italic>et al</italic>., 2022</xref>), but a suitable reference panel is not always available. Indeed, recent studies, which focused on sample sets of mixed ancestries or low-represented subpopulations, pointed out the limits of present-day reference panels. For this reason, several researchers (<xref ref-type="bibr" rid="ref-61">Kowalski <italic>et al</italic>., 2019</xref>; <xref ref-type="bibr" rid="ref-93">O&#x2019;Connell <italic>et al</italic>., 2021</xref>) tried to manage this issue by setting reference panels including individuals from different populations to ideally mimic the ancestries of the mixed populations. As a result, careful study designs together with a combination of reference populations from different ancestries seemed to be one of the most feasible approaches to overcome the bias of low-represented populations.</p>
</sec>
</sec>
<sec id="s6">
<title>Factors Affecting Imputation</title>
<p>The probabilistic nature of genotype imputation does not permit to have absolute certainty of imputed calls. Studies on genotype imputation all agreed on several factors affecting imputation quality and results (<xref ref-type="bibr" rid="ref-54">Johnson <italic>et al</italic>., 2013</xref>; <xref ref-type="bibr" rid="ref-59">Khankhanian <italic>et al</italic>., 2015</xref>; <xref ref-type="bibr" rid="ref-112">Shi <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-34">Geibel <italic>et al</italic>., 2021a</xref>; <xref ref-type="bibr" rid="ref-131">Zhang <italic>et al</italic>., 2021</xref>). The most known causes are due to the technology used for sample genotyping, the number of markers and the minor allele frequency (MAF) values in either the sample set or the reference panel and the selected reference population.</p>
<sec id="s6_1">
<title>Genotyping platforms</title>
<p>Genotyping technology (arrays or sequencing) plays a crucial role in imputation performances. Since each genotyping platform spans differently along the genome, it also determines the different sets of markers that would be inferred by the imputation process. The technology impacts the number of variant sites that can be tested (<xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>). Moreover, not only the sample set but also the reference panels are affected by the genotyping techniques. Depending on how the reference panel has been generated (setup of variant sites included), imputation might achieve different results (i.e., only the variant sites present in the reference panel can undergo imputation). Panels of hundreds of thousands of markers widespread throughout the genome and representing different genomic locations (for example, regions having different genomic complexity) positively affect imputation, guaranteeing good-quality results. On the contrary, panels having low SNP density or a small population size would negatively impact imputation results (<xref ref-type="bibr" rid="ref-19">Das <italic>et al</italic>., 2018</xref>).</p>
</sec>
<sec id="s6_2">
<title>Minor allele frequencies</title>
<p>Imputation is also influenced by minor allele frequencies (the allele at a variant site having the lower frequency). Common variants (&#x003E;5%), which are usually well represented across reference panels and present enough in sample sets, are smoother to impute than rarer variants. Moreover, imputation of low frequency (1%&#x2013;5%) and rare (&#x003C;1%) variants faces several limitations (<xref ref-type="bibr" rid="ref-129">Yu <italic>et al</italic>., 2022</xref>). Imputation can just infer known genetic information, which is stored in the reference panel; low-frequent to ultrarare (&#x003C;0.1%) variants are usually not much represented in reference panels and, for this reason, would be inferred with low probabilities, resulting in unreliable calls (<xref ref-type="bibr" rid="ref-131">Zhang <italic>et al</italic>., 2021</xref>). Rare variants are in strict dependency on the chosen reference population: rare allelic frequencies are most of the time specific to a precise population, and the usage of a population not too close to the sample set may trigger a huge number of false positives (<xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>). However, several approaches have been developed to increase imputation accuracy for rarer variants, such as using imputation methods that combine population-based and family-based approaches (see section <italic>Present-day imputation approaches</italic>) and the usage of specific reference panels (see section <italic>Reference populations</italic>).</p>
</sec>
<sec id="s6_3">
<title>Reference populations</title>
<p>Despite the need for a reference panel, reference populations can backfire on imputed results. The choice of an accurate and sample-specific reference panel is crucial to ensure good quality results. However, a perfect match between sampled and reference individuals is only theoretically possible due to the large number of admixed samples in modern-day human populations (<xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>). Thus, the lower amount of matches between samples and reference set may increase the levels of uncertainty in imputed genotype calls, prominently for admixed populations. However, the application of state-of-the-art methodologies, together with a fine phasing of genetic data and a deeper analysis of the genetic background of samples, may mitigate population biases, generating reliable imputed results (<xref ref-type="bibr" rid="ref-21">de Marino <italic>et al</italic>., 2022</xref>). Several studies (<xref ref-type="bibr" rid="ref-69">Lin <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-123">Vergara <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-4">Bai <italic>et al</italic>., 2019</xref>) have investigated how imputation accuracy is affected by the presence of different human populations in the reference panel when analyzing underrepresented ethnic groups. The increasing distance between the sample set and the reference population negatively affects the imputation accuracy of common (&#x003E;5%) and low-frequent variants (&#x003E;1%); however, a small fraction of diversity in the reference panel seems to improve imputation performance in rare variants (&#x003C;1%): examples have been reported for Han Chinese (<xref ref-type="bibr" rid="ref-4">Bai <italic>et al</italic>., 2019</xref>), Hawaiians (<xref ref-type="bibr" rid="ref-67">Lin <italic>et al</italic>., 2020</xref>), Turkish (<xref ref-type="bibr" rid="ref-56">Kars <italic>et al</italic>., 2021</xref>) and Latin Americans (<xref ref-type="bibr" rid="ref-53">Jim&#x00E9;nez-Kaufmann <italic>et al</italic>., 2022</xref>). Nevertheless, genotype imputation of non-European samples still represents a challenging task, due to the predominance of European samples in most reference panels. To answer this need, several research projects started developing reference panels tailored to underrepresented populations: notable examples are the CAAPA reference panel for African Americans (<xref ref-type="bibr" rid="ref-93">O&#x2019;Connell <italic>et al</italic>., 2021</xref>) (developed by the homonymous Consortium on Asthma among African-ancestry Populations in the Americas) (<xref ref-type="bibr" rid="ref-82">Mathias <italic>et al</italic>., 2016</xref>), the GAsP reference panel for Asian (<xref ref-type="bibr" rid="ref-37">GenomeAsia100K Consortium, 2019</xref>) (developed by the homonymous Genome Asia Pilot project together with the GenomeAsia 100K Project) and the ChinaMAP for Chinese (<xref ref-type="bibr" rid="ref-65">Li <italic>et al</italic>., 2021</xref>).</p>
</sec>
<sec id="s6_4">
<title>Finer factors affecting imputation</title>
<p>Other than the main factors affecting imputation quality, researchers identified finer and more specific parameters that need to be considered as a source of unreliable results: recombination rate, GC content, the distance between genotyped markers, the presence of structural variants and segmental duplications. Among these factors, a higher recombination rate, a lower GC content, the presence of structural variants, and segmental duplications have been found to be associated with lower-quality imputation results (<xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>), showing consistency and persistency across every method.</p>
</sec>
</sec>
<sec id="s7">
<title>Quality Filtering for Good Imputation Results</title>
<p>In order to overcome spurious results and obtain good quality imputed data, several filtering thresholds for pre-imputation and post-imputation quality controls are commonly applied.</p>
<sec id="s7_1">
<title>Pre-imputation filtering</title>
<p>Before imputation, genotyped samples undergo several exclusion criteria (<xref ref-type="bibr" rid="ref-66">Li <italic>et al</italic>., 2009</xref>; <xref ref-type="bibr" rid="ref-21">de Marino <italic>et al</italic>., 2022</xref>; <xref ref-type="bibr" rid="ref-43">Hanks <italic>et al</italic>., 2022</xref>) to remove individuals having low call rates (&#x003C;95%) and high missingness (&#x003E;2%), significantly deviating from the Hardy-Weinberg Equilibrium (<italic>p</italic>-value &#x003C; 10e-6) and duplicates. Moreover, allele labeling (reporting allele names referring to the same DNA strand for all the variant sites and all the sample sets, including the reference panel) between sampled markers and reference panel is checked, to avoid any comparison mistake, the harbinger of unreal mismatches or wrong strand reading (as for the markers where the two alleles are either A and T or C and G). In the end, samples are analyzed for their genetic background. Individuals are assayed through principal component analysis (PCA) for their genetic origins, to investigate the real ancestral background and to choose the most suitable reference panel for imputation.</p>
</sec>
<sec id="s7_2">
<title>Post-imputation filtering</title>
<p>To assay imputation quality and reliability, it is necessary to perform post-imputation quality control. Three main methods have been developed to assess imputation outcomes: concordance between genotypes, imputation quality score (IQS) and correlation between best-guessed genotypes (defined as R-squared or R2).</p>
<p>Assessing concordance between real and imputed genotypes represents a method to determine imputation quality. For every marker, genotype imputation infers a probabilistic value (in terms of genotype probabilities or alternative allele dosage), representing the probability of a genotype of being homozygous for the reference (0/0) or the alternative (1/1) allele or heterozygous (0/1). The approximation of genotype probabilities to a discrete genotypic value is far to be reliable and may lead to a higher level of wrong calls. Indeed, such an approximative method was previously tested (<xref ref-type="bibr" rid="ref-1">Abo <italic>et al</italic>., 2012</xref>) and benchmarked (<xref ref-type="bibr" rid="ref-79">Marchini and Howie, 2010</xref>) in early imputation studies, but was soon abandoned in favor of more functional metrics (<xref ref-type="bibr" rid="ref-21">de Marino <italic>et al</italic>., 2022</xref>).</p>
<p>Rather than approximating the probabilistic information to discrete genotypes, a more reliable way to evaluate imputation is to look at genotype probabilities directly. The first metric which was developed (<xref ref-type="bibr" rid="ref-68">Lin <italic>et al</italic>., 2010</xref>) was the imputation quality score (IQS). IQS discriminates between well-imputed and poorly imputed SNPs based on the genotype posterior probabilities. This metric compares the proportion of agreement between the haplotype-based and randomly imputed genotypes, weighted by the allelic frequencies of the imputed marker. Early usage of this metric showed remarkable improvement compared to the evaluation of the concordance approach (reported above), but since it is strongly dependent on allele frequency values, it is not suitable for rare variant sites.</p>
<p>Thus, the latest developed and most used metric is the R-squared (R2), defined as the correlation between the variance of the marker estimated from the counts of individual imputed alleles for every sample and the expected variance estimated from the overall allele frequencies. Hence, R2 is generally expressed as the ratio between the variance of the imputed alleles probabilities divided by the variance of the same alleles if they were perfectly imputed. In detail, R2 has been conceived in at least two different ways. Methods like MACH and Minimac express R2 values as the best approximation between the observed dosage variants on the expected dosage variants; instead, methods like Beagle and IMPUTE calculate the R2 values as the best approximation between the most likely genotype and the true unobserved expected allelic dosage (<xref ref-type="bibr" rid="ref-11">Chanda <italic>et al</italic>., 2012</xref>; <xref ref-type="bibr" rid="ref-99">Ramnarine <italic>et al</italic>., 2015</xref>).</p>
</sec>
</sec>
<sec id="s8">
<title>The Current Applications</title>
<p>Since its very first usage on pedigrees inference and resolution (<xref ref-type="bibr" rid="ref-41">George and Elston, 1987</xref>), genotype imputation has become a standard procedure for downstream analysis pipelines in routinary genomic studies. In the last decades, genotype imputation usage had exponential growth, thanks to both numerous advancements in genotyping and sequencing technologies together with the improvement in algorithms and methods, the empowerment of computational infrastructures, and the public availability of genetic and genomic data. Furthermore, imputation helped discover a new power in genotyped data, enriching their genetic information from their missingness. For all these reasons, genotype imputation is continuously applied to different genomic branches, such as data enrichment for genome-wide association studies, research harmonization and meta-analysis, analysis of regions of great complexity (such as MHC and HLA) or markers with rare or ultra-rare allelic frequencies. Indeed, recent studies pointed out the potential of genotype imputation in low coverage whole-genome sequencing (lcWGS) data, to confirm or discover rare and ultra-rare variants but also to provide insights on our genetic history, with the most recent application on ancient DNA (aDNA).</p>
</sec>
<sec id="s9">
<title>A Successful Combination: Genome-Wide Association Studies Empowered by Imputed Data</title>
<p>Genome-wide association studies are one of the most powerful techniques adopted to increase the chances of discovering novel variants and associated genes for a wide variety of traits and diseases. A large number of individuals are genotyped at a great number of polymorphic sites (ideally all the polymorphic sites of the human genome; realistically for hundreds of thousands or a few million markers), and then every locus is tested individually for the association within the trait under investigation. When, for a given polymorphic site, an allele is observed to be more common in affected individuals than in healthy individuals, the allele is reported as associated with the investigated trait or disease. This kind of analysis can be conducted at the allele or genotype level by testing for the association on all the sampled polymorphic sites (<xref ref-type="fig" rid="fig-3">Fig. 3</xref>).</p>
<fig id="fig-3">
<label>Figure 3</label>
<caption>
<title>Genotype imputation in a genome-wide association study (GWAS) workflow.</title></caption>
<graphic mimetype="image" mime-subtype="tif" xlink:href="Biocell-47-27884-f003.tif"/>
</fig>
<p>Data sampling: DNA samples and phenotypes are collected according to different designs.</p>
<p>Genotyping: samples are genotyped using either genotyping arrays or next-generation sequencing technologies (WGS: whole-genome sequencing; WES: whole-exome sequencing).</p>
<p>Data quality control: quality controls aim to filter out genotypes that do not meet the inclusion criteria, to not hamper imputation quality.</p>
<p>Haplotype phasing and imputation: unphased genotypes (GT) are compared to the known haplotypes of the reference panel, to infer the underlying haplotypes and therefore missing information; imputed data (Imputation output) show the phased genotypes (GT) together with the probabilistic representations in term of allelic dosage (DS) and genotype probabilities (GP), estimated by the comparison of the unphased genotypes and the reference haplotypes.</p>
<p>Data quality control: low-quality imputed data are filtered out (Final imputation output).</p>
<p>Association tests: final output undergoes association analysis for the investigated phenotype (GWAS) (<xref ref-type="bibr" rid="ref-120">Turner, 2018</xref>).</p>
<sec id="s9_1">
<title>Imputation and genomic scans</title>
<p>For several years, GWASs based solely on genotyped samples which were at most assayed for tens of thousands of markers, a small portion if compared to our genome. The increasing interest in genotype imputation and the early work on related and unrelated individuals showed researchers that combining the discovery potential of GWAS together with the ability of imputation to infer genetic data from missing information would have led to an increase in the statistical power of genomic scans (<xref ref-type="bibr" rid="ref-98">Quick <italic>et al</italic>., 2020</xref>). Early evidence of this successful combination was pointed out in diseases, such as type 2 diabetes in the Finnish population (<xref ref-type="bibr" rid="ref-110">Scott <italic>et al</italic>., 2007</xref>) and familial cases affected by multiple sclerosis (<xref ref-type="bibr" rid="ref-27">Dyment <italic>et al</italic>., 2008</xref>), and in complex traits (<xref ref-type="bibr" rid="ref-91">Newton-Cheh <italic>et al</italic>., 2009</xref>), identifying novel loci explaining genetic susceptibility, risk, or sample variance.</p>
<p>The possibility of combining genetic information coming from genotyping techniques together with probabilistic inferred data, immediately revealed the strengths and weaknesses of this approach. On the one hand, enriching typed data for a great number of markers that have not been previously genotyped allowed to reach reliable association signals and to confirm previously reported evidence, as for psoriasis (<xref ref-type="bibr" rid="ref-88">Nair <italic>et al</italic>., 2009</xref>) but also to identify novel loci of understudied diseases, as for eosinophilic granulomatosis with polyangiitis (<xref ref-type="bibr" rid="ref-76">Lyons <italic>et al</italic>., 2019</xref>). However, on the other hand, the methods available to study genetic signals did not support the probabilistic perspective generated in the imputation process. Approximating genotype probabilities or allelic dosages to genotype was firmly discouraged (<xref ref-type="bibr" rid="ref-66">Li <italic>et al</italic>., 2009</xref>), and for this reason, novel methodologies had to be developed.</p>
</sec>
<sec id="s9_2">
<title>Tools to study associations on imputed data</title>
<p>To overcome this initial bias in managing different representations of genetic data, several models that were able to account for probabilities and to handle uncertainty were implemented and integrated into association pipelines (<xref ref-type="bibr" rid="ref-79">Marchini and Howie, 2010</xref>). One of the first developed and probably most used models is the frequentist model. This model compares for every SNP a pre-determined model of association (such as additive, dominant, recessive, and others) against a model of no association, considering the genetic information as a likelihood probability. The frequentist model was immediately implemented since the earliest snptest release (<xref ref-type="bibr" rid="ref-80">Marchini <italic>et al</italic>., 2007</xref>; <xref ref-type="bibr" rid="ref-126">Wellcome Trust Case Control Consortium, 2007</xref>). However, the model faced immediate problems regarding spurious results, mostly due to small sample sizes, low allelic frequencies, and high levels of uncertainty in imputation calls (<xref ref-type="bibr" rid="ref-74">Lu <italic>et al</italic>., 2010</xref>). Novel approaches were developed to overcome these issues, mostly based on the Bayesian statistical method (<xref ref-type="bibr" rid="ref-116">Stephens and Balding, 2009</xref>). As for the frequentist model, Bayesian methods as well compare a model of association against a model of no association but implement a more complex and parametrized modeling. Despite the similarities and differences between these two approaches, genetic data which underwent imputation had to be considered as probabilistic, either in terms of genotype probabilities or allelic dosage, to ensure robust and unbiased association signals as demonstrated in several studies spanning from computational benchmarking (<xref ref-type="bibr" rid="ref-113">Song <italic>et al</italic>., 2018</xref>; <xref ref-type="bibr" rid="ref-55">J&#x00F8;rsboe and Albrechtsen, 2022</xref>) to genomic analyses investigating either disorder (<xref ref-type="bibr" rid="ref-125">Vissers <italic>et al</italic>., 2019</xref>) or traits (<xref ref-type="bibr" rid="ref-118">Tan <italic>et al</italic>., 2019</xref>).</p>
</sec>
</sec>
<sec id="s10">
<title>Study Harmonization and Meta-Analysis</title>
<p>The ability of genotype imputation to infer genetic data from missing values in the sample set turned out to be fundamental for the study combination. Cohorts from different research projects are rarely genotyped or sequenced with the same technologies and instruments. Thus, the absence of markers between two or more studies is a standard event, which happens because of the great variety of approaches and methodologies currently available. Moreover, a meta-analysis of genomic studies may reveal important outcomes, particularly related to complex traits and diseases. For example, a study on sepsis (<xref ref-type="bibr" rid="ref-44">Hernandez-Beeftink <italic>et al</italic>., 2022</xref>) conducted on more than six hundred samples from the GEN-SEP network (<xref ref-type="bibr" rid="ref-42">Guillen-Guio <italic>et al</italic>., 2020</xref>) was able to enrich its sampled dataset to over 7 million SNPs and combining it to thousands of individuals from the MESSI (<xref ref-type="bibr" rid="ref-101">Reilly <italic>et al</italic>., 2018</xref>); as a result, the study identified three independent low-frequency variants associated with reduced 28-day sepsis survival. A good example of data integration guided by genotype imputation is represented by research consortia. Consortia aims in integrating multiple results to empower research on a particular topic or phenotypes, as in the case of the CKDGen Consortium, an international collaboration for the genetic investigation of kidney functions in health and disease. In a study on chronic kidney disease (CKD) in children (<xref ref-type="bibr" rid="ref-128">Wuttke <italic>et al</italic>., 2016</xref>), they successfully integrated three pediatric CKD cohorts (<xref ref-type="bibr" rid="ref-32">Furth <italic>et al</italic>., 2006</xref>; <xref ref-type="bibr" rid="ref-29">ESCAPE Trial Group <italic>et al</italic>., 2009</xref>; <xref ref-type="bibr" rid="ref-97">Querfeld <italic>et al</italic>., 2010</xref>) and identified ten regions associated with creatinine clearance (GFRcrea), four regions with proteinuria and six regions with CKD.</p>
</sec>
<sec id="s11">
<title>Imputation of Highly Polymorphic Regions: The Example of the Major Histocompatibility Complex Region</title>
<p>The human major histocompatibility complex (MHC) region is also known as the human leukocyte antigen (HLA) complex. It maps on the short arm of chromosome 6 (6p21.3) (<xref ref-type="bibr" rid="ref-15">Choo, 2007</xref>; <xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>), and it is known to be the most gene-dense region along the human genome (&#x007E;260 genes in &#x007E;4 Mb of length) (<xref ref-type="bibr" rid="ref-119">Trowsdale and Knight, 2013</xref>; <xref ref-type="bibr" rid="ref-58">Kennedy <italic>et al</italic>., 2017</xref>). The MHC genes can be classified into three different groups by sequence similarity and function: MHC class I, class II, and class III (<xref ref-type="bibr" rid="ref-15">Choo, 2007</xref>; <xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>).</p>
<sec id="s11_1">
<title>A summary of the human leukocyte antigen region</title>
<p>The HLA complex presents the highest level of polymorphisms in human genomes (<xref ref-type="bibr" rid="ref-15">Choo, 2007</xref>). The high number of polymorphisms belonging to this region makes its genotyping quite challenging because of the continuous discoveries of new HLA alleles (<xref ref-type="bibr" rid="ref-115">Stefani <italic>et al</italic>., 2022</xref>; <xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>).</p>
<p>All known HLA alleles are reported in the IPD-IMGT/HLA database (<xref ref-type="bibr" rid="ref-102">Robinson <italic>et al</italic>., 2013</xref>); so far, HLA-A and HLA-C present more than 4000 known alleles each, and HLA-B reports over 5000 different alleles (according to the IPD-IMGT/HLA database release 3.50). Therefore, many of these alleles have a very low frequency in the worldwide population (<xref ref-type="bibr" rid="ref-17">Cook <italic>et al</italic>., 2021</xref>). It is noteworthy that the complexity of the HLA region depends on the huge number of variants mapping in the regions and that the variants sites might have multiple complex alleles (indels and/or more than one single nucleotide).</p>
<p>Thanks to several bioinformatic strategies, it was possible to investigate short genomic sequences coming from whole genome sequencing (WGS) or whole exome sequencing (WES), pointing out the different HLA types (<xref ref-type="bibr" rid="ref-28">Erlich, 2015</xref>). The knowledge of HLA variants and alleles makes it possible to infer HLA patterns. However, the standard WGS-based imputation estimates missing genotypes of the HLA region inaccurately (<xref ref-type="bibr" rid="ref-121">Uffelmann <italic>et al</italic>., 2021</xref>).</p>
</sec>
<sec id="s11_2">
<title>Association studies on the major histocompatibility complex/human leukocyte antigen region</title>
<p>With the advent of the Human Genome Project (<xref ref-type="bibr" rid="ref-16">Collins and Fink, 1995</xref>), the MHC region started to be unraveled, stating its complexity (<xref ref-type="bibr" rid="ref-86">Meyer and Nunes, 2017</xref>; <xref ref-type="bibr" rid="ref-58">Kennedy <italic>et al</italic>., 2017</xref>). Many GWAS for complex traits, such as cardiovascular, metabolic, and neurological diseases, have reported several associations with markers of the MHC region (<xref ref-type="bibr" rid="ref-58">Kennedy <italic>et al</italic>., 2017</xref>; <xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>).</p>
<p>The extreme complexity of this region always makes imputation quite difficult (<xref ref-type="bibr" rid="ref-121">Uffelmann <italic>et al</italic>., 2021</xref>). Due to the high levels of linkage disequilibrium (several extended haplogroups have been identified) and the great number of polymorphisms, it is important that the imputation process can use a good reference panel containing as many haplotypes as possible (<xref ref-type="bibr" rid="ref-28">Erlich, 2015</xref>; <xref ref-type="bibr" rid="ref-86">Meyer and Nunes, 2017</xref>; <xref ref-type="bibr" rid="ref-17">Cook <italic>et al</italic>., 2021</xref>).</p>
</sec>
<sec id="s11_3">
<title>Imputation of the major histocompatibility complex/human leukocyte antigen region</title>
<p>One of the major challenges to successfully performing genotype imputation in this complex region is to find a good and detailed reference panel. Thus, panels specific to the MHC/HLA region need to be accurately fine-mapped and comprise as many samples as possible, to overcome the enormous number of polymorphisms (<xref ref-type="bibr" rid="ref-17">Cook <italic>et al</italic>., 2021</xref>). When using sequencing, reliable imputation of the HLA region does need high read (short sequences) coverage and depth to call all the variability of the HLA region (<xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>).</p>
</sec>
<sec id="s11_4">
<title>Tools for imputation of the human leukocyte antigen complex</title>
<p>Several tools for imputation on HLA can infer missing HLA genotypes based on reference datasets and/or individual SNPs (<xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>).</p>
<p>Over the years, many imputation algorithms able to tackle LD features of the MHC region were developed (<xref ref-type="bibr" rid="ref-86">Meyer and Nunes, 2017</xref>; <xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>). All the algorithms for HLA imputation are based on probabilistic approaches. The most known and used are HLA&#x002A;IMP (<xref ref-type="bibr" rid="ref-25">Dilthey <italic>et al</italic>., 2011</xref>), specific to the European population, which was subsequently improved in HLA&#x002A;IMP:02 (<xref ref-type="bibr" rid="ref-24">Dilthey <italic>et al</italic>., 2013</xref>), able to handle multiple populations; indeed, SNP2HLA (<xref ref-type="bibr" rid="ref-51">Jia <italic>et al</italic>., 2013</xref>), based on Beagle, was implemented to impute not only the HLA-specific alleles but also the related aminoacidic sequence; finally, CookHLA (<xref ref-type="bibr" rid="ref-17">Cook <italic>et al</italic>., 2021</xref>) improved SNP2HLA algorithm, accounting better for linkage disequilibrium. Due to the complexity of studying HLA regions with imputed data, researchers developed HLA-TAPAS (<xref ref-type="bibr" rid="ref-75">Luo <italic>et al</italic>., 2021</xref>), a three-in-one solution for reference panel construction, imputation, and association studies on HLA genetic data.</p>
</sec>
<sec id="s11_5">
<title>A reference panel for human leukocyte antigen imputation</title>
<p>As for all the genome regions and populations, imputation accuracy is influenced by the choice of the reference panel. At first, the HLA-specific reference panel was built for a single ancestry, considering the haplotype structures (due to the linkage disequilibrium in the region) of the European population (<xref ref-type="bibr" rid="ref-25">Dilthey <italic>et al</italic>., 2011</xref>). Moreover, the advent of NGS technologies boosted the construction of larger HLA-specific reference panels, leading to a more accurate investigation between HLA genotypes and diseases (<xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>). Most recent research suggested that using multi-ethnic reference panels (<xref ref-type="bibr" rid="ref-75">Luo <italic>et al</italic>., 2021</xref>) could guarantee a more accurate genotype imputation (<xref ref-type="bibr" rid="ref-86">Meyer and Nunes, 2017</xref>; <xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>; <xref ref-type="bibr" rid="ref-89">Naito and Okada, 2022</xref>). A multi-ethnic reference panel was built from a high-coverage WGS dataset (<xref ref-type="bibr" rid="ref-75">Luo <italic>et al</italic>., 2021</xref>) and made available on the Michigan Imputation Server, together with an optimized procedure for HLA imputation. However, the accuracy and the sensitivity of the method are a major concern, as mixed populations can still lead to inaccurate results even if better than the ones using the standard reference panels (<xref ref-type="bibr" rid="ref-26">Douillard <italic>et al</italic>., 2021</xref>).</p>
</sec>
</sec>
<sec id="s12">
<title>Imputation of Low-Coverage Sequencing Regions</title>
<p>In the last two decades, a number of different genotyping arrays, based on different sets of polymorphic sites, have been used for several investigations, including the assessment of genetic distance (i.e., similarity studies) between individuals or populations, and GWAS for complex traits. It is known that when polymorphic site data are not obtained from a random sample of polymorphisms, the results of genetic studies can be distorted because of ascertainment biases (<xref ref-type="bibr" rid="ref-35">Geibel <italic>et al</italic>., 2021b</xref>). In this regard, genotype imputation improves the statistical power of GWAS (<xref ref-type="bibr" rid="ref-19">Das <italic>et al</italic>., 2018</xref>, <xref ref-type="bibr" rid="ref-34">Geibel <italic>et al</italic>., 2021a</xref>).</p>
<p>Compared to the whole set of possible detectable genotypes in the genome, the arrays can assay only a relatively small subset of pre-selected and fixed polymorphic sites, allowing the calling of low-density array-based genotypes.</p>
<p>With the emerging genotyping-by-sequencing technology, marker discovery and genotyping occur simultaneously, resulting in minimal ascertainment bias and therefore improving the accuracy of the genetic studies (i.e., genetic-based similarity, genotype imputation, and association studies) (<xref ref-type="bibr" rid="ref-45">Heslot <italic>et al</italic>., 2013</xref>).</p>
<p>However, despite the advancements in sequencing technologies and platforms, a whole genome sequencing-based study on several thousands of samples is still prohibitively expensive for many laboratories. In general, genotype imputation is used together with very large reference panels to increase the number of accurately imputed variants. To carry out a quality imputation, it is necessary to have certain attention, according to the cases under investigation. It may be possible that no large reference panels exist for the studied population or that the reference panel contains a too small number of individuals.</p>
<p>It has been suggested that to improve the accuracy of imputation on small sample sets of individuals and on low-frequency variants, it is crucial to maximize the genetics similarity between the sample set and the reference panel and to include in the reference panel as many polymorphic sites as possible (<xref ref-type="bibr" rid="ref-60">Korku&#x0107; <italic>et al</italic>., 2019</xref>). Moreover, in smaller populations, determining the optimal imputation strategy would be extremely challenging when high-density genotypes are not available. In non-human populations (for instance, cattle or poultry), one of the most used strategies is to sequence a subset of a population that is employed as the reference panel to perform genotype imputation with high accuracy (<xref ref-type="bibr" rid="ref-52">Jiang <italic>et al</italic>., 2022</xref>). Of note, according to the context, the parameters of the imputation software must be carefully fine-tuned, and the metrics to assess imputation quality (either in terms of reliability or accuracy) must be interpreted with caution, specifically if the genetic distance between the sample set and reference population is elevated (<xref ref-type="bibr" rid="ref-104">Roshyara and Scholz, 2015</xref>).</p>
<p>As a rule, it is recommended to use a large reference panel (better if individuals present a very small genetic distance from individuals of the investigated population) or, if not possible, to use genetically best-matched reference panels in which the individuals are of the same ethnicity of the individuals of the studied population. Larger reference panels are associated with higher computational costs and longer computation times for phasing and imputation. The need for new and more efficient software is therefore very compelling in this field today.</p>
<p>Although it may sound bizarre, imputation can be used not only within genotyping array-based studies but also in sequencing-based association studies. Among the various possible employment in the context of sequencing, it has been shown that imputation can be a very useful tool in the sequencing of low-coverage genomes and in the case of off-target regions from exome sequencing. Imputation was reported to be an efficient tool on extremely low-coverage sequencing (0.1&#x2013;0.5x) data to capture almost as much of the common (&#x003E;5%) and low-frequency (1%&#x2013;5%) variation across the genome with results that are very similar to the ones from classical SNP arrays, in many different ethnic groups (<xref ref-type="bibr" rid="ref-94">Pasaniuc <italic>et al</italic>., 2012</xref>). In this context, it was also shown that imputation can be used to impute variants mapping in the off-target regions of exome sequencing to some extent to the whole genome. Indeed, imputation on exome sequencing data is still under investigation; the first evidence showed that regions with &#x007E;3000 rare and common variants per 1 megabase, result in good quality imputed data. More interestingly, low coverage WGS (lcWGS) appears now to be an alternative technology to genotyping arrays for common genetic variant assessment in the context of genome-wide polygenic scores (PGS) calculation (<xref ref-type="bibr" rid="ref-46">Homburger <italic>et al</italic>., 2019</xref>). Indeed, lcWGS proved to have brilliant performance in imputation results and accuracy, showing comparable capabilities to genotyping array and overcoming ascertainment bias inherent to the variant selection of genotype array.</p>
</sec>
<sec id="s13">
<title>Imputation of Ancient DNA</title>
<p>In the last decade, genotype imputation was employed in ancient DNA (aDNA) investigation (<xref ref-type="bibr" rid="ref-36">Genome of the Netherlands Consortium, 2014</xref>). The ability of genotype imputation to infer and reconstruct genotypes of sampled individuals seemed particularly suitable for ancient samples. Indeed, DNA from ancient samples is totally subjected to degradation processes, including cross-linking, deamination, and fragmentation, which may lead to several difficulties in extraction, sequencing, and clear genotyping results (<xref ref-type="bibr" rid="ref-2">Allentoft <italic>et al</italic>., 2012</xref>). Although genotype imputation is capable of enriching samples with novel imputed variants, the technical difficulties in obtaining clear genetic signals as well as the unavailability of a specific reference panel for an ancient human genome, challenged modern methodologies and techniques. Due to the great uncertainty that aDNA analyses constantly face, new strategies seemed necessary. Low-coverage ancient DNA sequencing seemed to mitigate the quality issues of ancient samples; thus, it was a suitable method to deal with the probabilistic representation of genotypes, the core of the imputation process (<xref ref-type="bibr" rid="ref-3">Ausmees <italic>et al</italic>., 2022</xref>).</p>
<p>To achieve good quality results, the great majority of research on aDNA made use of low coverage (2X) and ultra-low coverage (0.5X&#x2013;1X) genotyping and sequencing strategies combined with phasing and genotype imputation methods (<xref ref-type="bibr" rid="ref-94">Pasaniuc <italic>et al</italic>., 2012</xref>), particularly using Beagle4 and Beagle5 (<xref ref-type="bibr" rid="ref-48">Hui <italic>et al</italic>., 2020</xref>). This combination of techniques turned out to be particularly successful, confirming genomic shifts and fluxes across ages in Hungarian samples from the Neolithic, Copper, Bronze, and Iron Ages (<xref ref-type="bibr" rid="ref-33">Gamba <italic>et al</italic>., 2014</xref>) and investigating ancient population substructures in the Portuguese population from the Neolithic to the Bronze Age (<xref ref-type="bibr" rid="ref-81">Martiniano <italic>et al</italic>., 2017</xref>).</p>
<p>Furthermore, several strategies were implemented to overcome reference bias. The absence of an ancient reference panel, whose development has become of great interest in the most recent research (<xref ref-type="bibr" rid="ref-3">Ausmees <italic>et al</italic>., 2022</xref>; <xref ref-type="bibr" rid="ref-5">Biddanda <italic>et al</italic>., 2022</xref>), forced the usage of reference panels made of modern-day samples. A two-step strategy was implemented on a modern reference panel using Beagle software, which demonstrated to achieve good results of imputation on low-coverage samples (<xref ref-type="bibr" rid="ref-48">Hui <italic>et al</italic>., 2020</xref>). This two-step procedure works as follow: (1) imputation is performed on a reference panel as similar as possible to the sample set, in term of genetic background and admixture; in this first step, the size of the reference panel is not taken into account; then, (2) the computed genotype likelihoods are added to the sample set and compared to a larger worldwide reference panel. Applications of the approach have been demonstrated to be successful, for example, in the fine estimation of Iberian population ancestries (<xref ref-type="bibr" rid="ref-124">Villalba-Mouco <italic>et al</italic>., 2019</xref>).</p>
</sec>
<sec id="s14">
<title>Past Achievements and Future Challenges</title>
<p>Genotype imputation is becoming increasingly common in genomic research, being nowadays part of the standard pipelines for genome-wide association studies. Since the very first approach of imputation to genetic data, several applications have been hypothesized, and numerous questions and challenges have arisen. No more than fifteen years ago, the imputation of HLA regions or the development of large reference panels constituted enormous limitations that may have stopped researchers from using genotype imputation in the genomic investigation.</p>
<p>Thanks to the advancements in genotyping and sequencing techniques as well as in computational implementation, imputation has become capable of combining cohorts to deeply investigate traits and diseases through the usage of meta-analysis and empowering genome-wide association studies by enriching samples with new untyped markers to discover newly associated variants and genes involved in traits and diseases.</p>
<p>Furthermore, the decreasing genotyping and sequencing costs permitted the genotyping of several hundreds of thousands of either markers or individuals. This enormous boost in the amount of genetic data enabled researchers to build up a larger reference panel, which could represent world-wide populations and carry information on both common and rare variants. However, population and subpopulation representation remain biased, particularly in terms of low-represented ancestries and ultra-rare variants, which may be of crucial relevance for determining risks related to modern-day diseases and for modeling polygenic risk scores. To overcome reference limitations, several recent studies proposed the combination of low-coverage whole genome sequencing and imputation and the integration of imputed samples into reference panels to lower per-sample costs and generate more specific and performant reference panels.</p>
<p>A powerful example of genotype imputation procedures is represented by the online next-generation imputation platforms, the Michigan Imputation Server and the TOPMed Imputation Server. These platforms allow standardization of phasing and genotype imputation processes, providing state-of-the-art tools and curated reference panels to answer either average or advanced needs for genotype imputation.</p>
<p>To sum up, genotype imputation is an impressive resource that can boost genetic and genomic research. Imputation resolutions are at the sample level; however, its perspective stands in the general sampled population as well as in the matching reference panel. The uncertainty, which is an intrinsic feature of this inferential technique, cannot be explained at the individual level but have a sense from a population perspective. Indeed, genotype probabilities and allelic dosages are a new descriptive measurement in genetics, mostly used for discrete genotype variables, and must be managed carefully using proper tools and parametrization. This specific probabilistic representation would allow better modelling of traits and disease as well as a better understanding of rare and ultra-rare variants, which may be fundamental to investigate the causes and hypothesize predictive models of present-day pathologies.</p>
</sec>
</body>
<back>
<sec>
<title>Funding Statement</title>
<p>The authors received no specific funding for this study.</p>
</sec>
<sec>
<title>Author Contributions</title>
<p>Manuscript conception and design: Mirko TRECCANI and Giovanni MALERBA; literature collection: Mirko TRECCANI, Elena LOCATELLI, Cristina PATUZZO, and Giovanni MALERBA; manuscript preparation: Mirko TRECCANI, Elena LOCATELLI, Cristina PATUZZO, and Giovanni MALERBA; realization of graphical content: Elena LOCATELLI; draft content revision: Cristina PATUZZO; manuscript revision and finalization: Giovanni MALERBA. All authors approved the final version of the manuscript.</p>
</sec>
<sec sec-type="data-availability">
<title>Availability of Data and Materials</title>
<p>Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.</p>
</sec>
<sec>
<title>Ethics Approval</title>
<p>Not applicable.</p>
</sec>
<sec sec-type="COI-statement">
<title>Conflicts of Interest</title>
<p>The authors declare that they have no conflicts of interest to report regarding the present study.</p>
</sec>
<ref-list content-type="authoryear">
<title>References</title>
<ref id="ref-1"><label>Abo <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Abo</surname> <given-names>R</given-names></string-name>, <string-name><surname>Hebbring</surname> <given-names>S</given-names></string-name>, <string-name><surname>Ji</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>ZB</given-names></string-name> <etal>et al.</etal></person-group> (<year>2012</year>). <article-title>Merging pharmacometabolomics with pharmacogenomics using &#x2018;1000 Genomes&#x2019; single-nucleotide polymorphism imputation: Selective serotonin reuptake inhibitor response pharmacogenomics</article-title>. <source>Pharmacogenetics and Genomics</source> <volume>22</volume>: <fpage>247</fpage>&#x2013;<lpage>253</lpage>. <pub-id pub-id-type="doi">10.1097/FPC.0b013e32835001c9</pub-id>; <pub-id pub-id-type="pmid">22322242</pub-id></mixed-citation></ref>
<ref id="ref-2"><label>Allentoft <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Allentoft</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Collins</surname> <given-names>M</given-names></string-name>, <string-name><surname>Harker</surname> <given-names>D</given-names></string-name>, <string-name><surname>Haile</surname> <given-names>J</given-names></string-name>, <string-name><surname>Oskam</surname> <given-names>CL</given-names></string-name> <etal>et al.</etal></person-group> (<year>2012</year>). <article-title>The half-life of DNA in bone: Measuring decay kinetics in 158 dated fossils</article-title>. <source>Proceedings of the Royal Society B: Biological Sciences</source> <volume>279</volume>: <fpage>4724</fpage>&#x2013;<lpage>4733</lpage>. <pub-id pub-id-type="doi">10.1098/rspb.2012.1745</pub-id>; <pub-id pub-id-type="pmid">23055061</pub-id></mixed-citation></ref>
<ref id="ref-3"><label>Ausmees <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ausmees</surname> <given-names>K</given-names></string-name>, <string-name><surname>Sanchez-Quinto</surname> <given-names>F</given-names></string-name>, <string-name><surname>Jakobsson</surname> <given-names>M</given-names></string-name>, <string-name><surname>Nettelblad</surname> <given-names>C</given-names></string-name></person-group> (<year>2022</year>). <article-title>An empirical evaluation of genotype imputation of ancient DNA</article-title>. <source>G3 Genes/Genomes/Genetics</source> <volume>12</volume>: <fpage>jkac089</fpage>. <pub-id pub-id-type="doi">10.1093/g3journal/jkac089</pub-id>; <pub-id pub-id-type="pmid">35482488</pub-id></mixed-citation></ref>
<ref id="ref-4"><label>Bai <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Bai</surname> <given-names>WY</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>XW</given-names></string-name>, <string-name><surname>Cong</surname> <given-names>PK</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>XJ</given-names></string-name>, <string-name><surname>Richards</surname> <given-names>JB</given-names></string-name>, <string-name><surname>Zheng</surname> <given-names>HF</given-names></string-name></person-group> (<year>2019</year>). <article-title>Genotype imputation and reference panel: A systematic evaluation on haplotype size and diversity</article-title>. <source>Briefing in Bioinformatics</source><italic>,</italic> <fpage>bbz108</fpage>. <pub-id pub-id-type="doi">10.1093/bib/bbz108</pub-id>; <pub-id pub-id-type="pmid">32002535</pub-id></mixed-citation></ref>
<ref id="ref-5"><label>Biddanda <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Biddanda</surname> <given-names>A</given-names></string-name>, <string-name><surname>Steinr&#x00FC;cken</surname> <given-names>M</given-names></string-name>, <string-name><surname>Novembre</surname> <given-names>J</given-names></string-name></person-group> (<year>2022</year>). <article-title>Properties of 2-locus genealogies and linkage disequilibrium in temporally structured samples</article-title>. <source>Genetics</source> <volume>221</volume>: <fpage>iyac038</fpage>. <pub-id pub-id-type="doi">10.1093/genetics/iyac038</pub-id>; <pub-id pub-id-type="pmid">35294015</pub-id></mixed-citation></ref>
<ref id="ref-6"><label>Browning and Browning (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Browning</surname> <given-names>SR</given-names></string-name>, <string-name><surname>Browning</surname> <given-names>BL</given-names></string-name></person-group> (<year>2007</year>). <article-title>Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering</article-title>. <source>American Journal of Human Genetics</source> <volume>81</volume>: <fpage>1084</fpage>&#x2013;<lpage>1097</lpage>. <pub-id pub-id-type="doi">10.1086/521987</pub-id>; <pub-id pub-id-type="pmid">17924348</pub-id></mixed-citation></ref>
<ref id="ref-7"><label>Browning and Browning (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Browning</surname> <given-names>BL</given-names></string-name>, <string-name><surname>Browning</surname> <given-names>SR</given-names></string-name></person-group> (<year>2016</year>). <article-title>Genotype Imputation with millions of reference samples</article-title>. <source>American Journal of Human Genetics</source> <volume>98</volume>: <fpage>116</fpage>&#x2013;<lpage>126</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2015.11.020</pub-id>; <pub-id pub-id-type="pmid">26748515</pub-id></mixed-citation></ref>
<ref id="ref-8"><label>Browning <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Browning</surname> <given-names>BL</given-names></string-name>, <string-name><surname>Tian</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Browning</surname> <given-names>SR</given-names></string-name></person-group> (<year>2021</year>). <article-title>Fast two-stage phasing of large-scale sequence data</article-title>. <source>American Journal of Human Genetics</source> <volume>108</volume>: <fpage>1880</fpage>&#x2013;<lpage>1890</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2021.08.005</pub-id>; <pub-id pub-id-type="pmid">34478634</pub-id></mixed-citation></ref>
<ref id="ref-9"><label>Browning <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Browning</surname> <given-names>BL</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Browning</surname> <given-names>SR</given-names></string-name></person-group> (<year>2018</year>). <article-title>A one-penny imputed genome from next-generation reference panels</article-title>. <source>American Journal of Human Genetics</source> <volume>103</volume>: <fpage>338</fpage>&#x2013;<lpage>348</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2018.07.015</pub-id>; <pub-id pub-id-type="pmid">30100085</pub-id></mixed-citation></ref>
<ref id="ref-10"><label>Burdick <italic>et al</italic>. (2006)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Burdick</surname> <given-names>JT</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Abecasis</surname> <given-names>GR</given-names></string-name>, <string-name><surname>Cheung</surname> <given-names>VG</given-names></string-name></person-group> (<year>2006</year>). <article-title><italic>In silico</italic> method for inferring genotypes in pedigrees</article-title>. <source>Nature Genetics</source> <volume>38</volume>: <fpage>1002</fpage>&#x2013;<lpage>1004</lpage>. <pub-id pub-id-type="doi">10.1038/ng1863</pub-id>; <pub-id pub-id-type="pmid">16921375</pub-id></mixed-citation></ref>
<ref id="ref-11"><label>Chanda <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chanda</surname> <given-names>P</given-names></string-name>, <string-name><surname>Yuhki</surname> <given-names>N</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bader</surname> <given-names>JS</given-names></string-name>, <string-name><surname>Hartz</surname> <given-names>A</given-names></string-name>, <string-name><surname>Boerwinkle</surname> <given-names>E</given-names></string-name>, <string-name><surname>Kao</surname> <given-names>WH</given-names></string-name>, <string-name><surname>Arking</surname> <given-names>DE</given-names></string-name></person-group> (<year>2012</year>). <article-title>Comprehensive evaluation of imputation performance in African Americans</article-title>. <source>Journal of Human Genetics</source> <volume>57</volume>: <fpage>411</fpage>&#x2013;<lpage>421</lpage>. <pub-id pub-id-type="doi">10.1038/jhg.2012.43</pub-id>; <pub-id pub-id-type="pmid">22648186</pub-id></mixed-citation></ref>
<ref id="ref-12"><label>Chang <italic>et al</italic>. (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chang</surname> <given-names>CC</given-names></string-name>, <string-name><surname>Chow</surname> <given-names>CC</given-names></string-name>, <string-name><surname>Tellier</surname> <given-names>LC</given-names></string-name>, <string-name><surname>Vattikuti</surname> <given-names>S</given-names></string-name>, <string-name><surname>Purcell</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Lee</surname> <given-names>JJ</given-names></string-name></person-group> (<year>2015</year>). <article-title>Second-generation PLINK: Rising to the challenge of larger and richer datasets</article-title>. <source>GigaScience</source> <volume>4</volume>: <fpage>7</fpage>. <pub-id pub-id-type="doi">10.1186/s13742-015-0047-8</pub-id>; <pub-id pub-id-type="pmid">25722852</pub-id></mixed-citation></ref>
<ref id="ref-13"><label>Chen <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Chen</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>B</given-names></string-name>, <string-name><surname>Zeng</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Sanna</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sidore</surname> <given-names>C</given-names></string-name>, <string-name><surname>Busonero</surname> <given-names>F</given-names></string-name>, <string-name><surname>Kang</surname> <given-names>HM</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Abecasis</surname> <given-names>GR</given-names></string-name></person-group> (<year>2013</year>). <article-title>Genotype calling and haplotyping in parent-offspring trios</article-title>. <source>Genome Research</source> <volume>23</volume>: <fpage>142</fpage>&#x2013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1101/gr.142455.112</pub-id>; <pub-id pub-id-type="pmid">23064751</pub-id></mixed-citation></ref>
<ref id="ref-14"><label>Cheung <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cheung</surname> <given-names>CY</given-names></string-name>, <string-name><surname>Thompson</surname> <given-names>EA</given-names></string-name>, <string-name><surname>Wijsman</surname> <given-names>EM</given-names></string-name></person-group> (<year>2013</year>). <article-title>GIGI: An approach to effective imputation of dense genotypes on large pedigrees</article-title>. <source>American Journal of Human Genetics</source> <volume>92</volume>: <fpage>504</fpage>&#x2013;<lpage>516</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2013.02.011</pub-id>; <pub-id pub-id-type="pmid">23561844</pub-id></mixed-citation></ref>
<ref id="ref-15"><label>Choo (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Choo</surname> <given-names>SY</given-names></string-name></person-group> (<year>2007</year>). <article-title>The HLA system: Genetics, immunology, clinical testing, and clinical implications</article-title>. <source>Yonsei Medical Journal</source> <volume>48</volume>: <fpage>11</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.3349/ymj.2007.48.1.11</pub-id>; <pub-id pub-id-type="pmid">17326240</pub-id></mixed-citation></ref>
<ref id="ref-16"><label>Collins and Fink (1995)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Collins</surname> <given-names>FS</given-names></string-name>, <string-name><surname>Fink</surname> <given-names>L</given-names></string-name></person-group> (<year>1995</year>). <article-title>The human genome project</article-title>. <source>Alcohol Health and Research World</source> <volume>19</volume>: <fpage>190</fpage>&#x2013;<lpage>195</lpage>; <pub-id pub-id-type="pmid">31798046</pub-id></mixed-citation></ref>
<ref id="ref-17"><label>Cook <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cook</surname> <given-names>S</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>W</given-names></string-name>, <string-name><surname>Lim</surname> <given-names>H</given-names></string-name>, <string-name><surname>Luo</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Kim</surname> <given-names>K</given-names></string-name>, <string-name><surname>Jia</surname> <given-names>X</given-names></string-name>, <string-name><surname>Raychaudhuri</surname> <given-names>S</given-names></string-name>, <string-name><surname>Han</surname> <given-names>B</given-names></string-name></person-group> (<year>2021</year>). <article-title>Accurate imputation of human leukocyte antigens with CookHLA</article-title>. <source>Nature Communications</source> <volume>12</volume>: <fpage>1264</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-21541-5</pub-id>; <pub-id pub-id-type="pmid">33627654</pub-id></mixed-citation></ref>
<ref id="ref-18"><label>Cunningham <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Cunningham</surname> <given-names>F</given-names></string-name>, <string-name><surname>Allen</surname> <given-names>JE</given-names></string-name>, <string-name><surname>Allen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Alvarez-Jarreta</surname> <given-names>J</given-names></string-name>, <string-name><surname>Amode</surname> <given-names>MR</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Ensembl 2022</article-title>. <source>Nucleic Acids Research</source> <volume>50</volume>: <fpage>D988</fpage>&#x2013;<lpage>D995</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gkab1049</pub-id>; <pub-id pub-id-type="pmid">34791404</pub-id></mixed-citation></ref>
<ref id="ref-19"><label>Das <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Das</surname> <given-names>S</given-names></string-name>, <string-name><surname>Abecasis</surname> <given-names>GR</given-names></string-name>, <string-name><surname>Browning</surname> <given-names>BL</given-names></string-name></person-group> (<year>2018</year>). <article-title>Genotype imputation from large reference panels</article-title>. <source>Annual Review of Genomics and Human Genetics</source> <volume>19</volume>: <fpage>73</fpage>&#x2013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-083117-021602</pub-id>; <pub-id pub-id-type="pmid">29799802</pub-id></mixed-citation></ref>
<ref id="ref-20"><label>Das <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Das</surname> <given-names>S</given-names></string-name>, <string-name><surname>Forer</surname> <given-names>L</given-names></string-name>, <string-name><surname>Sch&#x00F6;nherr</surname> <given-names>S</given-names></string-name>, <string-name><surname>Sidore</surname> <given-names>C</given-names></string-name>, <string-name><surname>Locke</surname> <given-names>AE</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>Next-generation genotype imputation service and methods</article-title>. <source>Nature Genetics</source> <volume>48</volume>: <fpage>1284</fpage>&#x2013;<lpage>1287</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3656</pub-id>; <pub-id pub-id-type="pmid">27571263</pub-id></mixed-citation></ref>
<ref id="ref-21"><label>De Marino <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>de Marino</surname> <given-names>A</given-names></string-name>, <string-name><surname>Mahmoud</surname> <given-names>AA</given-names></string-name>, <string-name><surname>Bose</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bircan</surname> <given-names>KO</given-names></string-name>, <string-name><surname>Terpolovsky</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bamunusinghe</surname> <given-names>V</given-names></string-name>, <string-name><surname>Bohn</surname> <given-names>S</given-names></string-name>, <string-name><surname>Khan</surname> <given-names>U</given-names></string-name>, <string-name><surname>Novkovi&#x0107;</surname> <given-names>B</given-names></string-name>, <string-name><surname>Yazdi</surname> <given-names>PG</given-names></string-name></person-group> (<year>2022</year>). <article-title>A comparative analysis of current phasing and imputation software</article-title>. <source>PLoS One</source> <volume>17</volume>: <fpage>e0260177</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0260177</pub-id>; <pub-id pub-id-type="pmid">36260643</pub-id></mixed-citation></ref>
<ref id="ref-22"><label>Degenhardt <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Degenhardt</surname> <given-names>F</given-names></string-name>, <string-name><surname>Wendorff</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wittig</surname> <given-names>M</given-names></string-name>, <string-name><surname>Ellinghaus</surname> <given-names>E</given-names></string-name>, <string-name><surname>Datta</surname> <given-names>LW</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Construction and benchmarking of a multi-ethnic reference panel for the imputation of HLA class I and II alleles</article-title>. <source>Human Molecular Genetics</source> <volume>28</volume>: <fpage>2078</fpage>&#x2013;<lpage>2092</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddy443</pub-id>; <pub-id pub-id-type="pmid">30590525</pub-id></mixed-citation></ref>
<ref id="ref-23"><label>Delaneau <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Delaneau</surname> <given-names>O</given-names></string-name>, <string-name><surname>Zagury</surname> <given-names>JF</given-names></string-name>, <string-name><surname>Robinson</surname> <given-names>MR</given-names></string-name>, <string-name><surname>Marchini</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Dermitzakis</surname> <given-names>ET</given-names></string-name></person-group> (<year>2019</year>). <article-title>Accurate, scalable and integrative haplotype estimation</article-title>. <source>Nature Communications</source> <volume>10</volume>: <fpage>5436</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-13225-y</pub-id>; <pub-id pub-id-type="pmid">31780650</pub-id></mixed-citation></ref>
<ref id="ref-24"><label>Dilthey <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dilthey</surname> <given-names>A</given-names></string-name>, <string-name><surname>Leslie</surname> <given-names>S</given-names></string-name>, <string-name><surname>Moutsianas</surname> <given-names>L</given-names></string-name>, <string-name><surname>Shen</surname> <given-names>J</given-names></string-name>, <string-name><surname>Cox</surname> <given-names>C</given-names></string-name>, <string-name><surname>Nelson</surname> <given-names>MR</given-names></string-name>, <string-name><surname>McVean</surname> <given-names>G</given-names></string-name></person-group> (<year>2013</year>). <article-title>Multi-population classical HLA type imputation</article-title>. <source>PLoS Computational Bioliogy</source> <volume>9</volume>: <fpage>e1002877</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pcbi.1002877</pub-id>; <pub-id pub-id-type="pmid">23459081</pub-id></mixed-citation></ref>
<ref id="ref-25"><label>Dilthey <italic>et al</italic>. (2011)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dilthey</surname> <given-names>AT</given-names></string-name>, <string-name><surname>Moutsianas</surname> <given-names>L</given-names></string-name>, <string-name><surname>Leslie</surname> <given-names>S</given-names></string-name>, <string-name><surname>McVean</surname> <given-names>G</given-names></string-name></person-group> (<year>2011</year>). <article-title>HLA&#x002A;IMP&#x2014;An integrated framework for imputing classical HLA alleles from SNP genotypes</article-title>. <source>Bioinformatics</source> <volume>27</volume>: <fpage>968</fpage>&#x2013;<lpage>972</lpage>. <pub-id pub-id-type="doi">10.1093/bioinformatics/btr061</pub-id>; <pub-id pub-id-type="pmid">21300701</pub-id></mixed-citation></ref>
<ref id="ref-26"><label>Douillard <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Douillard</surname> <given-names>V</given-names></string-name>, <string-name><surname>Castelli</surname> <given-names>EC</given-names></string-name>, <string-name><surname>Mack</surname> <given-names>SJ</given-names></string-name>, <string-name><surname>Hollenbach</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Gourraud</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Vince</surname> <given-names>N</given-names></string-name>, <string-name><surname>Limou</surname> <given-names>S</given-names></string-name></person-group> (<year>2021</year>). <article-title>Approaching genetics through the MHC lens: Tools and methods for HLA research</article-title>. <source>Frontiers in Genetics</source> <volume>12</volume>: <fpage>774916</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2021.774916</pub-id>; <pub-id pub-id-type="pmid">34925459</pub-id></mixed-citation></ref>
<ref id="ref-27"><label>Dyment <italic>et al</italic>. (2008)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Dyment</surname> <given-names>DA</given-names></string-name>, <string-name><surname>Cader</surname> <given-names>MZ</given-names></string-name>, <string-name><surname>Herrera</surname> <given-names>BM</given-names></string-name>, <string-name><surname>Ramagopalan</surname> <given-names>SV</given-names></string-name>, <string-name><surname>Orton</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Chao</surname> <given-names>M</given-names></string-name>, <string-name><surname>Willer</surname> <given-names>CJ</given-names></string-name>, <string-name><surname>Sadovnick</surname> <given-names>AD</given-names></string-name>, <string-name><surname>Risch</surname> <given-names>N</given-names></string-name>, <string-name><surname>Ebers</surname> <given-names>GC</given-names></string-name></person-group> (<year>2008</year>). <article-title>A genome scan in a single pedigree with a high prevalence of multiple sclerosis</article-title>. <source>Journal of Neurology, Neurosurgery and Psychiatry</source> <volume>79</volume>: <fpage>158</fpage>&#x2013;<lpage>162</lpage>. <pub-id pub-id-type="doi">10.1136/jnnp.2007.122705</pub-id>; <pub-id pub-id-type="pmid">17550985</pub-id></mixed-citation></ref>
<ref id="ref-28"><label>Erlich (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Erlich</surname> <given-names>HA</given-names></string-name></person-group> (<year>2015</year>). <article-title>HLA typing using next generation sequencing: An overview</article-title>. <source>Human Immunology</source> <volume>76</volume>: <fpage>887</fpage>&#x2013;<lpage>890</lpage>. <pub-id pub-id-type="doi">10.1016/j.humimm.2015.03.001</pub-id>; <pub-id pub-id-type="pmid">25777625</pub-id></mixed-citation></ref>
<ref id="ref-29"><label>ESCAPE Trial Group et al. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>ESCAPE Trial Group</collab>, <string-name><surname>W&#x00FC;hl</surname> <given-names>E</given-names></string-name>, <string-name><surname>Trivelli</surname> <given-names>A</given-names></string-name>, <string-name><surname>Picca</surname> <given-names>S</given-names></string-name>, <string-name><surname>Litwin</surname> <given-names>M</given-names></string-name> <etal>et al.</etal></person-group> (<year>2009</year>). <article-title>Strict blood-pressure control and progression of renal failure in children</article-title>. <source>The New England Journal of Medicine</source> <volume>361</volume>: <fpage>1639</fpage>&#x2013;<lpage>1650</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMoa0902066</pub-id>; <pub-id pub-id-type="pmid">19846849</pub-id></mixed-citation></ref>
<ref id="ref-30"><label>Everest <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Everest</surname> <given-names>E</given-names></string-name>, <string-name><surname>Ahangari</surname> <given-names>M</given-names></string-name>, <string-name><surname>Uygunoglu</surname> <given-names>U</given-names></string-name>, <string-name><surname>Tutuncu</surname> <given-names>M</given-names></string-name>, <string-name><surname>Bulbul</surname> <given-names>A</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Investigating the role of common and rare variants in multiplex multiple sclerosis families reveals an increased burden of common risk variation</article-title>. <source>Scientific Reports</source> <volume>12</volume>: <fpage>16984</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-022-21484-x</pub-id>; <pub-id pub-id-type="pmid">36216875</pub-id></mixed-citation></ref>
<ref id="ref-31"><label>Fulker <italic>et al</italic>. (1999)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Fulker</surname> <given-names>DW</given-names></string-name>, <string-name><surname>Cherny</surname> <given-names>SS</given-names></string-name>, <string-name><surname>Sham</surname> <given-names>PC</given-names></string-name>, <string-name><surname>Hewitt</surname> <given-names>JK</given-names></string-name></person-group> (<year>1999</year>). <article-title>Combined linkage and association sib-pair analysis for quantitative traits</article-title>. <source>American Journal of Human Genetics</source> <volume>64</volume>: <fpage>259</fpage>&#x2013;<lpage>267</lpage>. <pub-id pub-id-type="doi">10.1086/302193</pub-id>; <pub-id pub-id-type="pmid">9915965</pub-id></mixed-citation></ref>
<ref id="ref-32"><label>Furth <italic>et al</italic>. (2006)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Furth</surname> <given-names>SL</given-names></string-name>, <string-name><surname>Cole</surname> <given-names>SR</given-names></string-name>, <string-name><surname>Moxey-Mims</surname> <given-names>M</given-names></string-name>, <string-name><surname>Kaskel</surname> <given-names>F</given-names></string-name>, <string-name><surname>Mak</surname> <given-names>R</given-names></string-name>, <string-name><surname>Schwartz</surname> <given-names>G</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>C</given-names></string-name>, <string-name><surname>Mu&#x00F1;oz</surname> <given-names>A</given-names></string-name>, <string-name><surname>Warady</surname> <given-names>BA</given-names></string-name></person-group> (<year>2006</year>). <article-title>Design and methods of the Chronic Kidney Disease in Children (CKiD) prospective cohort study</article-title>. <source>Clinical Journal of the American Society of Nephrology</source> <volume>1</volume>: <fpage>1006</fpage>&#x2013;<lpage>1015</lpage>. <pub-id pub-id-type="doi">10.2215/CJN.01941205</pub-id>; <pub-id pub-id-type="pmid">17699320</pub-id></mixed-citation></ref>
<ref id="ref-33"><label>Gamba <italic>et al</italic>. (2014)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Gamba</surname> <given-names>C</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>ER</given-names></string-name>, <string-name><surname>Teasdale</surname> <given-names>MD</given-names></string-name>, <string-name><surname>McLaughlin</surname> <given-names>RL</given-names></string-name>, <string-name><surname>Gonzalez-Fortes</surname> <given-names>G</given-names></string-name> <etal>et al.</etal></person-group> (<year>2014</year>). <article-title>Genome flux and stasis in a five millennium transect of European prehistory</article-title>. <source>Nature Communications</source> <volume>5</volume>: <fpage>5257</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms6257</pub-id>; <pub-id pub-id-type="pmid">25334030</pub-id></mixed-citation></ref>
<ref id="ref-34"><label>Geibel <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Geibel</surname> <given-names>J</given-names></string-name>, <string-name><surname>Reimer</surname> <given-names>C</given-names></string-name>, <string-name><surname>Pook</surname> <given-names>T</given-names></string-name>, <string-name><surname>Weigend</surname> <given-names>S</given-names></string-name>, <string-name><surname>Weigend</surname> <given-names>A</given-names></string-name>, <string-name><surname>Simianer</surname> <given-names>H</given-names></string-name></person-group> (<year>2021a</year>). <article-title>How imputation can mitigate SNP ascertainment Bias</article-title>. <source>BMC Genomics</source> <volume>22</volume>: <fpage>340</fpage>. <pub-id pub-id-type="doi">10.1186/s12864-021-07663-6</pub-id>; <pub-id pub-id-type="pmid">33980139</pub-id></mixed-citation></ref>
<ref id="ref-35"><label>Geibel <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Geibel</surname> <given-names>J</given-names></string-name>, <string-name><surname>Reimer</surname> <given-names>C</given-names></string-name>, <string-name><surname>Weigend</surname> <given-names>S</given-names></string-name>, <string-name><surname>Weigend</surname> <given-names>A</given-names></string-name>, <string-name><surname>Pook</surname> <given-names>T</given-names></string-name>, <string-name><surname>Simianer</surname> <given-names>H</given-names></string-name></person-group> (<year>2021b</year>). <article-title>How array design creates SNP ascertainment bias</article-title>. <source>PLoS One</source> <volume>16</volume>: <fpage>e0245178</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0245178</pub-id>; <pub-id pub-id-type="pmid">33784304</pub-id></mixed-citation></ref>
<ref id="ref-36"><label>Genome of the Netherlands Consortium (2014)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>Genome of the Netherlands Consortium</collab></person-group> (<year>2014</year>). <article-title>Whole-genome sequence variation, population structure and demographic history of the Dutch population</article-title>. <source>Nature Genetics</source> <volume>46</volume>: <fpage>818</fpage>&#x2013;<lpage>825</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3021</pub-id>; <pub-id pub-id-type="pmid">24974849</pub-id></mixed-citation></ref>
<ref id="ref-37"><label>GenomeAsia100K Consortium (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>GenomeAsia100K Consortium</collab></person-group> (<year>2019</year>). <article-title>The GenomeAsia 100K Project enables genetic discoveries across Asia</article-title>. <source>Nature</source> <volume>576</volume>: <fpage>106</fpage>&#x2013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1038/s41586-019-1793-z</pub-id>; <pub-id pub-id-type="pmid">31802016</pub-id></mixed-citation></ref>
<ref id="ref-38"><label>Genomes Project Consortium et al. (2010)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>Genomes Project Consortium</collab>, <string-name><surname>Abecasis</surname> <given-names>GR</given-names></string-name>, <string-name><surname>Altshuler</surname> <given-names>D</given-names></string-name>, <string-name><surname>Auton</surname> <given-names>A</given-names></string-name>, <string-name><surname>Brooks</surname> <given-names>LD</given-names></string-name>, <string-name><surname>Durbin</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Gibbs</surname> <given-names>RA</given-names></string-name>, <string-name><surname>Hurles</surname> <given-names>ME</given-names></string-name>, <string-name><surname>McVean</surname> <given-names>GA</given-names></string-name></person-group> (<year>2010</year>). <article-title>A map of human genome variation from population-scale sequencing</article-title>. <source>Nature</source> <volume>467</volume>: <fpage>1061</fpage>&#x2013;<lpage>1073</lpage>. <pub-id pub-id-type="doi">10.1038/nature09534</pub-id>; <pub-id pub-id-type="pmid">20981092</pub-id></mixed-citation></ref>
<ref id="ref-39"><label>Genomes Project Consortium et al. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>Genomes Project Consortium</collab>, <string-name><surname>Abecasis</surname> <given-names>GR</given-names></string-name>, <string-name><surname>Auton</surname> <given-names>A</given-names></string-name>, <string-name><surname>Brooks</surname> <given-names>LD</given-names></string-name>, <string-name><surname>DePristo</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Durbin</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Handsaker</surname> <given-names>RE</given-names></string-name>, <string-name><surname>Kang</surname> <given-names>HM</given-names></string-name>, <string-name><surname>Marth</surname> <given-names>GT</given-names></string-name>, <string-name><surname>McVean</surname> <given-names>GA</given-names></string-name></person-group> (<year>2012</year>). <article-title>An integrated map of genetic variation from 1,092 human genomes</article-title>. <source>Nature</source> <volume>491</volume>: <fpage>56</fpage>&#x2013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1038/nature11632</pub-id>; <pub-id pub-id-type="pmid">23128226</pub-id></mixed-citation></ref>
<ref id="ref-40"><label>Genomes Project Consortium et al. (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>Genomes Project Consortium</collab>, <string-name><surname>Auton</surname> <given-names>A</given-names></string-name>, <string-name><surname>Brooks</surname> <given-names>LD</given-names></string-name>, <string-name><surname>Durbin</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Garrison</surname> <given-names>EP</given-names></string-name> <etal>et al.</etal></person-group> (<year>2015</year>). <article-title>A global reference for human genetic variation</article-title>. <source>Nature</source> <volume>526</volume>: <fpage>68</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1038/nature15393</pub-id>; <pub-id pub-id-type="pmid">26432245</pub-id></mixed-citation></ref>
<ref id="ref-41"><label>George and Elston (1987)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>George</surname> <given-names>VT</given-names></string-name>, <string-name><surname>Elston</surname> <given-names>RC</given-names></string-name></person-group> (<year>1987</year>). <article-title>Testing the association between polymorphic markers and quantitative traits in pedigrees</article-title>. <source>Genetic Epidemiology</source> <volume>4</volume>: <fpage>193</fpage>&#x2013;<lpage>201</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.1370040304</pub-id>; <pub-id pub-id-type="pmid">3609719</pub-id></mixed-citation></ref>
<ref id="ref-42"><label>Guillen-Guio <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Guillen-Guio</surname> <given-names>B</given-names></string-name>, <string-name><surname>Lorenzo-Salazar</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Ma</surname> <given-names>SF</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>PC</given-names></string-name>, <string-name><surname>Hernandez-Beeftink</surname> <given-names>T</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Sepsis-associated acute respiratory distress syndrome in individuals of European ancestry: A genome-wide association study</article-title>. <source>The Lancet Respiratory Medicine</source> <volume>8</volume>: <fpage>258</fpage>&#x2013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.1016/S2213-2600(19)30368-6</pub-id>; <pub-id pub-id-type="pmid">31982041</pub-id></mixed-citation></ref>
<ref id="ref-43"><label>Hanks <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hanks</surname> <given-names>SC</given-names></string-name>, <string-name><surname>Forer</surname> <given-names>L</given-names></string-name>, <string-name><surname>Sch&#x00F6;nherr</surname> <given-names>S</given-names></string-name>, <string-name><surname>LeFaive</surname> <given-names>J</given-names></string-name>, <string-name><surname>Martins</surname> <given-names>T</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing</article-title>. <source>American Journal of Human Genetics</source> <volume>109</volume>: <fpage>1653</fpage>&#x2013;<lpage>1666</lpage>. <pub-id pub-id-type="doi">10.1016/j.ajhg.2022.07.012</pub-id>; <pub-id pub-id-type="pmid">35981533</pub-id></mixed-citation></ref>
<ref id="ref-44"><label>Hernandez-Beeftink <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hernandez-Beeftink</surname> <given-names>T</given-names></string-name>, <string-name><surname>Guillen-Guio</surname> <given-names>B</given-names></string-name>, <string-name><surname>Lorenzo-Salazar</surname> <given-names>JM</given-names></string-name>, <string-name><surname>Corrales</surname> <given-names>A</given-names></string-name>, <string-name><surname>Suarez-Pajes</surname> <given-names>E</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>A genome-wide association study of survival in patients with sepsis</article-title>. <source>Critical Care</source> <volume>26</volume>: <fpage>341</fpage>. <pub-id pub-id-type="doi">10.1186/s13054-022-04208-5</pub-id>; <pub-id pub-id-type="pmid">36335405</pub-id></mixed-citation></ref>
<ref id="ref-45"><label>Heslot <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Heslot</surname> <given-names>N</given-names></string-name>, <string-name><surname>Rutkoski</surname> <given-names>J</given-names></string-name>, <string-name><surname>Poland</surname> <given-names>J</given-names></string-name>, <string-name><surname>Jannink</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Sorrells</surname> <given-names>ME</given-names></string-name></person-group> (<year>2013</year>). <article-title>Impact of marker ascertainment bias on genomic selection accuracy and estimates of genetic diversity</article-title>. <source>PLoS One</source> <volume>8</volume>: <fpage>e74612</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0074612</pub-id>; <pub-id pub-id-type="pmid">24040295</pub-id></mixed-citation></ref>
<ref id="ref-46"><label>Homburger <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Homburger</surname> <given-names>JR</given-names></string-name>, <string-name><surname>Neben</surname> <given-names>CL</given-names></string-name>, <string-name><surname>Mishne</surname> <given-names>G</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>AY</given-names></string-name>, <string-name><surname>Kathiresan</surname> <given-names>S</given-names></string-name>, <string-name><surname>Khera</surname> <given-names>AV</given-names></string-name></person-group> (<year>2019</year>). <article-title>Low coverage whole genome sequencing enables accurate assessment of common variants and calculation of genome-wide polygenic scores</article-title>. <source>Genome Medicine</source> <volume>11</volume>: <fpage>74</fpage>. <pub-id pub-id-type="doi">10.1186/s13073-019-0682-2</pub-id>; <pub-id pub-id-type="pmid">31771638</pub-id></mixed-citation></ref>
<ref id="ref-47"><label>Howie <italic>et al</italic>. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Howie</surname> <given-names>BN</given-names></string-name>, <string-name><surname>Donnelly</surname> <given-names>P</given-names></string-name>, <string-name><surname>Marchini</surname> <given-names>J</given-names></string-name></person-group> (<year>2009</year>). <article-title>A flexible and accurate genotype imputation method for the next generation of genome-wide association studies</article-title>. <source>PLoS Genetics</source> <volume>5</volume>: <fpage>e1000529</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1000529</pub-id>; <pub-id pub-id-type="pmid">19543373</pub-id></mixed-citation></ref>
<ref id="ref-48"><label>Hui <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Hui</surname> <given-names>R</given-names></string-name>, <string-name><surname>D&#x2019;Atanasio</surname> <given-names>E</given-names></string-name>, <string-name><surname>Cassidy</surname> <given-names>LM</given-names></string-name>, <string-name><surname>Scheib</surname> <given-names>CL</given-names></string-name>, <string-name><surname>Kivisild</surname> <given-names>T</given-names></string-name></person-group> (<year>2020</year>). <article-title>Evaluating genotype imputation pipeline for ultra-low coverage ancient genomes</article-title>. <source>Scientific Reports</source> <volume>10</volume>: <fpage>18542</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-75387-w</pub-id>; <pub-id pub-id-type="pmid">33122697</pub-id></mixed-citation></ref>
<ref id="ref-49"><label>International HapMap Consortium (2005)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>International HapMap Consortium</collab></person-group> (<year>2005</year>). <article-title>A haplotype map of the human genome</article-title>. <source>Nature</source> <volume>437</volume>: <fpage>1299</fpage>&#x2013;<lpage>1320</lpage>. <pub-id pub-id-type="doi">10.1038/nature04226</pub-id>; <pub-id pub-id-type="pmid">16255080</pub-id></mixed-citation></ref>
<ref id="ref-50"><label>International HapMap Consortium et al. (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>International HapMap Consortium</collab>, <string-name><surname>Frazer</surname> <given-names>KA</given-names></string-name>, <string-name><surname>Ballinger</surname> <given-names>DG</given-names></string-name>, <string-name><surname>Cox</surname> <given-names>DR</given-names></string-name>, <string-name><surname>Hinds</surname> <given-names>DA</given-names></string-name> <etal>et al.</etal></person-group> (<year>2007</year>). <article-title>A second generation human haplotype map of over 3.1 million SNPs</article-title>. <source>Nature</source> <volume>449</volume>: <fpage>851</fpage>&#x2013;<lpage>861</lpage>. <pub-id pub-id-type="doi">10.1038/nature06258</pub-id>; <pub-id pub-id-type="pmid">17943122</pub-id></mixed-citation></ref>
<ref id="ref-51"><label>Jia <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jia</surname> <given-names>X</given-names></string-name>, <string-name><surname>Han</surname> <given-names>B</given-names></string-name>, <string-name><surname>Onengut-Gumuscu</surname> <given-names>S</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Concannon</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Rich</surname> <given-names>SS</given-names></string-name>, <string-name><surname>Raychaudhuri</surname> <given-names>S</given-names></string-name>, <string-name><surname>de Bakker</surname> <given-names>PI</given-names></string-name></person-group> (<year>2013</year>). <article-title>Imputing amino acid polymorphisms in human leukocyte antigens</article-title>. <source>PLoS One</source> <volume>8</volume>: <fpage>e64683</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0064683</pub-id>; <pub-id pub-id-type="pmid">23762245</pub-id></mixed-citation></ref>
<ref id="ref-52"><label>Jiang <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jiang</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Song</surname> <given-names>H</given-names></string-name>, <string-name><surname>Gao</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Q</given-names></string-name>, <string-name><surname>Ding</surname> <given-names>X</given-names></string-name></person-group> (<year>2022</year>). <article-title>Exploring the optimal strategy of imputation from SNP array to whole-genome sequencing data in farm animals</article-title>. <source>Frontiers in Genetics</source> <volume>13</volume>: <fpage>963654</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2022.963654</pub-id>; <pub-id pub-id-type="pmid">36092888</pub-id></mixed-citation></ref>
<ref id="ref-53"><label>Jim&#x00E9;nez-Kaufmann <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Jim&#x00E9;nez-Kaufmann</surname> <given-names>A</given-names></string-name>, <string-name><surname>Chong</surname> <given-names>AY</given-names></string-name>, <string-name><surname>Cort&#x00E9;s</surname> <given-names>A</given-names></string-name>, <string-name><surname>Quinto-Cort&#x00E9;s</surname> <given-names>CD</given-names></string-name>, <string-name><surname>Fernandez-Valverde</surname> <given-names>SL</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Imputation performance in Latin American populations: Improving rare variants representation with the inclusion of native American genomes</article-title>. <source>Frontiers in Genetics</source> <volume>12</volume>: <fpage>719791</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2021.719791</pub-id>; <pub-id pub-id-type="pmid">35046991</pub-id></mixed-citation></ref>
<ref id="ref-54"><label>Johnson <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Johnson</surname> <given-names>EO</given-names></string-name>, <string-name><surname>Hancock</surname> <given-names>DB</given-names></string-name>, <string-name><surname>Levy</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Gaddis</surname> <given-names>NC</given-names></string-name>, <string-name><surname>Saccone</surname> <given-names>NL</given-names></string-name>, <string-name><surname>Bierut</surname> <given-names>LJ</given-names></string-name>, <string-name><surname>Page</surname> <given-names>GP</given-names></string-name></person-group> (<year>2013</year>). <article-title>Imputation across genotyping arrays for genome-wide association studies: Assessment of bias and a correction strategy</article-title>. <source>Human Genetics</source> <volume>132</volume>: <fpage>509</fpage>&#x2013;<lpage>522</lpage>. <pub-id pub-id-type="doi">10.1007/s00439-013-1266-7</pub-id>; <pub-id pub-id-type="pmid">23334152</pub-id></mixed-citation></ref>
<ref id="ref-55"><label>J&#x00F8;rsboe and Albrechtsen (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>J&#x00F8;rsboe</surname> <given-names>E</given-names></string-name>, <string-name><surname>Albrechtsen</surname> <given-names>A</given-names></string-name></person-group> (<year>2022</year>). <article-title>Efficient approaches for large-scale GWAS with genotype uncertainty</article-title>. <source>G3 Genes/Genomes/Genetics</source> <volume>12</volume>: <fpage>jkab385</fpage>. <pub-id pub-id-type="doi">10.1093/g3journal/jkab385</pub-id>; <pub-id pub-id-type="pmid">34865001</pub-id></mixed-citation></ref>
<ref id="ref-56"><label>Kars <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kars</surname> <given-names>ME</given-names></string-name>, <string-name><surname>Ba&#x015F;ak</surname> <given-names>AN</given-names></string-name>, <string-name><surname>Onat</surname> <given-names>OE</given-names></string-name>, <string-name><surname>Bilguvar</surname> <given-names>K</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>J</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>The genetic structure of the Turkish population reveals high levels of variation and admixture</article-title>. <source>Proceedings of the National Academy of Sciences of the United States of America</source> <volume>118</volume>: <fpage>e2026076118</fpage>. <pub-id pub-id-type="doi">10.1073/pnas.2026076118</pub-id>; <pub-id pub-id-type="pmid">34426522</pub-id></mixed-citation></ref>
<ref id="ref-57"><label>Kember <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kember</surname> <given-names>RL</given-names></string-name>, <string-name><surname>Vickers-Smith</surname> <given-names>R</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Toikumo</surname> <given-names>S</given-names></string-name>, <string-name><surname>Niarchou</surname> <given-names>M</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Cross-ancestry meta-analysis of opioid use disorder uncovers novel loci with predominant effects in brain regions associated with addiction</article-title>. <source>Nature Neuroscience</source> <volume>25</volume>: <fpage>1279</fpage>&#x2013;<lpage>1287</lpage>. <pub-id pub-id-type="doi">10.1038/s41593-022-01160-z</pub-id>; <pub-id pub-id-type="pmid">36171425</pub-id></mixed-citation></ref>
<ref id="ref-58"><label>Kennedy <italic>et al</italic>. (2017)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kennedy</surname> <given-names>AE</given-names></string-name>, <string-name><surname>Ozbek</surname> <given-names>U</given-names></string-name>, <string-name><surname>Dorak</surname> <given-names>MT</given-names></string-name></person-group> (<year>2017</year>). <article-title>What has GWAS done for HLA and disease associations?</article-title> <source>International Journal of Immunogenetics</source> <volume>44</volume>: <fpage>195</fpage>&#x2013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1111/iji.12332</pub-id>; <pub-id pub-id-type="pmid">28877428</pub-id></mixed-citation></ref>
<ref id="ref-59"><label>Khankhanian <italic>et al</italic>. (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Khankhanian</surname> <given-names>P</given-names></string-name>, <string-name><surname>Din</surname> <given-names>L</given-names></string-name>, <string-name><surname>Caillier</surname> <given-names>SJ</given-names></string-name>, <string-name><surname>Gourraud</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Baranzini</surname> <given-names>SE</given-names></string-name></person-group> (<year>2015</year>). <article-title>SNP imputation bias reduces effect size determination</article-title>. <source>Frontiers in Genetics</source> <volume>6</volume>: <fpage>30</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2015.00030</pub-id>; <pub-id pub-id-type="pmid">25709616</pub-id></mixed-citation></ref>
<ref id="ref-60"><label>Korku&#x0107; <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Korku&#x0107;</surname> <given-names>P</given-names></string-name>, <string-name><surname>Arends</surname> <given-names>D</given-names></string-name>, <string-name><surname>Brockmann</surname> <given-names>GA</given-names></string-name></person-group> (<year>2019</year>). <article-title>Finding the optimal imputation strategy for small cattle populations</article-title>. <source>Frontiers in Genetics</source> <volume>10</volume>: <fpage>52</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2019.00052</pub-id>; <pub-id pub-id-type="pmid">30833959</pub-id></mixed-citation></ref>
<ref id="ref-61"><label>Kowalski <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kowalski</surname> <given-names>MH</given-names></string-name>, <string-name><surname>Qian</surname> <given-names>H</given-names></string-name>, <string-name><surname>Hou</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Rosen</surname> <given-names>JD</given-names></string-name>, <string-name><surname>Tapia</surname> <given-names>AL</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Use of &#x003E;100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations</article-title>. <source>PLoS Genetics</source> <volume>15</volume>: <fpage>e1008500</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1008500</pub-id>; <pub-id pub-id-type="pmid">31869403</pub-id></mixed-citation></ref>
<ref id="ref-62"><label>Kreiner-M&#x00F8;ller <italic>et al</italic>. (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Kreiner-M&#x00F8;ller</surname> <given-names>E</given-names></string-name>, <string-name><surname>Medina-Gomez</surname> <given-names>C</given-names></string-name>, <string-name><surname>Uitterlinden</surname> <given-names>AG</given-names></string-name>, <string-name><surname>Rivadeneira</surname> <given-names>F</given-names></string-name>, <string-name><surname>Estrada</surname> <given-names>K</given-names></string-name></person-group> (<year>2015</year>). <article-title>Improving accuracy of rare variant imputation with a two-step imputation approach</article-title>. <source>European Journal of Human Genetics</source> <volume>23</volume>: <fpage>395</fpage>&#x2013;<lpage>400</lpage>. <pub-id pub-id-type="doi">10.1038/ejhg.2014.91</pub-id>; <pub-id pub-id-type="pmid">24939589</pub-id></mixed-citation></ref>
<ref id="ref-63"><label>Lent <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lent</surname> <given-names>S</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Cupples</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Lunetta</surname> <given-names>KL</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>CT</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name></person-group> (<year>2016</year>). <article-title>Imputing rare variants in families using a two-stage approach</article-title>. <source>BMC Proceedings</source> <volume>10</volume>: <fpage>209</fpage>&#x2013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.1186/s12919-016-0032-y</pub-id>; <pub-id pub-id-type="pmid">27980638</pub-id></mixed-citation></ref>
<ref id="ref-64"><label>Li <italic>et al</italic>. (2006)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <string-name><surname>Atmaca-Sonmez</surname> <given-names>P</given-names></string-name>, <string-name><surname>Othman</surname> <given-names>M</given-names></string-name>, <string-name><surname>Branham</surname> <given-names>KE</given-names></string-name>, <string-name><surname>Khanna</surname> <given-names>R</given-names></string-name> <etal>et al.</etal></person-group> (<year>2006</year>). <article-title>CFH haplotypes without the Y402H coding variant show strong association with susceptibility to age-related macular degeneration</article-title>. <source>Nature Genetics</source> <volume>38</volume>: <fpage>1049</fpage>&#x2013;<lpage>1054</lpage>. <pub-id pub-id-type="doi">10.1038/ng1871</pub-id>; <pub-id pub-id-type="pmid">16936733</pub-id></mixed-citation></ref>
<ref id="ref-65"><label>Li <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>L</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>P</given-names></string-name>, <string-name><surname>Sun</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>M</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>The ChinaMAP reference panel for the accurate genotype imputation in Chinese populations</article-title>. <source>Cell Research</source> <volume>31</volume>: <fpage>1308</fpage>&#x2013;<lpage>1310</lpage>. <pub-id pub-id-type="doi">10.1038/s41422-021-00564-z</pub-id>; <pub-id pub-id-type="pmid">34489580</pub-id></mixed-citation></ref>
<ref id="ref-66"><label>Li <italic>et al</italic>. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Willer</surname> <given-names>C</given-names></string-name>, <string-name><surname>Sanna</surname> <given-names>S</given-names></string-name>, <string-name><surname>Abecasis</surname> <given-names>G</given-names></string-name></person-group> (<year>2009</year>). <article-title>Genotype imputation</article-title>. <source>Annual Review of Genomics and Human Genetics</source> <volume>10</volume>: <fpage>387</fpage>&#x2013;<lpage>406</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.genom.9.081307.164242</pub-id>; <pub-id pub-id-type="pmid">19715440</pub-id></mixed-citation></ref>
<ref id="ref-67"><label>Lin <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lin</surname> <given-names>M</given-names></string-name>, <string-name><surname>Caberto</surname> <given-names>C</given-names></string-name>, <string-name><surname>Wan</surname> <given-names>P</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Lum-Jones</surname> <given-names>A</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Population-specific reference panels are crucial for genetic analyses: an example of the CREBRF locus in Native Hawaiians</article-title>. <source>Human Molecular Genetics</source> <volume>29</volume>: <fpage>2275</fpage>&#x2013;<lpage>2284</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddaa083</pub-id>; <pub-id pub-id-type="pmid">32491157</pub-id></mixed-citation></ref>
<ref id="ref-68"><label>Lin <italic>et al</italic>. (2010)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lin</surname> <given-names>P</given-names></string-name>, <string-name><surname>Hartz</surname> <given-names>SM</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Saccone</surname> <given-names>SF</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name> <etal>et al.</etal></person-group> (<year>2010</year>). <article-title>A new statistic to evaluate imputation reliability</article-title>. <source>PLoS One</source> <volume>5</volume>: <fpage>e9697</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0009697</pub-id>; <pub-id pub-id-type="pmid">20300623</pub-id></mixed-citation></ref>
<ref id="ref-69"><label>Lin <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lin</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Liu</surname> <given-names>L</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>S</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Lin</surname> <given-names>D</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>X</given-names></string-name>, <string-name><surname>Yin</surname> <given-names>X</given-names></string-name></person-group> (<year>2018</year>). <article-title>Genotype imputation for Han Chinese population using Haplotype Reference Consortium as reference</article-title>. <source>Human Genetics</source> <volume>137</volume>: <fpage>431</fpage>&#x2013;<lpage>436</lpage>. <pub-id pub-id-type="doi">10.1007/s00439-018-1894-z</pub-id>; <pub-id pub-id-type="pmid">29855708</pub-id></mixed-citation></ref>
<ref id="ref-70"><label>Liu <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>CT</given-names></string-name>, <string-name><surname>Deng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Fisher</surname> <given-names>V</given-names></string-name>, <string-name><surname>Heard-Costa</surname> <given-names>N</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>H</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Vasan</surname> <given-names>RS</given-names></string-name>, <string-name><surname>Cupples</surname> <given-names>LA</given-names></string-name></person-group> (<year>2019</year>). <article-title>Revisit population-based and family-based genotype imputation</article-title>. <source>Scientific Reports</source> <volume>9</volume>: <fpage>1800</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-018-38469-4</pub-id>; <pub-id pub-id-type="pmid">30755687</pub-id></mixed-citation></ref>
<ref id="ref-71"><label>Liu <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Liu</surname> <given-names>EY</given-names></string-name>, <string-name><surname>Li</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name></person-group> (<year>2013</year>). <article-title>MaCH-admix: Genotype imputation for admixed populations</article-title>. <source>Genetic Epidemiology</source> <volume>37</volume>: <fpage>25</fpage>&#x2013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.21690</pub-id>; <pub-id pub-id-type="pmid">23074066</pub-id></mixed-citation></ref>
<ref id="ref-72"><label>Loh <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Loh</surname> <given-names>PR</given-names></string-name>, <string-name><surname>Danecek</surname> <given-names>P</given-names></string-name>, <string-name><surname>Palamara</surname> <given-names>PF</given-names></string-name>, <string-name><surname>Fuchsberger</surname> <given-names>C</given-names></string-name>, <string-name><surname>Reshef</surname> <given-names>A</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>Reference-based phasing using the Haplotype Reference Consortium panel</article-title>. <source>Nature Genetics</source> <volume>48</volume>: <fpage>1443</fpage>&#x2013;<lpage>1448</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3679</pub-id>; <pub-id pub-id-type="pmid">27694958</pub-id></mixed-citation></ref>
<ref id="ref-73"><label>Loh <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Loh</surname> <given-names>PR</given-names></string-name>, <string-name><surname>Palamara</surname> <given-names>PF</given-names></string-name>, <string-name><surname>Price</surname> <given-names>AL</given-names></string-name></person-group> (<year>2016</year>). <article-title>Fast and accurate long-range phasing in a UK Biobank cohort</article-title>. <source>Nature Genetics</source> <volume>48</volume>: <fpage>811</fpage>&#x2013;<lpage>816</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3571</pub-id>; <pub-id pub-id-type="pmid">27270109</pub-id></mixed-citation></ref>
<ref id="ref-74"><label>Lu <italic>et al</italic>. (2010)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lu</surname> <given-names>K</given-names></string-name>, <string-name><surname>Jiang</surname> <given-names>L</given-names></string-name>, <string-name><surname>Tsiatis</surname> <given-names>AA</given-names></string-name></person-group> (<year>2010</year>). <article-title>Multiple imputation approaches for the analysis of dichotomized responses in longitudinal studies with missing data</article-title>. <source>Biometrics</source> <volume>66</volume>: <fpage>1202</fpage>&#x2013;<lpage>1208</lpage>. <pub-id pub-id-type="doi">10.1111/j.1541-0420.2010.01405.x</pub-id>; <pub-id pub-id-type="pmid">20337628</pub-id></mixed-citation></ref>
<ref id="ref-75"><label>Luo <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Luo</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Kanai</surname> <given-names>M</given-names></string-name>, <string-name><surname>Choi</surname> <given-names>W</given-names></string-name>, <string-name><surname>Li</surname> <given-names>X</given-names></string-name>, <string-name><surname>Sakaue</surname> <given-names>S</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response</article-title>. <source>Nature Genetics</source> <volume>53</volume>: <fpage>1504</fpage>&#x2013;<lpage>1516</lpage>. <pub-id pub-id-type="doi">10.1038/s41588-021-00935-7</pub-id>; <pub-id pub-id-type="pmid">34611364</pub-id></mixed-citation></ref>
<ref id="ref-76"><label>Lyons <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Lyons</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Peters</surname> <given-names>JE</given-names></string-name>, <string-name><surname>Alberici</surname> <given-names>F</given-names></string-name>, <string-name><surname>Liley</surname> <given-names>J</given-names></string-name>, <string-name><surname>Coulson</surname> <given-names>RMR</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Genome-wide association study of eosinophilic granulomatosis with polyangiitis reveals genomic loci stratified by ANCA status</article-title>. <source>Nature Communications</source> <volume>10</volume>: <fpage>5120</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-019-12515-9</pub-id>; <pub-id pub-id-type="pmid">31719529</pub-id></mixed-citation></ref>
<ref id="ref-77"><label>Malerba <italic>et al</italic>. (2000)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Malerba</surname> <given-names>G</given-names></string-name>, <string-name><surname>Lauciello</surname> <given-names>MC</given-names></string-name>, <string-name><surname>Scherpbier</surname> <given-names>T</given-names></string-name>, <string-name><surname>Trabetti</surname> <given-names>E</given-names></string-name>, <string-name><surname>Galavotti</surname> <given-names>R</given-names></string-name> <etal>et al.</etal></person-group> (<year>2000</year>). <article-title>Linkage analysis of chromosome 12 markers in Italian families with atopic asthmatic children</article-title>. <source>American Journal of Respiratory and Critical Care Medicine</source> <volume>162</volume>: <fpage>1587</fpage>&#x2013;<lpage>1590</lpage>. <pub-id pub-id-type="doi">10.1164/ajrccm.162.4.9909031</pub-id>; <pub-id pub-id-type="pmid">11029380</pub-id></mixed-citation></ref>
<ref id="ref-78"><label>Marchani <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Marchani</surname> <given-names>EE</given-names></string-name>, <string-name><surname>Chapman</surname> <given-names>NH</given-names></string-name>, <string-name><surname>Cheung</surname> <given-names>CY</given-names></string-name>, <string-name><surname>Ankenman</surname> <given-names>K</given-names></string-name>, <string-name><surname>Stanaway</surname> <given-names>IB</given-names></string-name>, <string-name><surname>Coon</surname> <given-names>HH</given-names></string-name>, <string-name><surname>Nickerson</surname> <given-names>D</given-names></string-name>, <string-name><surname>Bernier</surname> <given-names>R</given-names></string-name>, <string-name><surname>Brkanac</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wijsman</surname> <given-names>EM</given-names></string-name></person-group> (<year>2012</year>). <article-title>Identification of rare variants from exome sequence in a large pedigree with autism</article-title>. <source>Human Heredity</source> <volume>74</volume>: <fpage>153</fpage>&#x2013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1159/000346560</pub-id>; <pub-id pub-id-type="pmid">23594493</pub-id></mixed-citation></ref>
<ref id="ref-79"><label>Marchini and Howie (2010)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Marchini</surname> <given-names>J</given-names></string-name>, <string-name><surname>Howie</surname> <given-names>B</given-names></string-name></person-group> (<year>2010</year>). <article-title>Genotype imputation for genome-wide association studies</article-title>. <source>Nature Reviews Genetics</source> <volume>11</volume>: <fpage>499</fpage>&#x2013;<lpage>511</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2796</pub-id>; <pub-id pub-id-type="pmid">20517342</pub-id></mixed-citation></ref>
<ref id="ref-80"><label>Marchini <italic>et al</italic>. (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Marchini</surname> <given-names>J</given-names></string-name>, <string-name><surname>Howie</surname> <given-names>B</given-names></string-name>, <string-name><surname>Myers</surname> <given-names>S</given-names></string-name>, <string-name><surname>McVean</surname> <given-names>G</given-names></string-name>, <string-name><surname>Donnelly</surname> <given-names>P</given-names></string-name></person-group> (<year>2007</year>). <article-title>A new multipoint method for genome-wide association studies by imputation of genotypes</article-title>. <source>Nature Genetics</source> <volume>39</volume>: <fpage>906</fpage>&#x2013;<lpage>913</lpage>. <pub-id pub-id-type="doi">10.1038/ng2088</pub-id>; <pub-id pub-id-type="pmid">17572673</pub-id></mixed-citation></ref>
<ref id="ref-81"><label>Martiniano <italic>et al</italic>. (2017)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Martiniano</surname> <given-names>R</given-names></string-name>, <string-name><surname>Cassidy</surname> <given-names>LM</given-names></string-name>, <string-name><surname>&#x00D3;&#x2019;Maold&#x00FA;in</surname> <given-names>R</given-names></string-name>, <string-name><surname>McLaughlin</surname> <given-names>R</given-names></string-name>, <string-name><surname>Silva</surname> <given-names>NM</given-names></string-name> <etal>et al.</etal></person-group> (<year>2017</year>). <article-title>The population genomics of archaeological transition in west Iberia: Investigation of ancient substructure using imputation and haplotype-based methods</article-title>. <source>PLoS Genetics</source> <volume>13</volume>: <fpage>e1006852</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1006852</pub-id>; <pub-id pub-id-type="pmid">28749934</pub-id></mixed-citation></ref>
<ref id="ref-82"><label>Mathias <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mathias</surname> <given-names>RA</given-names></string-name>, <string-name><surname>Taub</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Gignoux</surname> <given-names>CR</given-names></string-name>, <string-name><surname>Fu</surname> <given-names>W</given-names></string-name>, <string-name><surname>Musharoff</surname> <given-names>S</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>A continuum of admixture in the Western Hemisphere revealed by the African Diaspora genome</article-title>. <source>Nature Communications</source> <volume>7</volume>: <fpage>12522</fpage>. <pub-id pub-id-type="doi">10.1038/ncomms12522</pub-id>; <pub-id pub-id-type="pmid">27725671</pub-id></mixed-citation></ref>
<ref id="ref-83"><label>McBride <italic>et al</italic>. (1982)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>McBride</surname> <given-names>OW</given-names></string-name>, <string-name><surname>Battey</surname> <given-names>J</given-names></string-name>, <string-name><surname>Hollis</surname> <given-names>GF</given-names></string-name>, <string-name><surname>Swan</surname> <given-names>DC</given-names></string-name>, <string-name><surname>Siebenlist</surname> <given-names>U</given-names></string-name>, <string-name><surname>Leder</surname> <given-names>P</given-names></string-name></person-group> (<year>1982a</year>). <article-title>Localization of human variable and constant region immunoglobulin heavy chain genes on subtelomeric band q32 of chromosome 14</article-title>. <source>Nucleic Acids Research</source> <volume>10</volume>: <fpage>8155</fpage>&#x2013;<lpage>8170</lpage>. <pub-id pub-id-type="doi">10.1093/nar/10.24.8155</pub-id>; <pub-id pub-id-type="pmid">6819544</pub-id></mixed-citation></ref>
<ref id="ref-84"><label>McBride <italic>et al</italic>. (1982)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>McBride</surname> <given-names>OW</given-names></string-name>, <string-name><surname>Hieter</surname> <given-names>PA</given-names></string-name>, <string-name><surname>Hollis</surname> <given-names>GF</given-names></string-name>, <string-name><surname>Swan</surname> <given-names>D</given-names></string-name>, <string-name><surname>Otey</surname> <given-names>MC</given-names></string-name>, <string-name><surname>Leder</surname> <given-names>P</given-names></string-name></person-group> (<year>1982b</year>). <article-title>Chromosomal location of human kappa and lambda immunoglobulin light chain constant region genes</article-title>. <source>The Journal of Experimental Medicine</source> <volume>155</volume>: <fpage>1480</fpage>&#x2013;<lpage>1490</lpage>. <pub-id pub-id-type="doi">10.1084/jem.155.5.1480</pub-id>; <pub-id pub-id-type="pmid">6802926</pub-id></mixed-citation></ref>
<ref id="ref-85"><label>McCarthy <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>McCarthy</surname> <given-names>S</given-names></string-name>, <string-name><surname>Das</surname> <given-names>S</given-names></string-name>, <string-name><surname>Kretzschmar</surname> <given-names>W</given-names></string-name>, <string-name><surname>Delaneau</surname> <given-names>O</given-names></string-name>, <string-name><surname>Wood</surname> <given-names>AR</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>A reference panel of 64,976 haplotypes for genotype imputation</article-title>. <source>Nature Genetics</source> <volume>48</volume>: <fpage>1279</fpage>&#x2013;<lpage>1283</lpage>. <pub-id pub-id-type="doi">10.1038/ng.3643</pub-id>; <pub-id pub-id-type="pmid">27548312</pub-id></mixed-citation></ref>
<ref id="ref-86"><label>Meyer and Nunes (2017)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Meyer</surname> <given-names>D</given-names></string-name>, <string-name><surname>Nunes</surname> <given-names>K</given-names></string-name></person-group> (<year>2017</year>). <article-title>HLA imputation, what is it good for?</article-title> <source>Human Immunology</source> <volume>78</volume>: <fpage>239</fpage>&#x2013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1016/j.humimm.2017.02.007</pub-id>; <pub-id pub-id-type="pmid">28317600</pub-id></mixed-citation></ref>
<ref id="ref-87"><label>Mijatovic <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Mijatovic</surname> <given-names>V</given-names></string-name>, <string-name><surname>Iacobucci</surname> <given-names>I</given-names></string-name>, <string-name><surname>Sazzini</surname> <given-names>M</given-names></string-name>, <string-name><surname>Xumerle</surname> <given-names>L</given-names></string-name>, <string-name><surname>Mori</surname> <given-names>A</given-names></string-name>, <string-name><surname>Pignatti</surname> <given-names>PF</given-names></string-name>, <string-name><surname>Martinelli</surname> <given-names>G</given-names></string-name>, <string-name><surname>Malerba</surname> <given-names>G</given-names></string-name></person-group> (<year>2012</year>). <article-title>Imputation reliability on DNA biallelic markers for drug metabolism studies</article-title>. <source>BMC Bioinformatics</source> <volume>13</volume>: <fpage>S7</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2105-13-S14-S7</pub-id>; <pub-id pub-id-type="pmid">23095502</pub-id></mixed-citation></ref>
<ref id="ref-88"><label>Nair <italic>et al</italic>. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nair</surname> <given-names>RP</given-names></string-name>, <string-name><surname>Duffin</surname> <given-names>KC</given-names></string-name>, <string-name><surname>Helms</surname> <given-names>C</given-names></string-name>, <string-name><surname>Ding</surname> <given-names>J</given-names></string-name>, <string-name><surname>Stuart</surname> <given-names>PE</given-names></string-name> <etal>et al.</etal></person-group> (<year>2009</year>). <article-title>Genome-wide scan reveals association of psoriasis with IL-23 and NF-&#x03BA;B pathways</article-title>. <source>Collaborative Association Study of Psoriasis</source> <volume>41</volume>: <fpage>199</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1038/ng.311</pub-id>; <pub-id pub-id-type="pmid">19169254</pub-id></mixed-citation></ref>
<ref id="ref-89"><label>Naito and Okada (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Naito</surname> <given-names>T</given-names></string-name>, <string-name><surname>Okada</surname> <given-names>Y</given-names></string-name></person-group> (<year>2022</year>). <article-title>HLA imputation and its application to genetic and molecular fine-mapping of the MHC region in autoimmune diseases</article-title>. <source>Seminars in Immunopathology</source> <volume>44</volume>: <fpage>15</fpage>&#x2013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1007/s00281-021-00901-9</pub-id>; <pub-id pub-id-type="pmid">34786601</pub-id></mixed-citation></ref>
<ref id="ref-90"><label>Nettelblad (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Nettelblad</surname> <given-names>C</given-names></string-name></person-group> (<year>2012</year>). <article-title>Inferring haplotypes and parental genotypes in larger full sib-ships and other pedigrees with missing or erroneous genotype data</article-title>. <source>BMC Genetics</source> <volume>13</volume>: <fpage>85</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2156-13-85</pub-id>; <pub-id pub-id-type="pmid">23046532</pub-id></mixed-citation></ref>
<ref id="ref-91"><label>Newton-Cheh <italic>et al</italic>. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Newton-Cheh</surname> <given-names>C</given-names></string-name>, <string-name><surname>Johnson</surname> <given-names>T</given-names></string-name>, <string-name><surname>Gateva</surname> <given-names>V</given-names></string-name>, <string-name><surname>Tobin</surname> <given-names>MD</given-names></string-name>, <string-name><surname>Bochud</surname> <given-names>M</given-names></string-name> <etal>et al.</etal></person-group> (<year>2009</year>). <article-title>Genome-wide association study identifies eight loci associated with blood pressure</article-title>. <source>Nature Genetics</source> <volume>41</volume>: <fpage>666</fpage>&#x2013;<lpage>676</lpage>. <pub-id pub-id-type="doi">10.1038/ng.361</pub-id>; <pub-id pub-id-type="pmid">19430483</pub-id></mixed-citation></ref>
<ref id="ref-92"><label>Orho-Melander <italic>et al</italic>. (2008)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Orho-Melander</surname> <given-names>M</given-names></string-name>, <string-name><surname>Melander</surname> <given-names>O</given-names></string-name>, <string-name><surname>Guiducci</surname> <given-names>C</given-names></string-name>, <string-name><surname>Perez-Martinez</surname> <given-names>P</given-names></string-name>, <string-name><surname>Corella</surname> <given-names>D</given-names></string-name> <etal>et al.</etal></person-group> (<year>2008</year>). <article-title>Common missense variant in the glucokinase regulatory protein gene is associated with increased plasma triglyceride and C-reactive protein but lower fasting glucose concentrations</article-title>. <source>Diabetes</source> <volume>57</volume>: <fpage>3112</fpage>&#x2013;<lpage>3121</lpage>. <pub-id pub-id-type="doi">10.2337/db08-0516</pub-id>; <pub-id pub-id-type="pmid">18678614</pub-id></mixed-citation></ref>
<ref id="ref-93"><label>O&#x2019;Connell <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>O&#x2019;Connell</surname> <given-names>J</given-names></string-name>, <string-name><surname>Yun</surname> <given-names>T</given-names></string-name>, <string-name><surname>Moreno</surname> <given-names>M</given-names></string-name>, <string-name><surname>Li</surname> <given-names>H</given-names></string-name>, <string-name><surname>Litterman</surname> <given-names>N</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>A population-specific reference panel for improved genotype imputation in African Americans</article-title>. <source>Communications Biology</source> <volume>4</volume>: <fpage>1269</fpage>. <pub-id pub-id-type="doi">10.1038/s42003-021-02777-9</pub-id>; <pub-id pub-id-type="pmid">34741098</pub-id></mixed-citation></ref>
<ref id="ref-94"><label>Pasaniuc <italic>et al</italic>. (2012)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Pasaniuc</surname> <given-names>B</given-names></string-name>, <string-name><surname>Rohland</surname> <given-names>N</given-names></string-name>, <string-name><surname>McLaren</surname> <given-names>PJ</given-names></string-name>, <string-name><surname>Garimella</surname> <given-names>K</given-names></string-name>, <string-name><surname>Zaitlen</surname> <given-names>N</given-names></string-name> <etal>et al.</etal></person-group> (<year>2012</year>). <article-title>Extremely low-coverage sequencing and imputation increases power for genome-wide association studies</article-title>. <source>Nature Genetics</source> <volume>44</volume>: <fpage>631</fpage>&#x2013;<lpage>635</lpage>. <pub-id pub-id-type="doi">10.1038/ng.2283</pub-id>; <pub-id pub-id-type="pmid">22610117</pub-id></mixed-citation></ref>
<ref id="ref-95"><label>Pilia <italic>et al</italic>. (2006)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Pilia</surname> <given-names>G</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Scuteri</surname> <given-names>A</given-names></string-name>, <string-name><surname>Orru</surname> <given-names>M</given-names></string-name>, <string-name><surname>Albai</surname> <given-names>Dei</given-names></string-name> <etal>et al.</etal></person-group> (<year>2006</year>). <article-title>Heritability of cardiovascular and personality traits in 6,148 Sardinians</article-title>. <source>PLoS Genetics</source> <volume>2</volume>: <fpage>e132</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.0020132</pub-id>; <pub-id pub-id-type="pmid">16934002</pub-id></mixed-citation></ref>
<ref id="ref-96"><label>Purcell <italic>et al</italic>. (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Purcell</surname> <given-names>S</given-names></string-name>, <string-name><surname>Neale</surname> <given-names>B</given-names></string-name>, <string-name><surname>Todd-Brown</surname> <given-names>K</given-names></string-name>, <string-name><surname>Thomas</surname> <given-names>L</given-names></string-name>, <string-name><surname>Ferreira</surname> <given-names>MA</given-names></string-name> <etal>et al.</etal></person-group> (<year>2007</year>). <article-title>PLINK: A tool set for whole-genome association and population-based linkage analyses</article-title>. <source>American Journal of Human Genetics</source> <volume>81</volume>: <fpage>559</fpage>&#x2013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.1086/519795</pub-id>; <pub-id pub-id-type="pmid">17701901</pub-id></mixed-citation></ref>
<ref id="ref-97"><label>Querfeld <italic>et al</italic>. (2010)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Querfeld</surname> <given-names>U</given-names></string-name>, <string-name><surname>Anarat</surname> <given-names>A</given-names></string-name>, <string-name><surname>Bayazit</surname> <given-names>AK</given-names></string-name>, <string-name><surname>Bakkaloglu</surname> <given-names>AS</given-names></string-name>, <string-name><surname>Bilginer</surname> <given-names>Y</given-names></string-name> <etal>et al.</etal></person-group> (<year>2010</year>). <article-title>The cardiovascular comorbidity in children with chronic kidney disease (4C) study: Objectives, design, and methodology</article-title>. <source>Clinical Journal of the American Society of Nephrology</source> <volume>5</volume>: <fpage>1642</fpage>&#x2013;<lpage>1648</lpage>. <pub-id pub-id-type="doi">10.2215/CJN.08791209</pub-id>; <pub-id pub-id-type="pmid">20576824</pub-id></mixed-citation></ref>
<ref id="ref-98"><label>Quick <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Quick</surname> <given-names>C</given-names></string-name>, <string-name><surname>Anugu</surname> <given-names>P</given-names></string-name>, <string-name><surname>Musani</surname> <given-names>S</given-names></string-name>, <string-name><surname>Weiss</surname> <given-names>ST</given-names></string-name>, <string-name><surname>Burchard</surname> <given-names>EG</given-names></string-name> <etal>et al.</etal></person-group> (<year>2020</year>). <article-title>Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations</article-title>. <source>Genetic Epidemiology</source> <volume>44</volume>: <fpage>537</fpage>&#x2013;<lpage>549</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22326</pub-id>; <pub-id pub-id-type="pmid">32519380</pub-id></mixed-citation></ref>
<ref id="ref-99"><label>Ramnarine <italic>et al</italic>. (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ramnarine</surname> <given-names>S</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>LS</given-names></string-name>, <string-name><surname>Culverhouse</surname> <given-names>R</given-names></string-name>, <string-name><surname>Duan</surname> <given-names>W</given-names></string-name> <etal>et al.</etal></person-group> (<year>2015</year>). <article-title>When does choice of accuracy measure alter imputation accuracy assessments?</article-title> <source>PLoS One</source> <volume>10</volume>: <fpage>e0137601</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0137601</pub-id>; <pub-id pub-id-type="pmid">26458263</pub-id></mixed-citation></ref>
<ref id="ref-100"><label>Razali <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Razali</surname> <given-names>RM</given-names></string-name>, <string-name><surname>Rodriguez-Flores</surname> <given-names>J</given-names></string-name>, <string-name><surname>Ghorbani</surname> <given-names>M</given-names></string-name>, <string-name><surname>Naeem</surname> <given-names>H</given-names></string-name>, <string-name><surname>Aamer</surname> <given-names>W</given-names></string-name> <etal>et al.</etal></person-group> (<year>2021</year>). <article-title>Thousands of Qatari genomes inform human migration history and improve imputation of Arab haplotypes</article-title>. <source>Nature Communications</source> <volume>12</volume>: <fpage>5929</fpage>. <pub-id pub-id-type="doi">10.1038/s41467-021-25287-y</pub-id>; <pub-id pub-id-type="pmid">34642339</pub-id></mixed-citation></ref>
<ref id="ref-101"><label>Reilly <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Reilly</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>F</given-names></string-name>, <string-name><surname>Jones</surname> <given-names>TK</given-names></string-name>, <string-name><surname>Palakshappa</surname> <given-names>JA</given-names></string-name>, <string-name><surname>Anderson</surname> <given-names>BJ</given-names></string-name> <etal>et al.</etal></person-group> (<year>2018</year>). <article-title>Plasma angiopoietin-2 as a potential causal marker in sepsis-associated ARDS development: Evidence from Mendelian randomization and mediation analysis</article-title>. <source>Intensive Care Medicine</source> <volume>44</volume>: <fpage>1849</fpage>&#x2013;<lpage>1858</lpage>. <pub-id pub-id-type="doi">10.1007/s00134-018-5328-0</pub-id>; <pub-id pub-id-type="pmid">30343317</pub-id></mixed-citation></ref>
<ref id="ref-102"><label>Robinson <italic>et al</italic>. (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Robinson</surname> <given-names>J</given-names></string-name>, <string-name><surname>Halliwell</surname> <given-names>JA</given-names></string-name>, <string-name><surname>McWilliam</surname> <given-names>H</given-names></string-name>, <string-name><surname>Lopez</surname> <given-names>R</given-names></string-name>, <string-name><surname>Parham</surname> <given-names>P</given-names></string-name>, <string-name><surname>Marsh</surname> <given-names>SG</given-names></string-name></person-group> (<year>2013</year>). <article-title>The IMGT/HLA database</article-title>. <source>Nucleic Acids Research</source> <volume>41</volume>: <fpage>D1222</fpage>&#x2013;<lpage>D1227</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gks949</pub-id>; <pub-id pub-id-type="pmid">23080122</pub-id></mixed-citation></ref>
<ref id="ref-103"><label>Roshyara <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Roshyara</surname> <given-names>NR</given-names></string-name>, <string-name><surname>Horn</surname> <given-names>K</given-names></string-name>, <string-name><surname>Kirsten</surname> <given-names>H</given-names></string-name>, <string-name><surname>Ahnert</surname> <given-names>P</given-names></string-name>, <string-name><surname>Scholz</surname> <given-names>M</given-names></string-name></person-group> (<year>2016</year>). <article-title>Comparing performance of modern genotype imputation methods in different ethnicities</article-title>. <source>Scientific Reports</source> <volume>6</volume>: <fpage>34386</fpage>. <pub-id pub-id-type="doi">10.1038/srep34386</pub-id>; <pub-id pub-id-type="pmid">27698363</pub-id></mixed-citation></ref>
<ref id="ref-104"><label>Roshyara and Scholz (2015)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Roshyara</surname> <given-names>NR</given-names></string-name>, <string-name><surname>Scholz</surname> <given-names>M</given-names></string-name></person-group> (<year>2015</year>). <article-title>Impact of genetic similarity on imputation accuracy</article-title>. <source>BMC Genetics</source> <volume>16</volume>: <fpage>90</fpage>. <pub-id pub-id-type="doi">10.1186/s12863-015-0248-2</pub-id>; <pub-id pub-id-type="pmid">26193934</pub-id></mixed-citation></ref>
<ref id="ref-105"><label>Rubinacci <italic>et al</italic>. (2020)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Rubinacci</surname> <given-names>S</given-names></string-name>, <string-name><surname>Delaneau</surname> <given-names>O</given-names></string-name>, <string-name><surname>Marchini</surname> <given-names>J</given-names></string-name></person-group> (<year>2020</year>). <article-title>Genotype imputation using the Positional Burrows Wheeler Transform</article-title>. <source>PLoS Genetics</source> <volume>16</volume>: <fpage>e1009049</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1009049</pub-id>; <pub-id pub-id-type="pmid">33196638</pub-id></mixed-citation></ref>
<ref id="ref-106"><label>Saad and Wijsman (2014)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Saad</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wijsman</surname> <given-names>EM</given-names></string-name></person-group> (<year>2014a</year>). <article-title>Power of family-based association designs to detect rare variants in large pedigrees using imputed genotypes</article-title>. <source>Genetic Epidemiology</source> <volume>38</volume>: <fpage>1</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.21776</pub-id>; <pub-id pub-id-type="pmid">24243664</pub-id></mixed-citation></ref>
<ref id="ref-107"><label>Saad and Wijsman (2014)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Saad</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wijsman</surname> <given-names>EM</given-names></string-name></person-group> (<year>2014b</year>). <article-title>Combining family- and population-based imputation data for association analysis of rare and common variants in large pedigrees</article-title>. <source>Genetic Epidemiology</source> <volume>38</volume>: <fpage>579</fpage>&#x2013;<lpage>590</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.21844</pub-id>; <pub-id pub-id-type="pmid">25132070</pub-id></mixed-citation></ref>
<ref id="ref-108"><label>Sargolzaei <italic>et al</italic>. (2014)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sargolzaei</surname> <given-names>M</given-names></string-name>, <string-name><surname>Chesnais</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Schenkel</surname> <given-names>FS</given-names></string-name></person-group> (<year>2014</year>). <article-title>A new approach for efficient genotype imputation using information from relatives</article-title>. <source>BMC Genomics</source> <volume>15</volume>: <fpage>478</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2164-15-478</pub-id>; <pub-id pub-id-type="pmid">24935670</pub-id></mixed-citation></ref>
<ref id="ref-109"><label>Sazonovs and Barrett (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Sazonovs</surname> <given-names>A</given-names></string-name>, <string-name><surname>Barrett</surname> <given-names>JC</given-names></string-name></person-group> (<year>2018</year>). <article-title>Rare-variant studies to complement genome-wide association studies</article-title>. <source>Annual Review of Genomics and Human Genetics</source> <volume>19</volume>: <fpage>97</fpage>&#x2013;<lpage>112</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-083117-021641</pub-id>; <pub-id pub-id-type="pmid">29801418</pub-id></mixed-citation></ref>
<ref id="ref-110"><label>Scott <italic>et al</italic>. (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Scott</surname> <given-names>LJ</given-names></string-name>, <string-name><surname>Mohlke</surname> <given-names>KL</given-names></string-name>, <string-name><surname>Bonnycastle</surname> <given-names>LL</given-names></string-name>, <string-name><surname>Willer</surname> <given-names>CJ</given-names></string-name>, <string-name><surname>Li</surname> <given-names>Y</given-names></string-name> <etal>et al.</etal></person-group> (<year>2007</year>). <article-title>A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants</article-title>. <source>Science</source> <volume>316</volume>: <fpage>1341</fpage>&#x2013;<lpage>1345</lpage>. <pub-id pub-id-type="doi">10.1126/science.1142382</pub-id>; <pub-id pub-id-type="pmid">17463248</pub-id></mixed-citation></ref>
<ref id="ref-111"><label>Scuteri <italic>et al</italic>. (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Scuteri</surname> <given-names>A</given-names></string-name>, <string-name><surname>Sanna</surname> <given-names>S</given-names></string-name>, <string-name><surname>Chen</surname> <given-names>WM</given-names></string-name>, <string-name><surname>Uda</surname> <given-names>M</given-names></string-name>, <string-name><surname>Albai</surname> <given-names>G</given-names></string-name> <etal>et al.</etal></person-group> (<year>2007</year>). <article-title>Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits</article-title>. <source>PLoS Genetics</source> <volume>3</volume>: <fpage>e115</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.0030115</pub-id>; <pub-id pub-id-type="pmid">17658951</pub-id></mixed-citation></ref>
<ref id="ref-112"><label>Shi <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Shi</surname> <given-names>S</given-names></string-name>, <string-name><surname>Yuan</surname> <given-names>N</given-names></string-name>, <string-name><surname>Yang</surname> <given-names>M</given-names></string-name>, <string-name><surname>Du</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>J</given-names></string-name>, <string-name><surname>Sheng</surname> <given-names>X</given-names></string-name>, <string-name><surname>Wu</surname> <given-names>J</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>J</given-names></string-name></person-group> (<year>2018</year>). <article-title>Comprehensive assessment of genotype imputation performance</article-title>. <source>Human Heredity</source> <volume>83</volume>: <fpage>107</fpage>&#x2013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1159/000489758</pub-id>; <pub-id pub-id-type="pmid">30669139</pub-id></mixed-citation></ref>
<ref id="ref-113"><label>Song <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Song</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wheeler</surname> <given-names>W</given-names></string-name>, <string-name><surname>Caporaso</surname> <given-names>NE</given-names></string-name>, <string-name><surname>Landi</surname> <given-names>MT</given-names></string-name>, <string-name><surname>Chatterjee</surname> <given-names>N</given-names></string-name></person-group> (<year>2018</year>). <article-title>Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies</article-title>. <source>Genetic Epidemiology</source> <volume>42</volume>: <fpage>146</fpage>&#x2013;<lpage>155</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22093</pub-id>; <pub-id pub-id-type="pmid">29178451</pub-id></mixed-citation></ref>
<ref id="ref-114"><label>Spencer <italic>et al</italic>. (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Spencer</surname> <given-names>CCA</given-names></string-name>, <string-name><surname>Su</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Donnelly</surname> <given-names>P</given-names></string-name>, <string-name><surname>Marchini</surname> <given-names>J</given-names></string-name></person-group> (<year>2009</year>). <article-title>Designing genome-wide association studies: Sample size, power, imputation, and the choice of genotyping chip</article-title>. <source>PLoS Genetics</source> <volume>5</volume>: <fpage>e1000477</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pgen.1000477</pub-id>; <pub-id pub-id-type="pmid">19492015</pub-id></mixed-citation></ref>
<ref id="ref-115"><label>Stefani <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Stefani</surname> <given-names>C</given-names></string-name>, <string-name><surname>Sangalli</surname> <given-names>A</given-names></string-name>, <string-name><surname>Locatelli</surname> <given-names>E</given-names></string-name>, <string-name><surname>Federico</surname> <given-names>T</given-names></string-name>, <string-name><surname>Malerba</surname> <given-names>G</given-names></string-name> <etal>et al.</etal></person-group> (<year>2022</year>). <article-title>Increased prevalence of HLA-C unstable variants in HIV-1 rapid progressor patients</article-title>. <source>International Journal of Molecular Sciences</source> <volume>23</volume>: <fpage>14852</fpage>. <pub-id pub-id-type="doi">10.3390/ijms232314852</pub-id>; <pub-id pub-id-type="pmid">36499177</pub-id></mixed-citation></ref>
<ref id="ref-116"><label>Stephens and Balding (2009)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Stephens</surname> <given-names>M</given-names></string-name>, <string-name><surname>Balding</surname> <given-names>DJ</given-names></string-name></person-group> (<year>2009</year>). <article-title>Bayesian statistical methods for genetic association studies</article-title>. <source>Nature Reviews Genetics</source> <volume>10</volume>: <fpage>681</fpage>&#x2013;<lpage>690</lpage>. <pub-id pub-id-type="doi">10.1038/nrg2615</pub-id>; <pub-id pub-id-type="pmid">19763151</pub-id></mixed-citation></ref>
<ref id="ref-117"><label>Stephens and Donnelly (2000)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Stephens</surname> <given-names>M</given-names></string-name>, <string-name><surname>Donnelly</surname> <given-names>P</given-names></string-name></person-group> (<year>2000</year>). <article-title>Inference in molecular population genetics</article-title>. <source>Journal of the Royal Statistical Society Series B: Statistical Methodology</source> <volume>62</volume>: <fpage>605</fpage>&#x2013;<lpage>635</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9868.00254</pub-id></mixed-citation></ref>
<ref id="ref-118"><label>Tan <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Tan</surname> <given-names>AG</given-names></string-name>, <string-name><surname>Kifley</surname> <given-names>A</given-names></string-name>, <string-name><surname>Flood</surname> <given-names>VM</given-names></string-name>, <string-name><surname>Holliday</surname> <given-names>EG</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>RJ</given-names></string-name>, <string-name><surname>Cumming</surname> <given-names>RG</given-names></string-name>, <string-name><surname>Mitchell</surname> <given-names>P</given-names></string-name>, <string-name><surname>Wang</surname> <given-names>JJ</given-names></string-name></person-group> (<year>2019</year>). <article-title>Evaluating the associations between obesity and age-related cataract: A Mendelian randomization study</article-title>. <source>The American Journal of Clinical Nutrition</source> <volume>110</volume>: <fpage>969</fpage>&#x2013;<lpage>976</lpage>. <pub-id pub-id-type="doi">10.1093/ajcn/nqz167</pub-id>; <pub-id pub-id-type="pmid">31401654</pub-id></mixed-citation></ref>
<ref id="ref-119"><label>Trowsdale and Knight (2013)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Trowsdale</surname> <given-names>J</given-names></string-name>, <string-name><surname>Knight</surname> <given-names>JC</given-names></string-name></person-group> (<year>2013</year>). <article-title>Major histocompatibility complex genomics and human disease</article-title>. <source>Annual Review of Genomics and Human Genetics</source> <volume>14</volume>: <fpage>301</fpage>&#x2013;<lpage>323</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-genom-091212-153455</pub-id>; <pub-id pub-id-type="pmid">23875801</pub-id></mixed-citation></ref>
<ref id="ref-120"><label>Turner (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Turner</surname> <given-names>SD</given-names></string-name></person-group> (<year>2018</year>). <article-title>qqman: an R package for visualising GWAS results using Q-Q and manhattan plots</article-title>. <source>Journal of Open Source Software</source> <volume>3</volume>: <fpage>731</fpage>. <pub-id pub-id-type="doi">10.21105/joss.00731</pub-id></mixed-citation></ref>
<ref id="ref-121"><label>Uffelmann <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Uffelmann</surname> <given-names>E</given-names></string-name>, <string-name><surname>Huang</surname> <given-names>QQ</given-names></string-name>, <string-name><surname>Munung</surname> <given-names>NS</given-names></string-name>, <string-name><surname>de Vries</surname> <given-names>J</given-names></string-name>, <string-name><surname>Okada</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Martin</surname> <given-names>AR</given-names></string-name>, <string-name><surname>Martin</surname> <given-names>HC</given-names></string-name>, <string-name><surname>Lappalainen</surname> <given-names>T</given-names></string-name>, <string-name><surname>Posthuma</surname> <given-names>D</given-names></string-name></person-group> (<year>2021</year>). <article-title>Genome-wide association studies</article-title>. <source>Nature Reviews Methods Primers</source> <volume>1</volume>: <fpage>59</fpage>. <pub-id pub-id-type="doi">10.1038/s43586-021-00056-9</pub-id></mixed-citation></ref>
<ref id="ref-122"><label>Ullah <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Ullah</surname> <given-names>E</given-names></string-name>, <string-name><surname>Mall</surname> <given-names>R</given-names></string-name>, <string-name><surname>Abbas</surname> <given-names>MM</given-names></string-name>, <string-name><surname>Kunji</surname> <given-names>K</given-names></string-name>, <string-name><surname>Nato</surname> <given-names>AQ</given-names></string-name>, <string-name><surname>Bensmail</surname> <given-names>H</given-names></string-name>, <string-name><surname>Wijsman</surname> <given-names>EM</given-names></string-name>, <string-name><surname>Saad</surname> <given-names>M</given-names></string-name></person-group> (<year>2019</year>). <article-title>Comparison and assessment of family- and population-based genotype imputation methods in large pedigrees</article-title>. <source>Genome Research</source> <volume>29</volume>: <fpage>125</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1101/gr.236315.118</pub-id>; <pub-id pub-id-type="pmid">30514702</pub-id></mixed-citation></ref>
<ref id="ref-123"><label>Vergara <italic>et al</italic>. (2018)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Vergara</surname> <given-names>C</given-names></string-name>, <string-name><surname>Parker</surname> <given-names>MM</given-names></string-name>, <string-name><surname>Franco</surname> <given-names>L</given-names></string-name>, <string-name><surname>Cho</surname> <given-names>MH</given-names></string-name>, <string-name><surname>Valencia-Duarte</surname> <given-names>AV</given-names></string-name>, <string-name><surname>Beaty</surname> <given-names>TH</given-names></string-name>, <string-name><surname>Duggal</surname> <given-names>P</given-names></string-name></person-group> (<year>2018</year>). <article-title>Genotype imputation performance of three reference panels using African ancestry individuals</article-title>. <source>Human Genetics</source> <volume>137</volume>: <fpage>281</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1007/s00439-018-1881-4</pub-id>; <pub-id pub-id-type="pmid">29637265</pub-id></mixed-citation></ref>
<ref id="ref-124"><label>Villalba-Mouco <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Villalba-Mouco</surname> <given-names>V</given-names></string-name>, <string-name><surname>van de Loosdrecht</surname> <given-names>MS</given-names></string-name>, <string-name><surname>Posth</surname> <given-names>C</given-names></string-name>, <string-name><surname>Mora</surname> <given-names>R</given-names></string-name>, <string-name><surname>Mart&#x00ED;nez-Moreno</surname> <given-names>J</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Survival of late pleistocene hunter-gatherer ancestry in the Iberian Peninsula</article-title>. <source>Current Biology</source> <volume>29</volume>: <fpage>1169</fpage>&#x2013;<lpage>1177.e7</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2019.02.006</pub-id>; <pub-id pub-id-type="pmid">30880015</pub-id></mixed-citation></ref>
<ref id="ref-125"><label>Vissers <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Vissers</surname> <given-names>LET</given-names></string-name>, <string-name><surname>Sluijs</surname> <given-names>I</given-names></string-name>, <string-name><surname>van der Schouw</surname> <given-names>YT</given-names></string-name>, <string-name><surname>Forouhi</surname> <given-names>NG</given-names></string-name>, <string-name><surname>Imamura</surname> <given-names>F</given-names></string-name> <etal>et al.</etal></person-group> (<year>2019</year>). <article-title>Dairy product intake and risk of type 2 diabetes in EPIC-InterAct: A mendelian randomization study</article-title>. <source>Diabetes Care</source> <volume>42</volume>: <fpage>568</fpage>&#x2013;<lpage>575</lpage>. <pub-id pub-id-type="doi">10.2337/dc18-2034</pub-id>; <pub-id pub-id-type="pmid">30728219</pub-id></mixed-citation></ref>
<ref id="ref-126"><label>Wellcome Trust Case Control Consortium (2007)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><collab>Wellcome Trust Case Control Consortium</collab></person-group> (<year>2007</year>). <article-title>Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls</article-title>. <source>Nature</source> <volume>447</volume>: <fpage>661</fpage>&#x2013;<lpage>678</lpage>. <pub-id pub-id-type="doi">10.1038/nature05911</pub-id>; <pub-id pub-id-type="pmid">17554300</pub-id></mixed-citation></ref>
<ref id="ref-127"><label>Whalen <italic>et al</italic>. (2019)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Whalen</surname> <given-names>A</given-names></string-name>, <string-name><surname>Gorjanc</surname> <given-names>G</given-names></string-name>, <string-name><surname>Hickey</surname> <given-names>JM</given-names></string-name></person-group> (<year>2019</year>). <article-title>Family-specific genotype arrays increase the accuracy of pedigree-based imputation at very low marker densities</article-title>. <source>Genetics Selection Evolution</source> <volume>51</volume>: <fpage>33</fpage>. <pub-id pub-id-type="doi">10.1186/s12711-019-0478-2</pub-id>; <pub-id pub-id-type="pmid">31242856</pub-id></mixed-citation></ref>
<ref id="ref-128"><label>Wuttke <italic>et al</italic>. (2016)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Wuttke</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wong</surname> <given-names>CS</given-names></string-name>, <string-name><surname>W&#x00FC;hl</surname> <given-names>E</given-names></string-name>, <string-name><surname>Epting</surname> <given-names>D</given-names></string-name>, <string-name><surname>Luo</surname> <given-names>L</given-names></string-name> <etal>et al.</etal></person-group> (<year>2016</year>). <article-title>Genetic loci associated with renal function measures and chronic kidney disease in children: The pediatric investigation for genetic factors linked with renal progression consortium</article-title>. <source>Nephrology Dialysis Transplantation</source> <volume>31</volume>: <fpage>262</fpage>&#x2013;<lpage>269</lpage>. <pub-id pub-id-type="doi">10.1093/ndt/gfv342</pub-id>; <pub-id pub-id-type="pmid">26420894</pub-id></mixed-citation></ref>
<ref id="ref-129"><label>Yu <italic>et al</italic>. (2022)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Yu</surname> <given-names>WY</given-names></string-name>, <string-name><surname>Yan</surname> <given-names>SS</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>SH</given-names></string-name>, <string-name><surname>Ni</surname> <given-names>JJ</given-names></string-name>, <string-name><surname>Li</surname> <given-names>B</given-names></string-name>, <string-name><surname>Pei</surname> <given-names>YF</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>L</given-names></string-name></person-group> (<year>2022</year>). <article-title>Efficient identification of trait-associated loss-of-function variants in the UK Biobank cohort by exome-sequencing based genotype imputation</article-title>. <source>Genetic Epidemiology</source> <volume>47</volume>: <fpage>121</fpage>&#x2013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1002/gepi.22511</pub-id>; <pub-id pub-id-type="pmid">36490288</pub-id></mixed-citation></ref>
<ref id="ref-130"><label>Zeggini <italic>et al</italic>. (2008)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zeggini</surname> <given-names>E</given-names></string-name>, <string-name><surname>Scott</surname> <given-names>LJ</given-names></string-name>, <string-name><surname>Saxena</surname> <given-names>R</given-names></string-name>, <string-name><surname>Voight</surname> <given-names>BF</given-names></string-name>, <string-name><surname>Marchini</surname> <given-names>JL</given-names></string-name> <etal>et al.</etal></person-group> (<year>2008</year>). <article-title>Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes</article-title>. <source>Nature Genetics</source> <volume>40</volume>: <fpage>638</fpage>&#x2013;<lpage>645</lpage>. <pub-id pub-id-type="doi">10.1038/ng.120</pub-id>; <pub-id pub-id-type="pmid">18372903</pub-id></mixed-citation></ref>
<ref id="ref-131"><label>Zhang <italic>et al</italic>. (2021)</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><string-name><surname>Zhang</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Xiao</surname> <given-names>X</given-names></string-name>, <string-name><surname>Zhou</surname> <given-names>W</given-names></string-name>, <string-name><surname>Zhu</surname> <given-names>D</given-names></string-name>, <string-name><surname>Amos</surname> <given-names>CI</given-names></string-name></person-group> (<year>2021</year>). <article-title>False positive findings during genome-wide association studies with imputation: Influence of allele frequency and imputation accuracy</article-title>. <source>Human Molecular Genetics</source> <volume>31</volume>: <fpage>146</fpage>&#x2013;<lpage>155</lpage>. <pub-id pub-id-type="doi">10.1093/hmg/ddab203</pub-id>; <pub-id pub-id-type="pmid">34368847</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>