Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
NON-HUMAN ANIMALS COMPRISING HUMANIZED ACE2 AND TMPRSS LOCI
Document Type and Number:
WIPO Patent Application WO/2023/122506
Kind Code:
A1
Abstract:
Non-human animal cells and non-human animals comprising a humanized ACE2 locus and a humanized TMPRSS locus, and methods of using such non-human animal cells and non-human animals are provided. Non-human animal cells or non-human animals comprising a humanized ACE2 locus and a humanized TMPRSS locus express a human ACE2 protein or a chimeric ACE2 protein, fragments of which are from human ACE2; and a human TMPRSS or chimeric TMPRSS protein, fragments of which are from human TMPRSS. Methods are also provided for using such non-human animals comprising a humanized ACE2 locus and a humanized TMPRSS locus to assess in vivo ACE2 activity, e.g., coronavirus infection and/or the treatment or prevention thereof.

Inventors:
BRYDGES SUSANNAH (US)
KYRATSOUS CHRISTOS (US)
BAUM ALINA (US)
Application Number:
PCT/US2022/081833
Publication Date:
June 29, 2023
Filing Date:
December 16, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
REGENERON PHARMA (US)
International Classes:
A01K67/027; C07K14/705; C12N9/48
Domestic Patent References:
WO2017151453A12017-09-08
WO2021242815A12021-12-02
WO2013176772A12013-11-28
WO2014033644A22014-03-06
WO2014089290A12014-06-12
WO1999005266A21999-02-04
WO2008017234A12008-02-14
WO2016081923A22016-05-26
Foreign References:
US20110020722A12011-01-27
US8586713B22013-11-19
US8697851B22014-04-15
US20130312129A12013-11-21
US20110104799A12011-05-05
US10070631B22018-09-11
US10070632B22018-09-11
US20140235933A12014-08-21
US20050144655A12005-06-30
US20130309670A12013-11-21
US20150159175A12015-06-11
US7294754B22007-11-13
US20080092249A12008-04-17
US20040177390A12004-09-09
US7612250B22009-11-03
US20140178879A12014-06-26
US20160145646A12016-05-26
US7576259B22009-08-18
US7659442B22010-02-09
US20080007800A12008-01-10
US6586251B22003-07-01
Other References:
RIVA LAURA ET AL: "Discovery of SARS-CoV-2 antiviral drugs through large-scale compound repurposing", NATURE, NATURE PUBLISHING GROUP UK, LONDON, vol. 586, no. 7827, 24 July 2020 (2020-07-24), pages 113 - 119, XP037258306, ISSN: 0028-0836, [retrieved on 20200724], DOI: 10.1038/S41586-020-2577-1
MANCEK-KEBER MATEJA ET AL: "Disruption of disulfides within RBD of SARS-CoV-2 spike protein prevents fusion and represents a target for viral entry inhibition by registered drugs", THE FASEB JOURNAL, vol. 35, no. 6, 18 May 2021 (2021-05-18), US, XP055844245, ISSN: 0892-6638, Retrieved from the Internet DOI: 10.1096/fj.202100560R
SOLDATOV VLADISLAV O. ET AL: "On the way from SARS-CoV-sensitive mice to murine COVID-19 model", RESEARCH RESULTS IN PHARMACOLOGY, vol. 6, no. 2, 4 May 2020 (2020-05-04), pages 1 - 7, XP055863283, Retrieved from the Internet DOI: 10.3897/rrpharmacology.6.53633
JARNAGIN KURT ET AL: "Animal models for SARS-Cov2/Covid19 research-A commentary", BIOCHEMICAL PHARMACOLOGY, vol. 188, 1 June 2021 (2021-06-01), US, pages 114543, XP093039356, ISSN: 0006-2952, DOI: 10.1016/j.bcp.2021.114543
KISHIMOTO MAI ET AL: "TMPRSS11D and TMPRSS13 Activate the SARS-CoV-2 Spike Protein", VIRUSES, vol. 13, no. 3, 28 February 2021 (2021-02-28), pages 384, XP093039548, Retrieved from the Internet DOI: 10.3390/v13030384
ZANG RUOCHEN ET AL: "TMPRSS2 and TMPRSS4 promote SARS-CoV-2 infection of human small intestinal enterocytes", SCIENCE IMMUNOLOGY, vol. 5, no. 47, 19 May 2020 (2020-05-19), pages 3582, XP093017750, Retrieved from the Internet DOI: 10.1126/sciimmunol.abc3582
NAKAMURA ET AL., NUCLEIC ACIDS RESEARCH, vol. 28, 2000, pages 292
KASPAREKHUMPHREY, SEMINARS IN CELL & DEV. BIOL., vol. 22, 2011, pages 886 - 897
MARESCA ET AL., GENOME RES, vol. 23, no. 3, 2013, pages 539 - 546
WANG ET AL., CELL, vol. 153, 2013, pages 910 - 918
MANDALOS ET AL., PLOS ONE, vol. 7, no. 45768, 2012, pages 1 - 9
WANG ET AL., NAT BIOTECHNOL., 2013
"Methods in Molecular Biology", vol. 66, article "Epitope Mapping Protocols"
SUBBARAOROBERTS, TRENDS MICROBIOL., vol. 14, 2006, pages 299 - 303
MCCRAY ET AL., J. VIROL, vol. 81, 2007, pages 813 - 821
WAN, J. VIROL, vol. 94, 2020, pages 1 - 9
SUN ET AL., CELL HOST & MICROBE, vol. 28, 2020, pages 1 - 10
"NCBI", Database accession no. NP-001123985.1
"UniProt Accession", Database accession no. Q9BYF1-1
"UniProt", Database accession no. Q8R0I0-1
BUGGE ET AL., J. BIOL. CHEM, vol. 284, no. 35, 2009, pages 23177 - 23181
SZABO ET AL., INT. J. BIOCHEM. CELL BIOL, vol. 40, 2008, pages 1297 - 1316
HOFFMAN ET AL., CELL, vol. 181, 2020, pages 271 - 80
MATSUYAMA ET AL., PROC. NATL. ACAD. SCI. USA, vol. 17, 2020, pages 7001 - 3
J. BIOL. CHEM., 2020
FESTING ET AL., MAMMALIAN GENOME, vol. 10, 1999, pages 836
CHO ET AL., CURRENT PROTOCOLS IN CELL BIOLOGY, vol. 42, no. 19, 2009
GAMA SOSA ET AL., BRAIN STRUCT. FUNCT, vol. 214, no. 2-3, 2010, pages 91 - 109
FRENDEWEY ET AL., METHODS ENZYMOL., vol. 476, 2010, pages 295 - 307
POUEYMIROU ET AL.: "F0 generation mice that are essentially fully derived from the donor gene-targeted ES cells allowing immediate phenotypic analyses", NATURE BIOTECH, vol. 25, no. 1, 2007, pages 91 - 99
VALENZUELA ET AL.: "High-throughput engineering of the mouse genome coupled with high-resolution expression analysis", NATURE BIOTECH., vol. 21, no. 6, 2003, pages 652 - 659
Attorney, Agent or Firm:
WU, Rita S. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. A non-human animal, non-human animal cell, or non-human animal genome comprising:

(I) a humanized ACE2 gene encoding a recombinant ACE2 protein, wherein the recombinant ACE2 protein comprises in operable linkage:

(a) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein,

(b) an extracellular domain substantially identical to an extracellular domain of a human ACE2 protein,

(c) a transmembrane domain of anon-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and

(d) a cytoplasmic domain of a non-human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein, and

(II) one or more distinct and humanized TMPRSS genes, each respectively encoding one or more distinct recombinant TMPRSS proteins that comprises in operable linkage:

(a) a TMPRSS signal sequence of a non-human animal TMPRSS protein or TMPRSS signal sequence of a human TMPRSS protein,

(b) an extracellular domain substantially identical to the extracellular domain of the human TMPRSS protein, and

(c) a transmembrane domain of the non-human animal TMPRSS protein or a transmembrane domain of the human TMPRSS protein, and

(d) a cytoplasmic domain of the non-human TMPRSS protein or a cytoplasmic domain of the human TMPRSS protein, wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous.

83

2. The non-human animal, non-human animal cell, or non-human animal genome of claim 1, wherein therecombinant ACE2 protein comprises in operable linkage:

(a) an ACE2 signal sequence of an endogenous non-human animal ACE2 protein,

(b) an extracellular domain substantially identical to the extracellular domain of a human ACE2 protein,

(c) a transmembrane domain of an endogenous non-human animal ACE2 protein, and

(d) a cytoplasmic domain of an endogenous non-human animal ACE2 protein, and

(II) wherein the recombinant TMPRSS protein comprises in operable likage:

(a) a TMPRSS signal sequence of the non-human animal TMPRSS protein

(b) an extracellular domain substantially identical to the extracellular domain of a human TMPRSS protein, and

(c) a transmembrane domain of the non-human TMPRSS protein, and

(d) a cytoplasmic domain of the non-human TMPRSS protein, wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous.

3. The non-human animal, non-human animal cell, or non-human animal genome of claim 1 or claim 2, wherein

(I) the humanized ACE2 gene comprises a nucleotide sequence encoding an extracellular domain substantially identical to the extracellular domain of a human ACE2 protein operably linked to an endogenous nucleotide sequence encoding (iii) the transmembrane domain of an endogenous non-human animal ACE2 protein, and (iv) the cytoplasmic domain of an endogenous non-human animal ACE2 protein; and/or

84 (II) each of the one or more distinct and humanized TMPRSS gene comprises a nucleotide sequence encoding an extracellular domain substantially identical to the extracellular domain of a human TMPRSS protein operably linked to an endogenous nucleotide sequence encoding (iii) the transmembrane domain of an endogenous and cognate animal TMPRSS protein and (iv) the cytoplasmic domain of an endogenous and non-human animal TMPRSS protein, wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous.

4. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein

(I) an endogenous ACE2 locus comprises the humanized ACE2 gene, wherein the endogenous ACE2 locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous ACE2 protein with the nucleotide sequence encoding the extracellular domain of a human ACE2 protein such that the nucleotide sequence encoding the extracellular domain of a human ACE2 protein is operably linked to an endogenous nucleotide sequence encoding (iii) the transmembrane domain of an endogenous non-human animal ACE2 protein, and (iv) the cytoplasmic domain of an endogenous non-human animal ACE2 protein and/or

(II) an endogenous TMPRSS locus comprises one of the one or more distinct and humanized TMPRSS genes, wherein the endogenous TMPRSS locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous TMPRSS protein with the nucleotide sequence encoding the extracellular domain of a human TMPRSS protein such that the nucleotide sequence encoding the extracellular domain of the human TMPRSS protein is operably linked to an endogenous nucleotide sequence encoding (iii) the transmembrane domain of an endogenous TMPRSS protein at the endogenous TMPRSS locus, and (iv) the cytoplasmic domain of the endogenous TMPRSS protein, wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous.

85

5. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein

(I) the humanized ACE2 gene encodes a recombinant ACE2 protein comprising in operable linkage:

(i) an ACE2 signal sequence of an endogenous ACE2 protein,

(ii) an extracellular domain of a human ACE2 protein,

(iii) a transmembrane domain of an endogenous ACE2 protein, and

(iv) a cytoplasmic domain of an endogenous mouse ACE2 protein, optionally wherein the extracellular domain of a human ACE2 protein is encoded by part of the coding sequence of coding exon 1, all of the coding sequences of coding exon 2 to coding exon 16, inclusive, and part of coding exon 17 of a human ACE2 gene and/or wherein the amino acid sequence of the recombinant ACE2 protein is set forth in SEQ ID NO:20, and

(II) one of the one or more distinct and humanized TMPRSS genes is:

(A) a humanized TMPRSS2 gene, optionally wherein the humanized TMPRSS2 gene:

(i) is at an endogenous TMPRSS2 locus, optionally under control of the endogenous TMPRSS2 promoter at the endogenous TMPRSS2 locus,

(ii) is formed by a replacement of an endogenous nucleotide sequence of an endogenous TMPRSS2 gene, or portion thereof, with an orthologous nucleotide sequence of a human TMPRSS2 gene,

(iii) comprises coding exon 4 through the stop codon in coding exon 13 of the human TMPRSS2 gene, and optionally further comprises the 3’ UTR of the human TMPRSS2 gene,

(iv) comprises coding exons 1-2 of the endogenous TMPRSS2 gene and coding exon 4 through coding exon 13 of the human TMPRSS2 gene, and optionally the 3’ UTR of the human TMPRSS2 gene

86 (v) comprises an exon 3 that comprises a 5’ portion of coding exon 3 of the endogenous TMPRSS2 gene and a 3’ portion of coding exon 3 of the human TMPRSS2 gene and/or

(vi) encodes a humanized TMPRSS2 protein that comprises (i) an extracellular domain of the human TMPRSS2 protein encoded by a human TMPRSS2 gene, and (ii) a cytoplasmic and transmembrane portion encoded by the endogenous TMPRSS2 gene, optionally wherein the extracellular domain of the human TMPRSS2 protein comprises the residues W 106 to G492 of SEQ ID NO:71;

(B) a humanized TMPRSS4, optionally wherein the humanized TMPRSS4 gene:

(i) is at an endogenous TMPRSS4 locus, optionally under control of the endogenous TMPRSS4 promoter at the endogenous TMPRSS4 locus,

(ii) is formed by a replacement of an endogenous nucleotide sequence of an endogenous TMPRSS4 gene, or portion thereof, with an orthologous nucleotide sequence of a human TMPRSS4 gene,

(iii) comprises coding exon 4 through the stop codon in coding exon 13 of the human TMPRSS4 gene, and optionally further comprises the 3’UTR of the human TMPRSS2 gene,

(iv) comprises coding exons 1-3 of the endogenous TMPRSS2 gene and coding exon 4 through coding exon 13 of the human TMPRSS4 gene, and optionally the 3’ UTR of the human TMPRSS4 gene and/or

(v) encodes a humanized TMPRSS4 protein that comprises (i) an extracellular domain of the human TMPRSS4 protein encoded by a human TMPRSS4 gene, and (ii) a cytoplasmic and transmembrane portion encoded by the endogenous TMPRSS4 gene, optionally wherein the extracellular domain of the human TMPRSS4 protein comprises the residues K54-L437 of SEQ ID NO:78; and/or

(C) a humanized TMPRSS11, optionally wherein the humanized TMPRSS11 gene:

(i) is at an endogenous TMPRSS11 locus, optionally under control of the endogenous TMPRSS11 promoter at the endogenous TMPRSS11 locus,

87 (ii) is formed by a replacement of an endogenous nucleotide sequence of an endogenous TMPRSS11 gene, or portion thereof, with an orthologous nucleotide sequence of a human TMPRSS11 gene,

(iii) comprises coding exon 3 through the stop codon in coding exon 10 of the human TMPRSS11 gene, and optionally further comprises the 3’ UTR of the human TMPRSS 11 gene,

(iv) comprises coding exons 1-2 of the endogenous TMPRSS11 gene and coding exon 3 through coding exon 10 of the human TMPRSS11 gene, and optionally the 3’ UTR of the human TMPRSS11 gene and/or

(v) encodes a humanized TMPRSS11 protein that comprises (i) an extracellular domain of the human TMPRSS11 protein encoded by a human TMPRSS11 gene, and (ii) a cytoplasmic and transmembrane portion encoded by the endogenous TMPRSS1 1 gene, optionally wherein the extracellular domain of the human TMPRSS11 protein comprises the residues A42 to 1418 of SEQ ID NO:85.

6. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein the non-human animal, non-human animal cell, or non- human animal genome comprises at least two humanized TMPRSS genes, each at orthologous endogenous TMPRSS gene loci.

7. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims wherein

(I) the humanized ACE2 gene is under control of the promoter of the endogenous ACE2 gene and/or

(II) each of the one or more distinct and human TMPRSS genes is under the control of the promoter of the endogneous and orthologous TMPRESS gene.

8. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein the non-human animal is heterozygous for the humanized ACE2 and /or humanized TMPRSS gene(s).

88

9. The non-human animal, non-human animal cell, or non-human animal genome of any one of claims 108 wherein the non-human animal is homozygous for the humanized ACE2 and/or humanized TMPRSS gene(s).

10. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein the non-human animal is a rodent, the non-human animal cell is a rodent cell, or the non-human animal genome is a rodent genome.

11. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein the non-human animal is a rat, the non-human animal cell is a rat cell, or the non-human animal genome is a rat genome.

12. The non-human animal, non-human animal cell, or non-human animal genome of any one of claims 1-10, wherein the non-human animal is a mouse, the non-human animal cell is a mouse cell, or the non-human animal genome is a mouse genome.

13. The non-human animal, non-human animal cell, or non-human animal genome of any one of the preceding claims, wherein the non-human animal, non-human animal cell, or non- human animal genome expresses the recombinant ACE2 protein and/or the one or more distinct recombinant TMPRSS proteins.

14. A non-human animal cell of any one of claims 1-12, wherein the cell is an embryonic stem (ES) cell.

89

15. A non-human animal tissue comprising the non-human animal cell of any one of the preceding claims.

16. A method for making a non-human animal, non-human animal cell, or non-human animal genome comprising a humanized ACE2 and a humanized TMPRSS gene, the method comprising:

(A) generating a genetically modified non-human embryonic stem (ES) cell comprising:

(I) humanized ACE2 gene encoding a recombinant ACE2 protein, wherein the recombinant ACE2 protein comprises in operable linkage:

(a) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein,

(b) an extracellular domain substantially identical to an extracellular domain of a human ACE2 protein,

(c) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and

(d) a cytoplasmic domain of a non-human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein, and

(II) one or more distinct and humanized TMPRSS genes, each encoding a distinct recombinant TMPRSS protein that comprises in operable likage:

(a) a TMPRSS signal sequence of a non-human animal TMPRSS protein or TMPRSS signal sequence of a human TMPRSS protein,

(b) an extracellular domain substantially identical to the extracellular domain of the human TMPRSS protein, and

(c) a transmembrane domain of the non-human animal TMPRSS protein or a transmembrane domain of the human TMPRSS protein, and

(d) a cytoplasmic domain of the non-human TMPRSS protein or a cytoplasmic domain of the human TMPRSS protein,

90 wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous

(B) introducing the modified non-human animal ES cell into a host embryo of non-human animal to form a donor cell-non-human animal embryo complex; and

(C) gestating the donor cell-non-human animal embryo in a surrogate non-human animal mother, wherein the surrogate non-human animal mother produces rodent progeny that express the humanized ACE2 and TMPRSS proteins.

17. The method of claim 16, wherein generating the genetically modified non-human animal ES cell comprises:

(i) obtaining a non-human ES cell that comprises the humanized ACE2 gene, and

(ii) modifying the genome of the obtained non-human ES cell that comprises the humanized ACE2 gene to further comprise the one or more distinct humanized TMPRSS genes, wherein modifying the genome comprises replacing an endogneous nucleotide seuqence encoding an extracellular domain of an endogenous TMPRSS protein with a nucleotide seuqence encoding encoding a transmembrane domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein that is orthologous to the endogenous TMPRSS protein.

18. The method of claim 16, wherein generating the genetically modified non-human animal ES cell comprises:

(i) obtaining a non-human ES cell that comprises the one or more distinct humanized TMPRSS genes, and

(ii) modifying the genome of the obtained non-human ES cell that comprises the tone or more distinct humanized TMPRSS genes to further comprise the humanized ACE2 gene, wherein modifying the genome comprises replacing an endogneous nucleotide seuqence encoding an extracellular domain of an endogenous ACE2 protein with a nucleotide seuqence encoding encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein.

19. A non-human animal, non-human animal cell, or non-human animal genome made according to the method of any one of claims claim 16-18.

20. A composition comprising the non-human animal cell or tissue of any one of claims 1-15 and 19.

21. The composition of claim 20, further comprising a spike protein of a coronavirus, wherein the spike protein binds the human ACE2 protein.

22. The composition of claim 20 or 21, further comprising a therapeutic agent that inhibits or prevents binding of an ACE2 ligand to the recombinant ACE2 protein, optionally wherein the ACE2 ligand comprises a spike protein of a coronavirus.

23. The composition of claim 22, wherein the therapeutic agent is an antigen-binding protein that binds the spike protein of a coronavirus.

24. A non-human animal model of coronavirus infection comprising

(a) a non-human animal or non-human animal cell according to any one of claims 1- 14 and 19 or made according to any one of claims 16-18, wherein the non-human animal or non-human animal cell expresses the recombinant ACE2 protein and/or the one or more distinct recombinant TMPRSS proteins, and

(b) a coronavirus comprising a spike protein that binds to a human ACE2 protein.

25. A method of screening drug candidates that target a ligand of a human ACE2 protein, comprising: a. introducing the ligand of a human ACE2 protein into a genetically modified non-human animal according to any one of claims 1-14 and 19 or made according to a method of any one of claims 16-18, wherein the non-human animal or non-human animal cell expresses the recombinant ACE2 protein and/or the one or more distinct recombinant TMPRSS proteins, b. contacting the non-human animal with a drug candidate of interest, wherein the drug candidate is directed against the ligand of a human ACE2 protein, and c. determining if the drug candidate is efficacious in preventing, reducing or eliminating binding of the ligand of a human ACE2 protein to the recombinant ACE2 protein.

26. The method of claim 25, wherein the step of introducing comprises infecting the non-human animal with a coronavirus, wherein the coronavirus comprises a spike protein, and wherein the spike protein comprises the ligand of a human ACE2 protein.

93

Description:
NON-HUMAN ANIMALS COMPRISING HUMANIZED ACE2 AND TMPRSS LOCI

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims benefit of priority to U.S. Provisional Application No. 63/291,502, filed December 20, 2021, which is incorporated by reference in its entirety.

FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

[0002] This invention was made with Government support under Agreement HHS0100201700020C, awarded by the U.S. Department of Health and Human Services. The Government has certain rights in the invention.

REFERENCE TO A SEQUENCE LISTING

[0003] The Sequence Listing written in file “10833W001_ST26.xml” is 459 kilobytes, was created on December 15, 2022, and is hereby incorporated by reference.

TECHNICAL FIELD

[0004] Described herein are (1) non-human animals, e.g., rodents (e.g., rats and mice), and tissues or cells derived therefrom, that comprise (a) a human or humanized Angiotensinconverting enzyme 2 (ACE2) gene, e.g., at an endogenous ACE2 locus and (b) a human or humanized transmembrane serine protease (TMPRSS) gene, e.g., at an endogenous TMPRSS locus, and that express therefrom a recombinant ACE2 protein and a recombinant ACE2 protein, respectively, (2) non-human animal embryonic stem cells and/or genomes comprising the human or humanized ACE2 gene, and the human or humanized TMPRSS gene, (3) methods of making and using the non-human animal embryonic stem cells and/or non-human animal genomes, including methods of making the non-human animals from the non-human animal embryonic stem cells, and (4) methods of making and using the non- human animals that comprise the human or humanized ACE2 gene, and the human or humanized TMPRSS gene. Such non-human animals express a recombinant ACE2 protein comprising at least an extracellular domain of a human ACE2 protein and the human or humanized TMPRSS gene, and thus, may be used to delineate the biological activity of ligand binding to human ACE2 protein, particularly viral ligands requiring a TMPRSS protein for viral infection. Such models may be useful, e.g., for understanding coronavirus infections, e.g., SARS-CoV and/or SARS-CoV-2 infection, and/or evaluating the efficacy of a vaccine or treatment protocol for same. BACKGROUND

[0005] Angiotensin-converting enzyme 2 (ACE2) is an enzyme found on the cell surface of cells in the lungs, arteries, heart, kidney, and intestines. A primary function of ACE2 is to cleave the carboxyl-terminal amino acid phenylalanine from angiotensin II and hydrolyze it into the vasodilator angiotensin. ACE2 also serves as the main entry point into cells for some coronaviruses (CoVs).

[0006] CoVs can cause diseases in animals. In humans, CoVs are responsible for many of the epidemics of recent years. In 2002, the Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) was responsible for the SARS epidemic, which was contained in July 2003. Since 2004, there have not been any known cases of SARS reported. In 2012, the Middle East Respiratory Syndrome Coronavirus (MERS-CoV) emerged as the second coronavirus resulting in a global public health crisis, although an outbreak with a 32.97% fatality rate did not occur until 2014. Cases of MERS continue to be reported, with a 34.4% case-fatality ratio. SARS-CoV-2, identified in 2019, is the cause of the disease COVID- 19 and infection with SARS-CoV-2 reached pandemic levels within months of its first identification. The COVID-19 pandemic is currently ongoing.

[0007] Infection with SARS-CoV-2 causes symptoms ranging from mild or nonexistent to death resulting from severe damage to the lungs and possibly other organs. Due to variability in testing capabilities among different countries, reported fatality rates vary between 1 to 5 percent. If an infected patient is over 50 years old or if they suffer from an underlying condition such as asthma, diabetes, or heart disease, then their chances of surviving COVID-19 decreases. Since infected individuals who are asymptomatic may contribute to the spread of the virus, and since an effective vaccine or treatment protocol is not yet available, the only means by which to reduce transmission of SARS-CoV-2 is requiring all individuals to practice social distancing. This had led to a demand for shortcuts in the regulatory approval process for a potential vaccine or treatment, with human trials of lead vaccine or treatment candidates set to proceed at an unprecedented fast pace.

[0008] Infection with SARS-CoV and SARS-CoV-2 is mediated through ACE2 and members of the transmembrane serine protease (TMPRSS) family. Accordingly, a nonhuman animal model for coronavirus infection, e.g., SARS-CoV and/or SARS-CoV-2 infection, may be beneficial for the testing of putative vaccines and/or treatments against current and future infection. SUMMARY

[0009] Disclosed herein are a non-human animal, non-human animal cell, or nonhuman animal genome comprising (I) a humanized ACE2 gene, e.g., at an endogenous non- human animal ACE2 locus and optionally under the control of an endogenous ACE2 promoter, that encodes a recombinant ACE2 protein and and (II) a humanized TMPRSS gene, e.g., at an endogenous TMPRSS locus and optionally under the control of an endogenous TMPRSS promoter, that encodes a recombinant TMPRSS protein, optionally wherein the recombinant ACE2 and/or TMPRSS proteins are expressed by the non-human animal, non-human animal cell, or non-human animal genome.

[0010] In some embodiments, a recombinant ACE2 protein comprises in operable linkage: (i) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein, (ii) an extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and (iv) a cytoplasmic domain of a non- human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein. In some embodiments, a recombinant ACE2 protein comprises in operable linkage: (i) an ACE2 signal sequence of a non-human animal ACE2 protein, (ii) an extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of a non-human animal ACE2 protein, and (iv) a cytoplasmic domain of a non-human animal ACE2 protein. In some embodiments a recombinant ACE2 protein comprises an amino acid sequence set forth as SEQ ID NO:24. [0011] In some embodiments, a modified endogenous ACE2 locus of a non-human animal comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous ACE2 protein with a nucleotide sequence encoding the extracellular domain of a human ACE2 protein such that the nucleotide sequence encoding the extracellular domain of a human ACE2 protein is operably linked to an endogenous nucleotide sequence encoding (iii) the transmembrane domain of an endogenous non-human animal ACE2 protein, and (iv) the cytoplasmic domain of an endogenous non-human animal ACE2 protein. In some embodiments the modified endogenous ACE2 locuscomprises a nucleotide sequence set forth as SEQ ID NO:5 or set forth as SEQ ID NO:22.

[0012] In some embodiments, the nucleotide sequence encoding the extracellular domain of a human ACE2 protein comprises part of the coding sequence of coding exon 1, all of the coding sequences of coding exon 2 to coding exon 16, inclusive, and part of the coding seqeence of coding exon 17 of a human ACE2 gene. [0013] In some embodiments, a humanized ACE2 gene as described herein comprises a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein operably linked to a nucleotide sequence encoding a transmembrane domain substantially (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the transmembrane domain of a non-human animal ACE2 protein, and a cytoplasmic domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the cytoplasmic domain of a non-human animal ACE2 protein. In some embodiments, an endogenous non-human animal ACE2 locus comprises the humanized ACE2 gene, e.g., the endogeneous non-human animal ACE2 locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous ACE2 protein with a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein such that the nucleotide sequence encoding the extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to a human ACE2 protein is operably linked to an endogenous nucleotide sequence encoding the transmembrane and cytoplasmic domains of an endogenous non-human animal ACE2 protein of an endogenous non-human animal ACE2 protein. In some embodiments, the humanized ACE2 gene is under the control of a non-human animal promoter, e.g., an endogenous non-human animal promoter, e.g., at an endogenous non- human animal ACE2 locus. In some embodiments, the nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein comprises part of the coding sequence of coding exon 1, all of the coding sequences of coding exon 2 to coding exon 16, inclusive, and part of the coding sequence of coding exon 17 of a human ACE2 gene, and degenerate variants thereof.

[0014] In some embodiments, a recombinant ACE2 protein is encoded by a humanized ACE2 gene and expressed by the non-human animal, non-human animal cell, or non-human animal genome. In some embodiments, a recombinant ACE2 protein comprises in operable linkage: (i) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein, (ii) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and (iv) a cytoplasmic domain of a non-human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein. In some embodiments, a recombinant ACE2 protein comprises in operable linkage:

(i) an ACE2 signal sequence of a non-human animal ACE2 protein, (ii) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to an extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of a non-human animal ACE2 protein, and (iv) a cytoplasmic domain of a non- human animal ACE2 protein. In some mature recombinant ACE2 protein embodiments, a recombinant ACE2 protein comprises only a mature sequence, e.g., (i) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human ACE2 protein, (ii) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and (iii) a cytoplasmic domain of a non-human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein. In some mature recombinant ACE2 protein embodiments, a recombinant ACE2 protein comprises (i) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to an extracellular domain of a human ACE2 protein, (ii) a transmembrane domain of a non-human animal ACE2 protein, and (iiii) a cytoplasmic domain of a non-human animal ACE2 protein. In some embodiments a recombinant ACE2 protein comprises an amino acid sequence set forth as SEQ ID NO:24.

[0015] Also described herein are nucleotide molecules, e.g., targeting vectors, which may be useful in making non-human animals comprising a modified endogenous ACE2 locus that expresses a human or humanized ACE2 protein. In some embodiments, a targeting vector comprises an insert nucleotide that (a) comprises a nucleotide sequence that encodes at least an extracellular domain of a human ACE2 protein and (b) is flanked by 5’ and 3’ homology arms that undergo homologous recombination with an endogenous ACE2 locus of a non-human animal, wherein following the homologous recombination of the endogenous ACE2 locus with the 5’ and 3’ homology arms, the genetically modified endogenous ACE2 locus of the non-human animal encodes, under the control of an endogenous ACE2 promoter, a recombinant ACE2 protein that comprises in operable linkage: (i) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein

(ii) the extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and (iv) a cytoplasmic domain of a non-human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein. In some embodiments, following the homologous recombination of the endogenous ACE2 locus with the 5’ and 3’ homology arms, the genetically modified endogenous ACE2 locus of the non-human animal encodes, under the control of an endogenous ACE2 promoter, a recombinant ACE2 protein that comprises in operable linkage: (i) an ACE2 signal sequence of an endogenous non-human animal ACE2 protein, (ii) an extracellular domain of a human ACE2 protein, (iii) a transmembrane domain of an endogenous non-human animal ACE2 protein, and (iv) a cytoplasmic domain of an endogenous non-human animal ACE2 protein. In some embodiments, following homologous recombination, the nucleotide sequence that encodes at least an extracellular domain of a human ACE2 protein replaces an orthologous sequence at the endogenous ACE2 locus. In some targeting vector embodiments, the nucleotide sequence that encodes at least an extracellular domain of a human ACE2 protein comprises part of the coding sequence of coding exon 1 and all of the coding sequences of coding exon 2 to coding exon 16, inclusive, and part of the coding sequence of exon 17 of a human ACE2 gene.

[0016] In some embodiments, a targeting vector or nucleic acid comprises a nucleotide sequence set forth as SEQ ID NO:5, SEQ ID NO:22, or SEQ ID NO:25. [0017] In some embodiments, an insert nucleic acid further comprises a second nucleic acid sequence comprising a sequence encoding a selectable marker, preferably wherein the sequence encoding a selectable marker is operably linked to a promoter. In some embodiments, the insert nucleotide comprises site-specific recombination sites flanking the second nucleic acid sequence. In some embodiments, the second nucleic acid sequence further comprises a sequence encoding a site-specific recombinase, preferably wherein the sequence encoding the selectable marker is operably linked to a promoter. In some embodiments, a targeting vector comprises from 5’ to 3’ a nucleotide sequence comprising the nucleotide sequences set forth as SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, and SEQ ID NO:21.

[0018] Also described herein are nucleic acids. A nucleic acid embodiment herein may comprises a sequence set forth as SEQ ID NO: 5. In some embodiments, a nucleic acid comprises a sequence set forth as SEQ ID NO:22. In some embodiments, a nucleic acid comprises a sequence set forth as SEQ ID NO:25.

[0019] Also described herein are a non-human animal, non-human animal cell, or non-human animal genome comprising a genetically modified endogenous ACE2 locus. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome is modified at an endogenous ACE2 locus with a targeting vector as described herein or to comprise a nucleic acid described herein. [0020] In some embodiments, a non-human animal cell as described herein comprises a modified ACE2 locus that expresses a human or humanized ACE2 protein, wherein the amino acid sequence of the extracellular domain of a human ACE2 protein is set forth in SEQ ID NO:27. In some embodiments, a non-human animal, non-human animal cell, or non- human animal genome is heterozygous for the genetically modified endogenous ACE2 locus. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome is homozygous for the genetically modified endogenous ACE2 locus. In some embodiments, a non-human animal is a mammal, a non-human animal cell is a mammalian cell, or a non-human animal genome is a mammalian genome. In some embodimetns, a non- human animal is a rodent, a non-human animal cell is a rodent cell, or a non-human animal genome is a rodent genome. In some embodiments, a non-human animal is a rat or mouse, tanon-human animal cell is a rat cell or a mouse cell, or a non-human animal genome is a rat genome or a mouse genome. In some embodiments, a non-human animal is a mouse, a non- human animal cell is a mouse cell, or a non-human animal genome is a mouse genome. In some animal, cell and/or genome embodiments, the animal, cell and/or genome comprises a sequence encoding a recombinant ACE2 protein and the animal, cell and/or genome expresses the recombinant ACE2 protein.

[0021] In some embodiments, a non-human animal ecomprising a genetically modified endogenous ACE2 locus as described herein expresses a recombinant ACE2 protein in an organ selected from the group consisting of colon, duodenum, kidney, heart, liver, lung, trachea, and any combination thereof. In some embodiments, the expression pattern of a recombinant ACE2 protein in a genetically modified non-human animal as described herein follows the expression pattern of a non-human animal ACE2 protein in a control non-human animal comprising a wildtype endogenous ACE2 locus.

[0022] In some embodiments, the recombinant ACE2 protein is expressed on epithelial cells. Accordingly, also described herein is a non-human animal cell expressing a recombinant ACE2 protein, optionally wherein the non-human animal cell (e.g., rat cell or mouse cell) is a somatic cell, optionally wherein the somatic cell is an epithelial cell. Nonlimiting exmaples of epithethial cells that may express a recombinant ACE2 protein as described herein include respiratory and/or gastrointestinal epithelial cells, e.g., an alveolar cell of the lung, an esophagus upper and stratified epithelial cell, an absorptive enterocyte from the ileum or colon, etc. In some embodiments, a non-human animal cell as described hererein expresses the recombinant ACE2 protein in the epithelium of small intestine villi, surface epithelium of the large intestine (colon), the epithelium of large to small bronchioles and bronchi of the lung, respiratory epithelium of the trachea, proximal tubular epithelium of the kidney, respiratory epithelium of th nasal cavity, and/or the stratum granulosum and/or stratum spinosum of oral mucosa/tongue in the oral cavity.

[0023] In some embodiments, a humanized TMPRSS gene comprises a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein operably linked to a non-human animal nucleotide sequence encoding a transmembrane domain substantially (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the transmembrane domain of a non-human animal TMPRSS protein, and a cytoplasmic domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the cytoplasmic domain of a non-human animal TMPRSS protein, wherein the human TMPRSS and non-human animal TMPRSS proteins are orthologous. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome comprises am endogenous non-human animal TMPRSS locus modified to comprise the humanized TMPRSS gene, e.g., the endogeneous non-human animal TMPRSS locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous TMPRSS protein with a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein such that the nucleotide sequence encoding the extracellular domain is an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein, and is operably linked to an endogenous nucleotide sequence encoding the transmembrane and cytoplasmic domains of the endogenous non-human animal TMPRSS protein at the endogenous TMPRSS locus, wherein the human TMPRSS and non-human animal TMPRSS proteins are orthologous. In some embodiments, the humanized TMPRSS gene is under the control of a non-human animal promoter, e.g., an endogenous non-human animal promoter, e.g., at an endogenous non-human animal TMPRSS locus.

[0024] In some embodiments, a humanized TMPRSS gene comprises a humanized TMPRSS2 gene, which humanized TMPRSS2 gene comprises a nucleotide sequence of a human TMPRSS2 gene and a nucleotide sequence of the non-human animal TMPRSS2 gene. In some embodiments, a humanized TMPRSS2 gene comprises a nucleotide sequence of a human TMPRSS2 gene and a nucleotide sequence of the non-human animal TMPRSS2 gene, wherein the nucleotide sequence of the human TMPRSS2 gene encodes an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical in sequence) with the extracellular domain of the human TMPRSS2 protein encoded by the human TMPRSS2 gene. In specific embodiments, the nucleotide sequence of a human TMPRSS2 gene is a contiguous genomic sequence of a human TMPRSS2 gene containing coding exon 4 through the stop codon in coding exon 13 of the human TMPRSS2 gene. In particular embodiments, the contiguous genomic sequence of a human TMPRSS2 gene further contains the 3’ UTR of the human TMPRSS2 gene. In some embodiments, the nucleotide sequence of the non-human animal TMPRSS2 gene included in a humanized TMPRSS2 gene encodes a cytoplasmic and transmembrane portion that is substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS2 protein encoded by the non- human animal TMPRSS2 gene. In some embodiments, a humanized TMPRSS2 gene comprises coding exons 1-2 of the non-human animal TMPRSS2 gene, and coding exon 4 through coding exon 13 of a human TMPRSS2 gene, wherein the humanized TMPRSS2 gene encodes a humanized TMPRSS2 protein comprising an extracellular domain that is substantially identical with the extracellular domain of the human TMPRSS2 protein encoded by the human TMPRSS2 gene and a transmembrane and cytoplasmic portion that is substantially identical with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS2 protein encoded by the non-human animal TMPRSS2 gene. In some embodiments, the humanized TMPRSS2 gene contains an exon 3 that in some embodiments is coding exon 3 of a human TMPRSS2 gene, and in other embodiments is coding exon 3 of an endogenous rodent TMPRSS2 gene. In some embodiments, the humanized TMPRSS2 gene comprises an exon 3 that includes a 5’ portion of coding exon 3 of the non-human animal TMPRSS2 gene and a 3’ portion of coding exon 3 of a human TMPRSS2 gene. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome comprises an endogenous non-human animal TMPRSS2 locus modified to comprise the humanized TMPRSS2 gene, e.g., the endogeneous non-human animal TMPRSS2 locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous TMPRSS2 protein with a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS2 protein such that the nucleotide sequence encoding the extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS2 protein is operably linked to an endogenous nucleotide sequence encoding the transmembrane and cytoplasmic domains of the endogenous non-human animal TMPRSS2 protein at the endogenous TMPRSS2 locus. In some embodiments, the humanized TMPRSS2 gene is under the control of a non-human animal TMPRSS2 promoter, e.g., an endogenous non-human animal TMPRSS2 promoter, e.g., at an endogenous non-human animal TMPRSS2 locus.

[0025] In some embodiments, a humanized TMPRSS gene comprises a humanized TMPRSS4 gene comprising a nucleotide sequence of a human TMPRSS4 gene and a nucleotide sequence of the non-human animal TMPRSS4 gene. In some embodiments, a humanized TMPRSS4 gene comprises a nucleotide sequence of a human TMPRSS4 gene and a nucleotide sequence of a non-human animal TMPRSS4 gene, wherein the nucleotide sequence of a human TMPRSS4 gene encodes an extracellular domain substantially identical with the extracellular domain of the human TMPRSS4 protein encoded by the human TMPRSS4 gene. In specific embodiments, the nucleotide sequence of a human TMPRSS4 gene is a contiguous genomic sequence containing coding exon 4 through the stop codon in coding exon 13 of a human TMPRSS4 gene. In some embodiments, the nucleotide sequence of an on-human animal TMPRSS4 gene included in a humanized TMPRSS4 gene encodes a cytoplasmic and transmembrane portion that is substantially identical with the cytoplasmic and transmembrane portion of the rodent TMPRSS4 protein encoded by the non-human animal TMPRSS4 gene. In particular embodiments, a humanized TMPRSS4 gene comprises coding exon 1 through coding exon 3 of a non-human animal TMPRSS4 gene, and coding exon 4 through the stop codon in coding exon 13 of a human TMPRSS4 gene. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome comprises an endogenous non-human animal TMPRSS4 locus modified to comprise the humanized TMPRSS4 gene, e.g., the endogeneous non-human animal TMPRSS4 locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous TMPRSS4 protein with a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS4 protein such that the nucleotide sequence encoding the extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of human TMPRSS4 protein is operably linked to an endogenous nucleotide sequence encoding the transmembrane and cytoplasmic domains of the endogenous non-human animal TMPRSS4 protein at the endogenous TMPRSS4 locus. In some embodiments, the humanized TMPRSS4 gene is under the control of a non-human animal TMPRSS4 promoter, e.g., an endogenous non-human animal TMPRSS4 promoter, e.g., at an endogenous non-human animal TMPRSS4 locus.

[0026] In some embodiments, a humanized TMPRSS gene comprises comprises a humanized TMPRSS1 ID gene comprising a nucleotide sequence of a human TMPRSS1 ID gene and a nucleotide sequence of a non-human animal TMPRSS 1 ID gene. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome comprises a humanized TMPRSS1 ID gene comprising a nucleotide sequence of an endogenous rodent TMPRSS 1 ID gene and a nucleotide sequence of a human TMPRSS 1 ID gene, wherein the nucleotide sequence of the human TMPRSS 1 ID gene encodes an extracellular domain substantially identical with the extracellular domain of the human TMPRSS 1 ID protein encoded by the human TMPRSS 1 ID gene. In specific embodiments, the nucleotide sequence of a human TMPRSS1 ID gene is a contiguous genomic sequence containing coding exon 3 through the stop codon in coding exon 10 of a human TMPRSS 1 ID gene. In particular embodiments, the contiguous genomic sequence of a human TMPRSS1 ID gene further contains the 3’ UTR of the human TMPRSS 1 ID gene. In some embodiments, the nucleotide sequence of a non-human animal TMPRSS 1 ID gene included in a humanized TMPRSS 1 ID gene encodes a cytoplasmic and transmembrane portion that is substantially identical with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS 1 ID protein encoded by the non-human animal TMPRSS 1 ID gene. In dome embodiments, a humanized TMPRSS1 ID gene contains coding exons 1-2 of an endogenous non-human animal TMPRSS1 ID gene, and coding exon 3 through coding exon 13 of a human TMPRSS1 ID gene. In some embodiments, a non-human animal, non-human animal cell, or non-human animal genome comprises an endogenous non-human animal TMPRSS 1 ID locus modified to comprise the humanized TMPRSS 1 ID gene, e.g., the endogeneous non-human animal TMPRSS 1 ID locus comprises a replacement of the nucleotide sequence encoding the extracellular domain of the endogenous TMPRSS 1 ID protein with a nucleotide sequence encoding an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS 1 ID protein such that the nucleotide sequence encoding the extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of human TMPRSS 1 ID protein is operably linked to an endogenous nucleotide sequence encoding the transmembrane and cytoplasmic domains of the endogenous non-human animal TMPRSS1 ID protein at the endogenous TMPRSS1 ID locus. In some embodiments, the humanized TMPRSS1 ID gene is under the control of a non-human animal TMPRSS1 ID promoter, e.g., an endogenous nonhuman animal TMPRSS1 ID promoter, e.g., at an endogenous non-human animal TMPRSS1 ID locus.

[0027] In some embodiments, a recombinant TMPRSS protein described herein is encoded by a humanized TMPRSS gene as described herein. In some embodiments, a recombinant TMPRSS protein comprises in operable linkage: (i) a TMPRSS signal sequence of a non-human animal TMPRSS protein or a TMPRSS signal sequence of a human TMPRSS protein, (ii) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein, (iii) a transmembrane domain of a non-human animal TMPRSS protein or a transmembrane domain of a human TMPRSS protein, and (iv) a cytoplasmic domain of a non-human animal TMPRSS protein or a cytoplasmic domain of a human TMPRSS protein, wherein the non-human TMPRSS protein and human TMPRSS protein are orthologous. In some embodiments, a recombinant TMPRSS protein comprises in operable linkage: (i) a TMPRSS signal sequence of a non-human animal TMPRSS protein, (ii) an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to an extracellular domain of a human TMPRSS protein, (iii) a transmembrane domain of a non-human animal TMPRSS protein, and (iv) a cytoplasmic domain of a non- human animal TMPRSS protein, wherein the non-human TMPRSS protein and human TMPRSS protein are orthologous.

[0028] In some embodiments, the orthologous non-human animal and human TMPRSS proteins are a non-human animal TMPRSS2 protein and a human TMPRSS2 protein, respectively. In some embodiments, a humanized TMPRSS2 gene encodes a recombinant TMPRSS2 protein comprising an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical in sequence) with the extracellular domain of the human TMPRSS2 protein encoded by the human TMPRSS2 gene used in humanization. The recombinant TMPRSS2 protein comprises, in some embodiments, an amino acid sequence at least 85% identical (e.g., at least 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence as set forth in SEQ ID NO:71. In some embodiments, a recombinant TMPRSS2 protein comprises an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence composed of residues W106 to G492 or the C-terminal 387 amino acids of a human TMPRSS2 protein as set forth in, e.g., SEQ ID NO:71. In some embodiments, the humanized TMPRSS2 gene encodes a humanized TMPRSS2 protein that further contains a cytoplasmic and transmembrane portion that is substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS2 protein encoded by the non-human animal TMPRSS2 gene being humanized. An exemplary endogenous non-human animal TMPRSS2 protein is set forth in SEQ ID NO:69.

[0029] In some embodiments, the orthologous non-human animal and human TMPRSS proteins are a non-human animal TMPRSS4 protein and a human TMPRSS4 protein, respectively. In some embodiments, a humanized TMPRSS4 gene encodes a recombinant TMPRSS4 protein comprising an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical in sequence) with the extracellular domain of the human TMPRSS4 protein encoded by the human TMPRSS4 gene used in humanization. The recombinant TMPRSS4 protein comprises, in some embodiments, an amino acid sequence at least 85% identical (e.g., at least 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence as set forth in SEQ ID NO:78. In some embodiments, a recombinant TMPRSS4 protein contains an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence composed of residues K54 to L437 or the C-terminal 384 amino acids of a human TMPRSS4 protein as set forth in, e.g., SEQ ID NO:78. In some embodiments, the humanized TMPRSS4 gene encodes a recombinant TMPRSS4 protein that further comprises a cytoplasmic and transmembrane portion that is substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS4 protein encoded by the non-human animal TMPRSS4 gene being humanized. An exemplary non-human animal, e.g., rodent, TMPRSS4 protein is set forth in SEQ ID NO:76.

[0030] In some embodiments, the orthologous non-human animal and human TMPRSS proteins are a non-human animal TMPRSS 11 protein and a human TMPRSS 11 protein, respectively. In some embodiments, the orthologous non-human animal and human TMPRSS proteins are a non-human animal TMPRSS11 A protein and a human TMPRSS11 A protein, respectively. In some embodiments, the orthologous non-human animal and human TMPRSS proteins are a non-human animal TMPRSS 1 ID protein and a human TMPRSS1 ID protein, respectively. In some embodiments, the humanized TMPRSS 1 ID gene encodes a recombinant TMPRSS 1 ID protein comprising an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical in sequence) with the extracellular domain of the human TMPRSS 1 ID protein encoded by the human TMPRSS1 ID gene used in humanization. The recombinant TMPRSS1 ID protein contains, in some embodiments, an amino acid sequence at least 85% identical (e.g., at least 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence as set forth in SEQ ID NO:85. In some embodiments, a recombinant TMPRSS1 ID protein comprises an extracellular domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the amino acid sequence composed of residues A42-I418 or the C-terminal 377 amino acids of a human TMPRSS1 ID protein as set forth in, e.g., SEQ ID NO:85. In some embodiments, the humanized TMPRSS1 ID gene encodes a recombinant TMPRSS1 ID protein further comprising a cytoplasmic and transmembrane portion that is substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99% or 100% identical) with the cytoplasmic and transmembrane portion of the non-human animal TMPRSS1 ID protein encoded by the endogenous rodent TMPRSS1 ID gene being humanized. An exemplary non-human animal, e.g., rodent, TMPRSS1 ID protein is set forth in SEQ ID NO:83.

[0031] Also provided herein are methods of making a non-human animal, non-human animal cell, or non-human animal genome comprising humanized endogenous ACE2 and TMPRSS loci. In some embodiments, a method comprises (A) generating a genetically modified non-human embryonic stem (ES) cell comprising (I) humanized ACE2 gene encoding a recombinant ACE2 protein that comprises in operable linkage: (a) an ACE2 signal sequence of a non-human animal ACE2 protein or an ACE2 signal sequence of a human ACE2 protein, (b) an extracellular domain substantially identical to a human ACE2 protein, (c) a transmembrane domain of a non-human animal ACE2 protein or a transmembrane domain of a human ACE2 protein, and (d) a cytoplasmic domain of a non- human animal ACE2 protein or a cytoplasmic domain of a human ACE2 protein, and (II) one or more distinct and humanized TMPRSS genes, each encoding a distinct recombinant TMPRSS protein that comprises in operable likage: (a) a TMPRSS signal sequence of a non- human animal TMPRSS protein or TMPRSS signal sequence of a human TMPRSS protein, (b) an extracellular domain substantially identical to the extracellular domain of the human TMPRSS protein, and (c) a transmembrane domain of the non-human animal TMPRSS protein or a transmembrane domain of the human TMPRSS protein, and (d) a cytoplasmic domain of the non-human TMPRSS protein or a cytoplasmic domain of the human TMPRSS protein, wherein the non-human animal TMPRSS protein and the human TMPRSS protein are orthologous, (B) introducing the modified non-human animal ES cell into a host embryo of non-human animal to form a donor cell-non-human animal embryo complex; and (C) gestating the donor cell-non-human animal embryo in a surrogate non-human animal mother, wherein the surrogate non-human animal mother produces rodent progeny that express the humanized ACE2 and TMPRSS proteins. In some embodiments, generating the genetically modified non-human animal ES cell comprises: (i) obtaining a non-human ES cell that comprises the humanized ACE2 gene, and (ii) modifying the genome of the obtained non- human ES cell that comprises the humanized ACE2 gene to further comprise the one or more distinct humanized TMPRSS genes, wherein modifying the genome comprises replacing an endogenous nucleotide sequence encoding an extracellular domain of an endogenous TMPRSS protein with a nucleotide seuqence encoding a transmembrane domain substantially identical (e.g., at least 85%, 90%, 95%, 98%, 99%, or 100% identical in sequence) to the extracellular domain of a human TMPRSS protein that is orthologous to the endogenous TMPRSS protein. In some embodiments, generating the genetically modified non-human animal ES cell comprises: (i) obtaining a non-human ES cell that comprises the one or more distinct humanized TMPRSS genes, and (ii) modifying the genome of the obtained non- human ES cell that comprises the one or more distinct humanized TMPRSS genes to further comprise the humanized ACE2 gene.

[0032] In some embodiments, a non-human animal cell as described herein comprises a modified ACE2 locus encoding a recombinant ACE2 protein and a modified TMPRSS locus encoding a recombinant TMPRSS protein as described herein, but does not express the recombinant ACE2 protein and/or does not express the recombinant TMPRSS protein. In some embodiments, such a cell is a non-human animal embryonic stem (ES) cell, pluripotent cell, or a germ cell. Methods for making such cells are also described, e.g., in methods of making the non-human animals.

[0033] Also described herein are a non-human animal tissue or a composition comprising the non-human animal cells described herein. In some composition embodiments, the composition further comprises a spike protein of a coronavirus, wherein the spike protein binds the recombinant ACE2 protein and/or a therapeutic agent that inhibits, prevents, or reduces the likelihood of binding of an ACE2 ligand to the recombinant ACE2 protein, optionally wherein the ACE2 ligand comprises a spike protein of a coronavirus. In some embodiments, a therapeutic agent may be an antigen-binding protein that binds the spike protein of a coronavirus.

[0034] Also described herein is use of a non-human animal as a model of coronavirus infection, wherein the non-human animal as described herein is contacted, e.g., infected with, a coronavirus comprising a spike protein that binds to a human ACE2 protein. Also provided is a method of screening drug candidates that target a ligand of a human ACE2 protein, comprising: a) introducing into a genetically modified non-human animal as described herein a ligand of a human ACE2 protein; b) contacting the non-human animal with a drug candidate of interest, wherein the drug candidate is directed against the ligand of a human ACE2 protein; and c) determining whether the drug candidate is efficacious in preventing, reducing or eliminating binding of the ligand of a human ACE2 protein to the recombinant ACE2 protein. In some embodiments, the introducing comprises infecting the non-human animal with a coronavirus, wherein the coronavirus comprises a spike protein, and wherein the spike protein comprises the ligand of a human ACE2 protein, e.g., wherein the coronavirus is SARS-CoV-2.

[0035] In some embodiments, preventing, reducing or eliminating binding of SARS- CoV-2 to the recombinant ACE2 protein results in preventing, reducing, or eliminating one or more COVID-19 symptoms in the non-human animal.

[0036] Also disclosed is an ACE2-null mouse comprising an endogenous ACE2 locus comprising a deletion of a sequence encoding ACE2. In some embodiments, an ACE2-null mouse comprises an endogenous A CE2 locus as depicted in Figure 2B, e.g., comprising a deletion of 46,894bp of ACE2 on the X chromosome, encompassing 27bp of ACE2 5’ untranslated region (UTR) and the entire coding sequence with intervening introns, except for 65bp at the 3’ end of the last coding exon, optionally wherein the endogenous locus also comprises a C to T point mutation in the ACE2 3’ UTR. In some embodiments, an endogenous A CE2 locus of an ACE2-null mouse comprises a sequence set forth as SEQ ID NO:55. In some ACE2-null embodiments, a mouse as described herein is heterozygous for the endogenous locus comprising a deletion of a sequence encoding ACE2, e.g., for the endogenous locus depicted in Figure 2B, e.g., for an endogenous Icous comprising a deletion of 46,894bp of ACE2 on the X chromosome, encompassing 27bp of ACE2 5’ untranslated region (UTR) and the entire coding sequence with intervening introns, except for 65bp at the 3’ end of the last coding exon, optionally wherein the endogenous locus also comprises a C to T point mutation in the ACE2 3’ UTR. In some embodiments, an ACE2-null mouse as described herein is homozygous for the endogenous locus comprising a deletion of a sequence encoding ACE2, e.g., for the endogenous locus depicted in Figure 2B, e.g., for an endogenous Icous comprising a deletion of 46,894bp of ACE2 on the X chromosome, encompassing 27bp of ACE2 5’ untranslated region (UTR) and the entire coding sequence with intervening introns, except for 65bp at the 3’ end of the last coding exon, optionally wherein the endogenous locus also comprises a C to T point mutation in the ACE2 3’ UTR. Also provided are nucleic acids for making an ACE2-null mouse, e.g., a nucleic acid comprising a sequence set forth as SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, and/or SEQ ID NO: 54.

DESCRIPTION OF THE DRAWINGS

[0037] Figure 1A shows schematics (not-to-scale) of the human and mouse ACE2 loci. The untranslated and coding exons are represented by rectangles, coding sequences are indicated by the filled boxes, the untranslated regions (UTRs) are indicated by the unfilled boxes, and various accession numbers for non-limiting examples of ACE2 genes, along with the chromosomal locations, are indicated at the top of the figure. The asterisks indicate the locations of (A) the upstream (7878hTU) and downstream (7878hTD) primers for the gain- of-allele assay or (B) the upstream (7878mTU) and downstream (7878mTD) primers for the loss-of-allele assay. The fragment to be inserted into the mouse ACE2 locus for humanization is shown underneath the human ACE2 allele, and the fragment to be deleted from the mouse ACE2 locus is shown underneath the mouse ACE2 allele.

[0038] Figure IB shows a schematic (not-to-scale) of a humanized A CE2 allele (7878 allele) containing a neomycin resistance self-deleting cassette, which is depicted as the unfilled arrow. The mouse exonic sequences are indicated by the gray filled boxes, the human exonic sequences are indicated by the black filled boxes, and the mouse 5’ and 3’ untranslated regions (UTRs) are indicated by the white unfilled boxes. The sequences between the different mouse/human, human/cassette, cassette/human, and human/mouse junctions are indicated by the lines labeled “A,” “B,” “C,” and “D,” respectively. The sequences of these junctions are provided below in Table 1.

Table 1 :

[0039] Figure 1C shows a schematic (not-to-scale) of a cassette-deleted version of the humanized ACE2 allele in Figure IB. The mouse exonic sequences are indicated by the gray boxes, the human exons sequences are indicated by the black boxes, and the mouse 5’ and 3’ UTRs are indicated by the white boxes. The sequences between the different 5’ mouse/human junction, the deleted loxP and cloning site, and 3’ human/mouse junction are indicated by the lines labeled A, E, and D, respectively, at the bottom of the figure. The sequences of these junctions are provided below in Table 2.

Table 2

[0040] Figure 2A shows a schematic (not-to-scale) of the mouse ACE2 locus. Untranslated and coding exons are represented by rectangles, and coding sequences are indicated by the filled boxes, the untranslated regions (UTRs) are indicated by the unfilled boxes, and various accession numbers for non-limiting examples of a mouse ACE2 gene, along with the chromosomal locations, are indicated at the top of the figure. The asterisks indicate the locations of (A) upstream (7878mTU) and downstream (7878mTD) primers for the loss-of-allele assay and (B) upstream (90034metU and 90034metU2) and downstream (90034metD, 90034metD2, 90034metD3, and 90034metD4) primers for a retention assay. Also shown are the locations targeted by guide RNAs (mGU, mGU2, mGD, and mGD2) used to collapse the ACE2 gene and create a null allele (ACE2-null) shown in Figure 2B.

[0041] Figure 2B shows a schematic (not-to-scale) of a mouse ACE2-null allele. The remaining coding sequence is indicated by the filled box, the untranslated regions (UTRs) are indicated by the unfilled boxes and a C to T point mutation in the 3’UTR is also depicted.

[0042] Figure 3 shows an alignment of the mouse ACE2 protein (mACE2; SEQ ID NO:2), the human ACE2 protein (hACE2; SEQ ID NO:4), and the humanized mouse ACE2 protein (7878 final prot; SEQ ID NO:24). The dotted line above the alignment denotes the signal peptide (amino acids 1-17 of SEQ ID NO:24). The underscored residues are those encoded by the introduced human sequences (amino acids 20-740 of SEQ ID NO:24). The boxed residues constitute the transmembrane domain of the humanized mouse ACE2 protein (amino acids 741-761 of SEQ ID NO:24).

[0043] Figure 4 provides relative levels (y-axis) of mRNA transcripts isolated from the colon, duodenum, kidney, or liver isolated from mice comprising a knockout of ACE2 (ACE2-KO), wildtype mice (ACE2-WT), mice comprising an endogenous ACE2 locus modified to encode a humanized ACE2 protein (hACE2), or isolated from a human.

[0044] Figure 5 provides relative levels (y-axis) of mRNA transcripts isolated from colon, duodenum, kidney, liver, heart, lung, or trachea isolated from mice comprising a knockout of ACE2 (ACE2-KO), wildtype mice (ACE2-WT), mice comprising an endogenous ACE2 locus modified to encode a humanized ACE2 protein (hACE2), or isolated from a human.

[0045] Figure 6 provides immunohistochemistry images of duodenum (i) isolated from a neonate wild-type mouse (ACE2-WT neonate mouse 6), a mouse comprising a knockout of ACE2 (ACE2-null), and two mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein (7879 hACE2 Mouse) and (ii) stained with ACE2 antibodies that recognize mouse and human ACE2. Magnification is at 20x.

[0046] Figure 7 provides immunohistochemistry images of lung or trachea (i) isolated from a neonate wild-type mouse (ACE2-WT neonate mosue 6), a mouse comprising a knockout of ACE2 (ACE2-null), and a mouse comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein (7879 hACE2 Mouse) and (ii) stained with ACE2 antibodies that recognize mouse and human ACE2. Magnification is at 20x.

[0047] Figure 8 provides the percent starting weight (y-axis) of mice comprising an endogenous ACE2 locus modified to express a humanzed ACE2 protein and inoculated on day 0 with PBS (control), 10 2 PFU of SARS-CoV-2 isolate, WAI, 10 3 PFU of SARS-CoV-2 isolate, WAI, 10 4 PFU of SARS-CoV-2 isolate, WAI, or 10 5 PFU of SARS-CoV-2 isolate, WAI over 8 days (x-axis).

[0048] Figure 9 provides plaque forming units (PFU) per lung isolated from mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and infected with varying doses of SARS-CoV-2 (10E2, 10E3, 10E4, or 10E5; x-axis) 2 days (D2), 4 days (D4), or 7 days (D7) after infection.

[0049] Figure 10 provides the level of subgenomic SARS-CoV-2 expression (Fold SARS2 Genome; y-axis) found in the lungs isolated from control mice (Sham) or mice from mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and infected with varying doses of SARS-CoV-2 (10E2, 10E3, 10E4, or 10E5; x-axis) 2 days (D2) or 4 days (D4) after infection.

[0050] Figure 11 provides the percent starting weight (y-axis) of mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and prophylactially treated with a placebo, with 50 mg/kg, 5 mg/kg, or 0.5 mg/kg of a single anti-spike protein antibody (left panel) or with a combination of 25/25 mg/kg, 2.5/2.5 mg/kg, or .51.5 mg/kg of two anti-spke protein antibodies (right panel) 2 days before inoculation on day 0 with 10 5 PFU SARS-CoV-2 isolate.

[0051] Figure 12 provides plaque forming units (PFU) per lung isolated from mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and prophylactially treated with a placebo, with 50 mg/kg, 5 mg/kg, or 0.5 mg/kg of a single antispike protein antibody or with a combination of 25/25 mg/kg, 2.5/2.5 mg/kg, or .51.5 mg/kg of two anti-spke protein antibodies 2 days before inoculation with 10 5 PFU SARS-CoV-2 isolate. Lungs were isolated 2 days after inoculation. LOD = limits of detection

[0052] Figure 13 provides the level of subgenomic SARS-CoV-2 expression (Fold SARS2 Genome; y-axis) found in the lungs isolated from mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and prophylactially treated with a placebo, with 50 mg/kg, 5 mg/kg, or 0.5 mg/kg of a single anti-spike protein antibody or with a combination of 25/25 mg/kg, 2.5/2.5 mg/kg, or .51.5 mg/kg of two anti-spke protein antibodies 2 days before inoculation with 10 5 PFU SARS-CoV-2 isolate. Lungs were isolated 2 days after inoculation.

[0053] Figure 14 presents representative H&E images of blood vessel of the lungs of mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein that were either uninfected (left ) or infected with SARS-CoV-2 (right). Vascular lesions in the infected lungs include endothelial hyperplasia/hypertrophy, endothelial syncytia, and endothelialitis. Magnification is at 40x.

[0054] Figure 15 provides the pathology score (left panel) or plaque forming units (PFU) per lung (right panel) of mice comprising an endogenous ACE2 locus modified to express a humanized ACE2 protein and prophylactially treated with a placebo, with 50 mg/kg, 5 mg/kg, or 0.5 mg/kg of a single anti-spike protein antibody or with a combination of 25/25 mg/kg, 2.5/2.5 mg/kg, or .51.5 mg/kg of two anti-spke protein antibodies 2 days before inoculation with 10 5 PFU SARS-CoV-2 isolate. Lungs were isolated 2 days after inoculation. [0055] Figure 16A shows a diagram, not to scale, of the genomic organization of mouse TMPRSS2 and human TMPRSS2 genes. Exons are represented by thin bars placed across the genomic sequences, with the first coding exon for both genes indicated by the start codon "ATG" above the exon, and the last coding exon indicated by the "Stop" codon above the exon. A mouse genomic fragment of about 25,291 bp to be deleted and a human genomic fragment of about 25,091 bp to be inserted are indicated. Locations of probes used in an assay described in Example 4 are indicated. TM: transmembrane domain; SRCR: scavenger receptor cysteine-rich like domain; LDLRa: low density lipoprotein receptor class A.

[0056] Figure 16B illustrates, not to scale, an exemplary modified BAC vector for humanization of an endogenous mouse TMPRSS2 gene, along with the junction sequences (SEQ ID NOS:89, 90 and 91). [0057] Figure 16C illustrates, not to scale, a humanized TMPRSS2 allele after the neomycin cassette has been deleted, along with the junction sequences (SEQ ID NOS:89 and 92).

[0058] Figure 16D sets forth a sequence alignment of a human TMPRSS2 protein (SEQ ID NO:71), a mouse TMPRSS2 protein (SEQ ID NO:69), and a humanized TMPRSS2 protein ("7010 mutant pro") (SEQ ID NO:74).

[0059] Figure 17A shows a diagram, not to scale, of the genomic organization of mouse TMPRSS4 and human TMPRSS4 genes. Exons are represented by thin bars placed across the genomic sequences, with the first exon (also the first coding exon) for both genes indicated by the start codon " ATG" above the exon, and the last coding exon indicated by the "Stop" codon above the exon. The mouse genomic fragment of about 11,074 bp to be deleted and the human genomic fragment of about 14,963 bp to be inserted are indicated. Locations of probes used in an assay described in Example 5 are indicated. TM: transmembrane domain; SRCR: scavenger receptor cysteine-rich like domain; LDLRa: low density lipoprotein receptor class A.

[0060] Figure 17B illustrates, not to scale, an exemplary modified BAC vector for humanization of an endogenous mouse TMPRSS4 gene, along with the junction sequences (SEQ ID NOS: 105, 106 and 107).

[0061] Figure 17C illustrates, not to scale, a humanized TMPRSS4 allele after the neomycin cassette has been deleted, along with the junction sequences (SEQ ID NOS: 107 and 108).

[0062] Figure 17D sets forth a sequence alignment of a human TMPRSS4 protein (SEQ ID NO: 78), a mouse TMPRSS4 protein (SEQ ID NO:76), and a humanized TMPRSS4 protein ("7224 mutant pro") (SEQ ID NO:81).

[0063] Figure 18A shows a diagram, not to scale, of the genomic organization of mouse TMPRSS1 Id and human TMPRSS1 ID genes. Exons are represented by thin bars placed across the genomic sequences, with the first exon (also the first codon exon) for both genes indicated by the start codon "ATG" above the exon, and the last coding exon indicated by the "Stop" codon above the exon. A mouse genomic fragment of about 35,667 bp to be deleted and a human genomic fragment of about 33,927 bp to be inserted are indicated. Locations of probes used in an assay described in Example 6 are indicated. TM: transmembrane domain; SEA: domain found in sea urchin sperm protein, enterokinase and agnn. [0064] Figure 18B illustrates, not to scale, an exemplary modified BAC vector for humanization of an endogenous mouse TMPRSS1 Id gene, along with the junction sequences (SEQ ID NOS: 124, 125 and 126).

[0065] Figure 18C illustrates, not to scale, a humanized TMPRSS11 allele after the neomycin cassette has been deleted, along with the junction sequences (SEQ ID NOS: 124 and 127).

[0066] Figure 18D sets forth a sequence alignment of a human TMPRSS1 ID protein (SEQ ID NO:85), a mouse TMPRSS1 Id protein (SEQ ID NO:83), and a humanized TMPRSS1 Id protein ("7226 mutant pro") (SEQ ID NO:88).

[0067] Figure 19 provides relative levels (y-axis) of mRNA transcripts isolated from the colon, duodenum, kidney, liver, heart, lung, or trachea isolated from mice comprising a knockout of ACE2 (ACE2-KO), mice comprising an endogenous ACE2 locus modified to encode a recombinant ACE2 protein (hACE2), or mice comprising an endogenous ACE2 locus modified to encode a recombinant ACE2 protein (hACE2) and an endogenous TMPRSS2 locus modified to encode a humanized TMPRSS2 protein (hTMPRSS2).

DESCRIPTION

[0068] The terms “protein,” “polypeptide,” and “peptide,” used interchangeably herein, include polymeric forms of amino acids of any length, including coded and non-coded amino acids and chemically or biochemically modified or derivatized amino acids. The terms also include polymers that have been modified, such as polypeptides having modified peptide backbones. The term “domain” refers to any part of a protein or polypeptide having a particular function or structure.

[0069] Proteins are said to have an “N-terminus” and a “C-terminus.” The term “N- terminus” relates to the start of a protein or polypeptide, terminated by an amino acid with a free amine group (-NH2). The term “C-terminus” relates to the end of an amino acid chain (protein or polypeptide), terminated by a free carboxyl group (-COOH).

[0070] The terms “nucleic acid” and “polynucleotide,” used interchangeably herein, include polymeric forms of nucleotides of any length, including ribonucleotides, deoxyribonucleotides, or analogs or modified versions thereof. They include single-, double- , and multi -stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, and polymers comprising purine bases, pyrimidine bases, or other natural, chemically modified, biochemically modified, non-natural, or derivatized nucleotide bases. [0071] Nucleic acids are said to have “5’ ends” and “3’ ends” because mononucleotides are reacted to make oligonucleotides in a manner such that the 5’ phosphate of one mononucleotide pentose ring is attached to the 3’ oxygen of its neighbor in one direction via a phosphodiester linkage. An end of an oligonucleotide is referred to as the “5’ end” if its 5’ phosphate is not linked to the 3’ oxygen of a mononucleotide pentose ring. An end of an oligonucleotide is referred to as the “3’ end” if its 3’ oxygen is not linked to a 5’ phosphate of another mononucleotide pentose ring. A nucleic acid sequence, even if internal to a larger oligonucleotide, also may be said to have 5’ and 3’ ends. In either a linear or circular DNA molecule, discrete elements are referred to as being “upstream” or 5’ of the “downstream” or 3’ elements.

[0072] The term “genomically integrated” refers to a nucleic acid that has been introduced into a cell such that the nucleotide sequence integrates into the genome of the cell and is capable of being inherited by progeny thereof. Any protocol may be used for the stable incorporation of a nucleic acid into the genome of a cell.

[0073] The term “targeting vector” refers to a recombinant nucleic acid that can be introduced by homologous recombination, non-homologous-end-joining-mediated ligation, or any other means of recombination to a target position in the genome of a cell.

[0074] The term “wild-type” includes entities having a structure and/or activity as found in a normal (as contrasted with mutant, diseased, altered, or so forth) state or context. Wild-type genes and polypeptides often exist in multiple different forms (e.g., alleles).

[0075] The term “endogenous” refers to a nucleic acid sequence that occurs naturally within a cell or non-human animal. For example, an endogenous ACE2 sequence of a non-human animal refers to a native ACE2 sequence that naturally occurs at the ACE2 locus in the non-human animal.

[0076] “Exogenous” molecules or sequences include molecules or sequences that are not normally present in a cell in that form. Normal presence includes presence with respect to the particular developmental stage and environmental conditions of the cell. An exogenous molecule or sequence, for example, can include a mutated version of a corresponding endogenous sequence within the cell, such as a humanized version of the endogenous sequence, or can include a sequence corresponding to an endogenous sequence within the cell but in a different form (i.e., not within a chromosome). In contrast, endogenous molecules or sequences include molecules or sequences that are normally present in that form in a particular cell at a particular developmental stage under particular environmental conditions. [0077] The term “heterologous” when used in the context of a nucleic acid or a protein indicates that the nucleic acid or protein comprises at least two portions that do not naturally occur together in the same molecule. For example, the term “heterologous,” when used with reference to portions of a nucleic acid or portions of a protein, indicates that the nucleic acid or protein comprises two or more sub-sequences that are not found in the same relationship to each other (e.g., joined together) in nature. As one example, a “heterologous” region of a nucleic acid vector is a segment of nucleic acid within or attached to another nucleic acid molecule that is not found in association with the other molecule in nature. For example, a heterologous region of a nucleic acid vector could include a coding sequence flanked by sequences not found in association with the coding sequence in nature. Likewise, a “heterologous” region of a protein is a segment of amino acids within or attached to another peptide molecule that is not found in association with the other peptide molecule in nature (e.g., a fusion protein, or a protein with a tag). Similarly, a nucleic acid or protein can comprise a heterologous label or a heterologous secretion or localization sequence.

[0078] The term “orthologous” when used in the context of proteins or genes encoding proteins refer to those proteins or genes that diverge after a speciation event, but the gene and protein and their respective functions are conserved. As non-limiting examples, a non-human animal (e.g., rodent, e.g., rat or mouse) TMPRSS2 gene and the encoded non-human animal (e.g., rodent, e.g., rat or mouse) TMPRSS2 protein are orthologous to a human TMPRSS2 gene and the encodedhuman TMPRSS2 protein, respectively. Similarly, a non-human animal (e.g., rodent, e.g., rat or mouse) TMPRSS4 gene and the encoded non-human animal (e.g., rodent, e.g., rat or mouse) TMPRSS4 protein are orthologous to a human TMPRSS4 gene and the encoded human TMPRSS4 protein, respectively. In another non-limiting example, a non-human animal (e.g., rodent, e.g., rat or mouse) TMPRSS11 gene and the encoded non- human animal (e.g., rodent, e.g., rat or mouse) TMPRSS11 protein are orthologous to a human TMPRSS11 gene and the encoded human TMPRSS11 protein, respectively [0079] Codon optimization” takes advantage of the degeneracy of codons, as exhibited by the multiplicity of three-base pair codon combinations that specify an amino acid, and generally includes a process of modifying a nucleic acid sequence for enhanced expression in particular host cells by replacing at least one codon of the native sequence with a codon that is more frequently or most frequently used in the genes of the host cell while maintaining the native amino acid sequence. For example, a nucleic acid encoding a prokaryotic protein (i.e., a protein naturally expressed in a prokaryotic cell) can be modified to substitute codons having a higher frequency of usage in a given prokaryotic or eukaryotic cell, including a bacterial cell, a yeast cell, a human cell, a non-human cell, a mammalian cell, a rodent cell, a mouse cell, a rat cell, a hamster cell, or any other host cell, as compared to the naturally occurring nucleic acid sequence. Codon usage tables are readily available, for example, at the “Codon Usage Database.” These tables can be adapted in a number of ways. See Nakamura et al. (2000) Nucleic Acids Research 28:292, herein incorporated by reference in its entirety for all purposes. Computer algorithms for codon optimization of a particular sequence for expression in a particular host are also available (see, e.g., Gene Forge).

[0080] The term “locus” refers to a specific location of a gene (or significant sequence), DNA sequence, polypeptide-encoding sequence, or position on a chromosome of the genome of an organism. For example, an “ACE2 locus” may refer to the specific location of an ACE2 gene, ACE2 DNA sequence, ACE2-encoding sequence, or ACE2 position on a chromosome of the genome of an organism that has been identified as to where such a sequence resides. An “ACE2 locus” may comprise a regulatory element of an ACE2 gene, including, for example, an enhancer, a promoter, 5’ and/or 3’ untranslated region (UTR), or a combination thereof.

[0081] The term “gene” refers to a DNA sequence in a chromosome that codes for a product (e.g., an RNA product and/or a polypeptide product) and includes the coding region interrupted with non-coding introns and sequence located adjacent to the coding region on both the 5’ and 3’ ends such that the gene corresponds to the full-length mRNA (including the 5’ and 3’ untranslated sequences). The term “gene” also includes other non-coding sequences including regulatory sequences (e.g., promoters, enhancers, and transcription factor binding sites), polyadenylation signals, internal ribosome entry sites, silencers, insulating sequence, and matrix attachment regions. These sequences may be close to the coding region of the gene (e.g., within 10 kb) or at distant sites, and they influence the level or rate of transcription and translation of the gene.

[0082] The term “allele” refers to a variant form of a gene. Some genes have a variety of different forms, which are located at the same position, or genetic locus, on a chromosome. A diploid organism has two alleles at each genetic locus. Each pair of alleles represents the genotype of a specific genetic locus. Genotypes are described as homozygous if there are two identical alleles at a particular locus and as heterozygous if the two alleles differ.

[0083] A “promoter” is a regulatory region of DNA usually comprising a TATA box capable of directing RNA polymerase II to initiate RNA synthesis at the appropriate transcription initiation site for a particular polynucleotide sequence. A promoter may additionally comprise other regions which influence the transcription initiation rate. The promoter sequences disclosed herein modulate transcription of an operably linked polynucleotide. A promoter can be active in one or more of the cell types disclosed herein (e.g., a eukaryotic cell, a non-human mammalian cell, a human cell, a rodent cell, a pluripotent cell, a one-cell stage embryo, a differentiated cell, or a combination thereof). A promoter can be, for example, a constitutively active promoter, a conditional promoter, an inducible promoter, a temporally restricted promoter (e.g., a developmentally regulated promoter), or a spatially restricted promoter (e.g., a cell-specific or tissue-specific promoter). Examples of promoters can be found, for example, in WO 2013/176772, herein incorporated by reference in its entirety for all purposes.

[0084] “Operable linkage” or being “operably linked” includes juxtaposition of two or more components (e.g., a promoter and another sequence element) such that both components function normally and allow the possibility that at least one of the components can mediate a function that is exerted upon at least one of the other components. For example, a promoter can be operably linked to a coding sequence if the promoter controls the level of transcription of the coding sequence in response to the presence or absence of one or more transcriptional regulatory factors. Operable linkage can include such sequences being contiguous with each other or acting in trans (e.g., a regulatory sequence can act at a distance to control transcription of the coding sequence).

[0085] The term “variant” refers to a nucleotide sequence differing from the sequence most prevalent in a population (e.g., by one nucleotide) or a protein sequence different from the sequence most prevalent in a population (e.g., by one amino acid).

[0086] The term “fragment” when referring to a protein means a protein that is shorter or has fewer amino acids than the full-length protein. The term “fragment” when referring to a nucleic acid means a nucleic acid that is shorter or has fewer nucleotides than the full- length nucleic acid. A fragment can be, for example, an N-terminal fragment (i.e., removal of a portion of the C-terminal end of the protein), a C-terminal fragment (i.e., removal of a portion of the N-terminal end of the protein), or an internal fragment.

[0087] “Sequence identity” or “identity” in the context of two polynucleotides or polypeptide sequences makes reference to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window. When percentage of sequence identity is used in reference to proteins, residue positions which are not identical often differ by conservative amino acid substitutions, where amino acid residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule. When sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Sequences that differ by such conservative substitutions are said to have “sequence similarity” or “similarity.” Means for making this adjustment are well known. Typically, this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of 1 and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the program PC/GENE (Intelligenetics, Mountain View, California).

[0088] “Percentage of sequence identity” includes the value determined by comparing two optimally aligned sequences (greatest number of perfectly matched residues) over a comparison window, wherein the portion of the polynucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid base or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison, and multiplying the result by 100 to yield the percentage of sequence identity. Unless otherwise specified (e.g., the shorter sequence includes a linked heterologous sequence), the comparison window is the full length of the shorter of the two sequences being compared.

[0089] Unless otherwise stated, sequence identity/ similarity values include the value obtained using GAP Version 10 using the following parameters: % identity and % similarity for a nucleotide sequence using GAP Weight of 50 and Length Weight of 3, and the nwsgapdna.cmp scoring matrix; % identity and % similarity for an amino acid sequence using GAP Weight of 8 and Length Weight of 2, and the BLOSUM62 scoring matrix; or any equivalent program thereof. “Equivalent program” includes any sequence comparison program that, for any two sequences in question, generates an alignment having identical nucleotide or amino acid residue matches and an identical percent sequence identity when compared to the corresponding alignment generated by GAP Version 10.

[0090] The term “conservative amino acid substitution” refers to the substitution of an amino acid that is normally present in the sequence with a different amino acid of similar size, charge, or polarity. Examples of conservative substitutions include the substitution of a non-polar (hydrophobic) residue such as isoleucine, valine, or leucine for another non-polar residue. Likewise, examples of conservative substitutions include the substitution of one polar (hydrophilic) residue for another such as between arginine and lysine, between glutamine and asparagine, or between glycine and serine. Additionally, the substitution of a basic residue such as lysine, arginine, or histidine for another, or the substitution of one acidic residue such as aspartic acid or glutamic acid for another acidic residue are additional examples of conservative substitutions. Examples of non-conservative substitutions include the substitution of a non-polar (hydrophobic) amino acid residue such as isoleucine, valine, leucine, alanine, or methionine for a polar (hydrophilic) residue such as cysteine, glutamine, glutamic acid or lysine and/or a polar residue for a non-polar residue. Typical amino acid categorizations are summarized below.

Alanine Ala A Nonpolar Neutral 1.8

Arginine Arg R Polar Positive -4.5

Asparagine Asn N Polar Neutral -3.5

Aspartic acid Asp D Polar Negative -3.5

Cysteine Cys C Nonpolar Neutral 2.5

Glutamic acid Glu E Polar Negative -3.5

Glutamine Gin Q Polar Neutral -3.5

Glycine Gly G Nonpolar Neutral -0.4

Histidine His H Polar Positive -3.2

Isoleucine He I Nonpolar Neutral 4.5

Leucine Leu L Nonpolar Neutral 3.8

Lysine Lys K Polar Positive -3.9

Methionine Met M Nonpolar Neutral 1.9

Phenylalanine Phe F Nonpolar Neutral 2.8

Proline Pro P Nonpolar Neutral -1.6

Serine Ser S Polar Neutral -0.8

Threonine Thr T Polar Neutral -0.7

Tryptophan Trp W Nonpolar Neutral -0.9

Tyrosine Tyr Y Polar Neutral -1.3

Valine Vai V Nonpolar Neutral 4.2

[0091] A “homologous” sequence (e.g., nucleic acid sequence) includes a sequence that is either identical or substantially similar to a known reference sequence, such that it is, for example, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical to the known reference sequence. Homologous sequences can include, for example, orthologous sequence and paralogous sequences. Homologous genes, for example, typically descend from a common ancestral DNA sequence, either through a speciation event (orthologous genes) or a genetic duplication event (paralogous genes). “Orthologous” genes include genes in different species that evolved from a common ancestral gene by speciation. Orthologs typically retain the same function in the course of evolution. “Paralogous” genes include genes related by duplication within a genome. Paralogs can evolve new functions in the course of evolution.

[0092] The term “in vitro” includes artificial environments and to processes or reactions that occur within an artificial environment (e.g., a test tube). The term “in vivo” includes natural environments (e.g., a cell or organism or body) and to processes or reactions that occur within a natural environment. The term “ex vivo” includes cells that have been removed from the body of an individual and to processes or reactions that occur within such cells.

[0093] The term “reporter gene” refers to a nucleic acid having a sequence encoding a gene product (typically an enzyme) that is easily and quantifiably assayed when a construct comprising the reporter gene sequence operably linked to a heterologous promoter and/or enhancer element is introduced into cells containing (or which can be made to contain) the factors necessary for the activation of the promoter and/or enhancer elements. Examples of reporter genes include, but are not limited, to genes encoding beta-galactosidase (lacZ), the bacterial chloramphenicol acetyltransferase (cat) genes, firefly luciferase genes, genes encoding beta-glucuronidase (GUS), and genes encoding fluorescent proteins. A “reporter protein” refers to a protein encoded by a reporter gene.

[0094] The term “fluorescent reporter protein” as used herein means a reporter protein that is detectable based on fluorescence wherein the fluorescence may be either from the reporter protein directly, activity of the reporter protein on a fluorogenic substrate, or a protein with affinity for binding to a fluorescent tagged compound. Examples of fluorescent proteins include green fluorescent proteins (e.g., GFP, GFP-2, tagGFP, turboGFP, eGFP, Emerald, Azami Green, Monomeric Azami Green, CopGFP, AceGFP, and ZsGreenl), yellow fluorescent proteins (e.g., YFP, eYFP, Citrine, Venus, YPet, PhiYFP, and ZsYellowl), blue fluorescent proteins (e.g., BFP, eBFP, eBFP2, Azurite, mKalamal, GFPuv, Sapphire, and T- sapphire), cyan fluorescent proteins (e.g., CFP, eCFP, Cerulean, CyPet, AmCyanl, and Midoriishi-Cyan), red fluorescent proteins (e.g., RFP, mKate, mKate2, mPlum, DsRed monomer, mCherry, mRFPl, DsRed-Express, DsRed2, DsRed-Monomer, HcRed-Tandem, HcRedl, AsRed2, eqFP611, mRaspberry, mStrawberry, and Jred), orange fluorescent proteins (e.g., mOrange, mKO, Kusabira-Orange, Monomeric Kusabira-Orange, mTangerine, and tdTomato), and any other suitable fluorescent protein whose presence in cells can be detected by flow cytometry methods.

[0095] The term “recombination” includes any process of exchange of genetic information between two polynucleotides and can occur by any mechanism. Recombination in response to double-strand breaks (DSBs) occurs principally through two conserved DNA repair pathways: non-homologous end joining (NHEJ) and homologous recombination (HR). See Kasparek & Humphrey (2011) Seminars in Cell & Dev. Biol. 22:886-897, herein incorporated by reference in its entirety for all purposes. Likewise, repair of a target nucleic acid mediated by an exogenous donor nucleic acid can include any process of exchange of genetic information between the two polynucleotides.

[0096] NHEJ includes the repair of double-strand breaks in a nucleic acid by direct ligation of the break ends to one another or to an exogenous sequence without the need for a homologous template. Ligation of non-contiguous sequences by NHEJ can often result in deletions, insertions, or translocations near the site of the double-strand break. For example, NHEJ can also result in the targeted integration of an exogenous donor nucleic acid through direct ligation of the break ends with the ends of the exogenous donor nucleic acid (i.e., NHEJ-based capture). Such NHEJ-mediated targeted integration can be preferred for insertion of an exogenous donor nucleic acid when homology directed repair (HDR) pathways are not readily usable (e.g., in non-dividing cells, primary cells, and cells which perform homology-based DNA repair poorly). In addition, in contrast to HDR, knowledge concerning large regions of sequence identity flanking the cleavage site is not needed, which can be beneficial when attempting targeted insertion into organisms that have genomes for which there is limited knowledge of the genomic sequence. The integration can proceed via ligation of blunt ends between the exogenous donor nucleic acid and the cleaved genomic sequence, or via ligation of sticky ends (i.e., having 5’ or 3’ overhangs) using an exogenous donor nucleic acid that is flanked by overhangs that are compatible with those generated by a nuclease agent in the cleaved genomic sequence. See, e.g., US 2011/020722, WO 2014/033644, WO 2014/089290, and Maresca et al. (2013) Genome Res. 23(3):539-546, each of which is herein incorporated by reference in its entirety for all purposes. If blunt ends are ligated, target and/or donor resection may be needed to generation regions of microhomology needed for fragment joining, which may create unwanted alterations in the target sequence.

[0097] Recombination can also occur via homology directed repair (HDR) or homologous recombination (HR). HDR or HR includes a form of nucleic acid repair that can require nucleotide sequence homology, uses a “donor” molecule as a template for repair of a “target” molecule (i.e., the one that experienced the double-strand break), and leads to transfer of genetic information from the donor to target. Without wishing to be bound by any particular theory, such transfer can involve mismatch correction of heteroduplex DNA that forms between the broken target and the donor, and/or synthesis-dependent strand annealing, in which the donor is used to resynthesize genetic information that will become part of the target, and/or related processes. In some cases, the donor polynucleotide, a portion of the donor polynucleotide, a copy of the donor polynucleotide, or a portion of a copy of the donor polynucleotide integrates into the target DNA. See Wang et al. (2013) Cell 153:910-918; Mandalos et al. (2012) PLOS ONE 7:e45768: l-9; and Wang et al. (2013) Nat Biotechnol. 31 :530-532, each of which is herein incorporated by reference in its entirety for all purposes. [0098] The term “antigen-binding protein” includes any protein that binds to an antigen. Examples of antigen-binding proteins include an antibody, an antigen-binding fragment of an antibody, a multispecific antibody (e.g., a bi-specific antibody), an scFV, a bis-scFV, a diabody, a triabody, a tetrabody, a V-NAR, a VHH, a VL, a F(ab), a F(ab)2, a DVD (dual variable domain antigen-binding protein), an SVD (single variable domain antigen-binding protein), a bispecific T-cell engager (BiTE), or a Davisbody (US Pat. No. 8,586,713, herein incorporated by reference herein in its entirety for all purposes).

[0099] The term “multi-specific” or “bi-specific” with reference to an antigen-binding protein means that the protein recognizes different epitopes, either on the same antigen or on different antigens. A multi-specific antigen-binding protein can be a single multifunctional polypeptide, or it can be a multimeric complex of two or more polypeptides that are covalently or non-covalently associated with one another. For example, an antibody or fragment thereof can be functionally linked (e.g., by chemical coupling, genetic fusion, non- covalent association or otherwise) to one or more other molecular entities, such as a protein or fragment thereof to produce a bi-specific or a multi-specific antigen-binding molecule with a second binding specificity.

[00100] The term “antigen” refers to a substance, whether an entire molecule or a domain within a molecule, which is capable of eliciting production of antibodies with binding specificity to that substance. The term antigen also includes substances, which in wild-type host organisms would not elicit antibody production by virtue of self-recognition, but can elicit such a response in a host animal with appropriate genetic engineering to break immunological tolerance. [00101] The term “epitope” refers to a site on an antigen to which an antigen-binding protein (e.g., antibody) binds. An epitope can be formed from contiguous amino acids or noncontiguous amino acids juxtaposed by tertiary folding of one or more proteins. Epitopes formed from contiguous amino acids (also known as linear epitopes) are typically retained on exposure to denaturing solvents whereas epitopes formed by tertiary folding (also known as conformational epitopes) are typically lost on treatment with denaturing solvents. An epitope typically includes at least 3, and more usually, at least 5 or 8-10 amino acids in a unique spatial conformation. Methods of determining spatial conformation of epitopes include, for example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. See, e.g., Epitope Mapping Protocols, in Methods in Molecular Biology, Vol. 66, Glenn E. Morris, Ed. (1996), herein incorporated by reference in its entirety for all purposes.

[00102] An “antibody paratope” as described herein generally comprises at a minimum a complementarity determining region (CDR) that specifically recognizes the heterologous epitope (e.g., a CDR3 region of a heavy and/or light chain variable domain).

[00103] The term “antibody” includes immunoglobulin molecules comprising four polypeptide chains, two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds. Each heavy chain comprises a heavy chain variable domain and a heavy chain constant region (CH). The heavy chain constant region comprises three domains: CHI, CH2 and CH3. Each light chain comprises a light chain variable domain and a light chain constant region (CL). The heavy chain and light chain variable domains can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each heavy and light chain variable domain comprises three CDRs and four FRs, arranged from amino-terminus to carboxy -terminus in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4 (heavy chain CDRs may be abbreviated as HCDR1, HCDR2 and HCDR3; light chain CDRs may be abbreviated as LCDR1, LCDR2 and LCDR3). The term “high affinity” antibody refers to an antibody that has a KD with respect to its target epitope about of 10-9 M or lower (e.g., about 1 x 10-9 M, 1 x 10-10 M, 1 x 10-11 M, or about 1 x 10-12 M). In one embodiment, KD is measured by surface plasmon resonance, e.g., BIACORE™; in another embodiment, KD is measured by ELISA.

[00104] The term “bi-specific antibody” includes an antibody capable of selectively binding two or more epitopes. Bi-specific antibodies generally comprise two different heavy chains, with each heavy chain specifically binding a different epitope — either on two different molecules (e.g., on two different antigens) or on the same molecule (e.g., on the same antigen). If a bi-specific antibody is capable of selectively binding two different epitopes (a first epitope and a second epitope), the affinity of the first heavy chain for the first epitope will generally be at least one to two or three or four orders of magnitude lower than the affinity of the first heavy chain for the second epitope, and vice versa. The epitopes recognized by the bi-specific antibody can be on the same or a different target (e.g., on the same or a different protein). Bi-specific antibodies can be made, for example, by combining heavy chains that recognize different epitopes of the same antigen. For example, nucleic acid sequences encoding heavy chain variable sequences that recognize different epitopes of the same antigen can be fused to nucleic acid sequences encoding different heavy chain constant regions, and such sequences can be expressed in a cell that expresses an immunoglobulin light chain. A typical bi-specific antibody has two heavy chains each having three heavy chain CDRs, followed by (N-terminal to C-terminal) a CHI domain, a hinge, a CH2 domain, and a CH3 domain, and an immunoglobulin light chain that either does not confer antigenbinding specificity but that can associate with each heavy chain, or that can associate with each heavy chain and that can bind one or more of the epitopes bound by the heavy chain antigen-binding regions, or that can associate with each heavy chain and enable binding or one or both of the heavy chains to one or both epitopes.

[00105] The term “heavy chain,” or “immunoglobulin heavy chain” includes an immunoglobulin heavy chain sequence, including immunoglobulin heavy chain constant region sequence, from any organism. Heavy chain variable domains include three heavy chain CDRs and four FR regions, unless otherwise specified. Fragments of heavy chains include CDRs, CDRs and FRs, and combinations thereof. A typical heavy chain has, following the variable domain (from N-terminal to C-terminal), a CHI domain, a hinge, a CH2 domain, and a CH3 domain. A functional fragment of a heavy chain includes a fragment that is capable of specifically recognizing an epitope (e.g., recognizing the epitope with a KD in the micromolar, nanomolar, or picomolar range), that is capable of expressing and secreting from a cell, and that comprises at least one CDR. Heavy chain variable domains are encoded by variable region nucleotide sequences, which generally comprise VH, DH, and JH segments derived from a repertoire of VH, DH, and JH segments present in the germline. Sequences, locations and nomenclature for V, D, and J heavy chain segments for various organisms can be found in IMGT database, which is accessible via the internet on the World Wide Web (www) at the URL “imgt.org.”

[00106] The term “light chain” includes an immunoglobulin light chain sequence from any organism, and unless otherwise specified includes human kappa (K) and lambda ( ) light chains and a VpreB, as well as surrogate light chains. Light chain variable domains typically include three light chain CDRs and four framework (FR) regions, unless otherwise specified. Generally, a full-length light chain includes, from amino terminus to carboxyl terminus, a variable domain that includes FR1-CDR1-FR2-CDR2-FR3-CDR3-FR4, and a light chain constant region amino acid sequence. Light chain variable domains are encoded by light chain variable region nucleotide sequences, which generally comprise light chain VL and light chain JL gene segments, derived from a repertoire of light chain V and J gene segments present in the germline. Sequences, locations and nomenclature for light chain V and J gene segments for various organisms can be found in IMGT database, which is accessible via the internet on the World Wide Web (www) at the URL “imgt.org.” Light chains include those, e.g., that do not selectively bind either a first or a second epitope selectively bound by the epitope-binding protein in which they appear. Light chains also include those that bind and recognize, or assist the heavy chain with binding and recognizing, one or more epitopes selectively bound by the epitope-binding protein in which they appear.

[00107] The term “complementary determining region” or “CDR,” as used herein, includes an amino acid sequence encoded by a nucleic acid sequence of an organism’s immunoglobulin genes that normally (i.e., in a wild-type animal) appears between two framework regions in a variable region of a light or a heavy chain of an immunoglobulin molecule (e.g., an antibody or a T cell receptor). A CDR can be encoded by, for example, a germline sequence or a rearranged sequence, and, for example, by a naive or a mature B cell or a T cell. A CDR can be somatically mutated (e.g., vary from a sequence encoded in an animal’s germline), humanized, and/or modified with amino acid substitutions, additions, or deletions. In some circumstances (e.g., for a CDR3), CDRs can be encoded by two or more sequences (e.g., germline sequences) that are not contiguous (e.g., in an unrearranged nucleic acid sequence) but are contiguous in a B cell nucleic acid sequence, e.g., as a result of splicing or connecting the sequences (e.g., V-D-J recombination to form a heavy chain CDR3.

[00108] Specific binding of an antigen-binding protein to its target antigen includes binding with an affinity of at least 10 6 , 10 7 , 10 8 , 10 9 , or 10 10 M' 1 . Specific binding is detectably higher in magnitude and distinguishable from non-specific binding occurring to at least one unrelated target. Specific binding can be the result of formation of bonds between particular functional groups or particular spatial fit (e.g., lock and key type) whereas nonspecific binding is usually the result of van der Waals forces. Specific binding does not however necessarily imply that an antigen-binding protein binds one and only one target. [00109] Compositions or methods “comprising” or “including” one or more recited elements may include other elements not specifically recited. For example, a composition that “comprises” or “includes” a protein may contain the protein alone or in combination with other ingredients. The transitional phrase “consisting essentially of’ means that the scope of a claim is to be interpreted to encompass the specified elements recited in the claim and those that do not materially affect the basic and novel characteristic(s) of the claimed invention. Thus, the term “consisting essentially of’ when used in a claim of this invention is not intended to be interpreted to be equivalent to “comprising.”

[00110] “Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur and that the description includes instances in which the event or circumstance occurs and instances in which it does not.

[00111] Designation of a range of values includes all integers within or defining the range, and all subranges defined by integers within the range.

[00112] Unless otherwise apparent from the context, the term “about” encompasses values within a standard margin of error of measurement (e.g., SEM) of a stated value.

[00113] The term “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations when interpreted in the alternative (“or”).

[00114] The term “or” refers to any one member of a particular list and also includes any combination of members of that list.

[00115] The singular forms of the articles “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. For example, the term “a protein” or “at least one protein” can include a plurality of proteins, including mixtures thereof.

[00116] Statistically significant means p <0.05.

I. OVERVIEW

[00117] Disclosed herein are non-human animals and non-human animal cells comprising a humanized ACE2 locus and a humanized TMPRSS locus. Also described are methods of using such non-human animals and non-human animals cells. Non-human animals or non-human cells comprising a humanized ACE2 locus and a humanized TMPRSS locus respectively express a human ACE2 protein or a chimeric (e.g., humanized) ACE2 protein comprising one or more fragments of a human ACE2 protein (e.g., all or part of the human ACE2 extracellular domain) and a human TMPRSS protein or a chimeric (e.g., humanized) TMPRSS protein. Ligands binding human ACE2 often will not bind to orthologous non-human animal ACE2 proteins such as mouse ACE2 due to the sequence differences between human ACE2 and the non-human animal ACE2. For example, coronaviruses that infect cells expressing human ACE2 will often not recognize rodent ACE2. (Subbarao and Roberts (2006) TRENDS Microbiol. 14:299-303; McCray et al. (2007) J. Virol. 81 :813-821; Wan (2020) J. Virol. 94: 1-9; Sun et al. (2020) Cell Host & Microbe 28: 1-10). Because of this, the progression of human- ACE2 -mediated coronavirus infection or therapy thereof cannot be effectively assessed in wild-type non-human animals with unmodified endogenous (i.e., native) ACE2 loci.

[00118] As a binding partner for coronavirus, e.g., SARS-CoV-2, non-human animals expressing human or humanized ACE2 can be utilized for studying SARS-CoV-2 infection and associated diseases, e.g., COVID-19, and for determining the efficacy of therapies thereto. For example, in early 2020, efforts to identify effective measures against COVID-19 were in full swing.

II. NON-HUMAN ANIMALS AND NON-HUMAN ANIMAL CELLS COMPRISING HUMANIZED ACE2 AND IMPRSS LOCI

[00119] The cells and non-human animals disclosed herein comprise humanized ACE2 and IMPRSS, e.g., TMPRSS2, loci, and optionally express the respective humanized ACE2 and TMPRSS proteins.

A, Angiotensin-converting enzyme 2 (ACE2) and Humanized ACE2 Loci

[00120] The cells and non-human animals described herein comprise a humanized ACE2 locus. Angiotensin-converting enzyme 2 (ACE2; ACEH) is encoded by the ACE2 gene. ACE2 is part of the angiotensin-converting enzyme family of dipeptidyl carboxydipeptidases and has considerable homology to human angiotensin 1 converting enzyme. ACE2 is a cell surface expressed aminopeptidase that catalyzes the cleavage of angiotensin I into angiotensin 1-9, and angiotensin II into the vasodilator angiotensin 1-7. The organ- and cellspecific expression of this gene suggests that it may play a role in the regulation of cardiovascular and renal function, as well as fertility. In addition, the encoded protein is a functional receptor for the spike glycoprotein of the human coronavirus HCoV-NL63 and the human severe acute respiratory syndrome coronaviruses, SARS-CoV and SARS-CoV-2 (COVID- 19 virus).

[00121] Both the human and mouse ACE2 genes are located on chromosome X, and each gene comprises 19 untranslated and coding exons of which 18 exons contain coding sequences. Coding exon numbering used throughout excludes the 5’ non-coding exon. Accordingly, “coding exon 1” refers to the first exon comprising coding sequences and subsequent coding exons are numbered accordingly. As such, and in connection with coding exons, the first intron following coding exon 1 may be referred to herein as intron 1.

[00122] An exemplary coding sequence for human ACE2 is assigned NCBI Accession Number NM_021804.3. An exemplary coding sequence for mouse ACE2 is assigned NCBI Accession Number NM_001130513.1. An exemplary human ACE2 protein is assigned UniProt Accession No. Q9BYF1-1. An exemplary mouse ACE2 protein is assigned UniProt Accession No. Q8R0I0-1. An exemplary humanized human/mouse ACE2 protein is set forth in SEQ ID NO:24, which comprises in operable linkage a mouse ACE2 signal peptide (SEQ ID NO:26), a human ACE2 extracellular domain (SEQ ID NO:27), a mouse ACE2 transmembrane domain (SEQ ID NO:28), and a mouse ACE2 cytoplasmic domain (SEQ ID NO:28).

[00123] A humanized ACE2 locus can be an ACE2 locus in which the entire ACE2 gene is replaced with the corresponding orthologous human ACE2 sequence, or it can be an ACE2 locus in which only a portion of the ACE2 gene is replaced with the corresponding orthologous human ACE2 sequence (i.e., humanized). Optionally, the corresponding orthologous human ACE2 sequence is modified to be codon-optimized based on codon usage in the non-human animal. Replaced (i.e., humanized) regions can include coding regions such as an exon, non-coding regions such as an intron, an untranslated region, or a regulatory region (e.g., a promoter, an enhancer, or a transcriptional repressor-binding element), or any combination thereof. As one example, exons corresponding to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or all 19 exons of the ACE2 gene are humanized. Likewise, introns corresponding to 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 16, or all 18 introns of the human ACE2 gene can be humanized. For example, the coding region of a non-human (e.g., rodent, e.g., rat or mouse) ACE2 gene could be replaced with the corresponding orthologous human ACE2 region. Alternatively, a region of ACE2 encoding an extracellular domain may be humanized. For example, at a non-human animal endogenous A CE2 locus, coding sequences starting in exon 2 (also referred to as coding exon 1) (e.g., after an endogenous ACE2 signal peptide coding region) to exon 18 (also referred to herein as coding exon 17) (e.g., up to an endogenous transmembrane domain coding region) may be replaced with corresponding human ACE2 coding sequences. Likewise, endogenous non-human ACE2 introns corresponding to coding introns 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, and 16 of the human ACE2 can be humanized. Flanking untranslated regions including regulatory sequences can also be humanized. For example, the 5’ untranslated region (UTR), the 3’UTR, or both the 5’ UTR and the 3’ UTR can be humanized, or the 5’ UTR, the 3 ’UTR, or both the 5’ UTR and the 3’ UTR can remain endogenous. In one specific example, the 3’ UTR is humanized, but the 5’ UTR remains endogenous. Depending on the extent of replacement by orthologous sequences, regulatory sequences, such as a promoter, can be endogenous or supplied by the replacing human orthologous sequence. For example, the humanized ACE2 locus can include the endogenous non-human animal ACE2 promoter. [00124] The ACE2 protein encoded by the humanized ACE2 locus can comprise one or more domains that are from a human ACE2 protein. For example, the ACE2 protein can comprise one or more or all of a human ACE2 extracellular domain, a human ACE2 transmembrane domain, and a human ACE2 cytoplasmic domain. As one example, the ACE2 protein can comprise only a human ACE2 extracellular domain. Optionally, the ACE2 protein encoded by the humanized A CE2 locus may also comprise one or more domains that are from the endogenous (i.e., native) non-human animal ACE2 protein. As one example, the ACE2 protein encoded by the humanized ACE2 locus can comprise an extracellular domain from a human ACE2 protein, a transmembrane domain from the endogenous (i.e., native) non-human animal ACE2 protein, and an N-terminal cytoplasmic domain from the endogenous (i.e., native) non-human animal ACE2 protein. Domains from a human ACE2 protein can be encoded by a fully humanized sequence (i.e., the entire sequence encoding that domain is replaced with the orthologous human ACE2 sequence) or can be encoded by a partially humanized sequence (i.e., some of the sequence encoding that domain is replaced with the orthologous human ACE2 sequence, and the remaining endogenous (i.e., native) sequence encoding that domain encodes the same amino acids as the orthologous human ACE2 sequence such that the encoded domain is identical to that domain in the human ACE2 protein.

[00125] As one example, the ACE2 protein encoded by the humanized A CE2 locus can comprise a human ACE2 extracellular domain. Optionally, the human ACE2 extracellular domain comprises, consists essentially of, or consists of a sequence that is at least 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO:27 and the ACE2 protein retains the activity of an ACE2 protein (e.g., retains the ability to catalyze the cleavage of angiotensin I into angiotensin 1-9, and angiotensin II into the vasodilator angiotensin 1-7, permit coronavirus (e.g., SARS-CoV-2) infection, etc.). The ACE2 protein encoded by the humanized ACE2 locus may comprise an endogenous non-human animal ACE2 transmembrane domain (e.g., a mouse ACE2 transmembrane domain). Optionally, the non- human animal ACE2 transmembrane domain comprises, consists essentially of, or consists of a sequence that is at least 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 28 and the ACE2 protein retains the activity of the native ACE2. For example, the ACE2 protein encoded by the humanized ACE2 locus can comprise, consist essentially of, or consist of a sequence that is at least 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 29 and the ACE2 protein retains the activity of the native ACE2. [00126] In some embodiments, a non-human animal disclosed herein contains a humanized ACE2 gene in the genome that includes a nucleotide sequence of a human ACE2 gene and a nucleotide sequence of an endogenous ACE2 gene, wherein the nucleotide sequence of the human ACE2 gene and the nucleotide sequence of the endogenous ACE2 gene are operably linked to each other such that the humanized ACE2 gene encodes an ACE2 protein and is under control of a 5’ regulatory element(s), such as the promoter and/or enhancer(s), of the endogenous rodent ACE2 gene.

[00127] Optionally, a humanized ACE2 gene can comprise other elements. Examples of such elements can include selection cassettes, reporter genes, recombinase recognition sites, or other elements. Alternatively, the humanized ACE2 locus can lack other elements (e.g., can lack a selection marker or selection cassette). Examples of suitable reporter genes and reporter proteins are disclosed elsewhere herein. Examples of suitable selection markers include neomycin phosphotransferase (neor), hygromycin B phosphotransferase (hygr), puromycin-N-acetyltransferase (puror), blasticidin S deaminase (bsr r ), xanthine/guanine phosphoribosyl transferase (gpt), and herpes simplex virus thymidine kinase (HSV-k). Examples of recombinases include Cre, Flp, and Dre recombinases. One example of a Cre recombinase gene is Crei, in which two exons encoding the Cre recombinase are separated by an intron to prevent its expression in a prokaryotic cell. Such recombinases can further comprise a nuclear localization signal to facilitate localization to the nucleus (e.g., NLS- Crei). Recombinase recognition sites include nucleotide sequences that are recognized by a site-specific recombinase and can serve as a substrate for a recombination event. Examples of recombinase recognition sites include FRT, FRT11, FRT71, attp, att, rox, and lox sites such as loxP, lox511, lox2272, lox66, lox71, loxM2, and lox5171.

[00128] Other elements such as reporter genes or selection cassettes can be self-deleting cassettes flanked by recombinase recognition sites. See, e.g., US 8,697,851 and US 2013/0312129, each of which is herein incorporated by reference in its entirety for all purposes. As an example, the self-deleting cassette can comprise a Crei gene (comprises two exons encoding a Cre recombinase, which are separated by an intron) operably linked to a mouse Prml promoter and a neomycin resistance gene operably linked to a human ubiquitin promoter. By employing the Prml promoter, the self-deleting cassette can be deleted specifically in male germ cells of F0 animals. The polynucleotide encoding the selection marker can be operably linked to a promoter active in a cell being targeted. Examples of promoters are described elsewhere herein. As another specific example, a self-deleting selection cassette can comprise a hygromycin resistance gene coding sequence operably linked to one or more promoters (e.g., both human ubiquitin and EM7 promoters) followed by a polyadenylation signal, followed by a Crei coding sequence operably linked to one or more promoters (e.g., an mPrml promoter), followed by another polyadenylation signal, wherein the entire cassette is flanked by loxP sites.

[00129] The humanized ACE2 locus can also be a conditional allele. For example, the conditional allele can be a multifunctional allele, as described in US 2011/0104799, herein incorporated by reference in its entirety for all purposes. For example, the conditional allele can comprise: (a) an actuating sequence in sense orientation with respect to transcription of a target gene; (b) a drug selection cassette (DSC) in sense or antisense orientation; (c) a nucleotide sequence of interest (NSI) in antisense orientation; and (d) a conditional by inversion module (COIN, which utilizes an exon-splitting intron and an invertible gene-trap- like module) in reverse orientation. See, e.g., US 2011/0104799. The conditional allele can further comprise recombinable units that recombine upon exposure to a first recombinase to form a conditional allele that (i) lacks the actuating sequence and the DSC; and (ii) contains the NSI in sense orientation and the COIN in antisense orientation. See, e.g., US 2011/0104799.

[00130] One exemplary humanized ACE2 locus (e.g., a humanized mouse ACE2 locus) is one in which part of the first coding exon through part of the 17th coding exon of the endogenous ACE2 gene are replaced with the corresponding human sequence. These exons encode the extracellular domain of ACE2. Optionally, the humanized sequence can be through the stop codon and 3’ UTR, and optionally into the sequence just downstream of the 3’ UTR. Optionally, a portion o the intron upstream of coding exon 1 is also humanized.

B, Transmembrane Serine Protease (TMPRSS) and Humanized TMPRSS Loci

[00131] Type II transmembrane serine proteases, also referred to herein as “TTSPs, "TMPRSS," "transmembrane serine protease,” and the like, are a family of proteins characterized by an N-terminal transmembrane domain and a C-terminal extracellular serine protease domain. At least 18 members have been identified in the family, which are grouped into four subfamilies (Bugge et al. (2009) J. Biol. Chem. 284 (35): 23177-23181). All members share several common structural features that define the family, including (i) a short N-terminal cytoplasmic domain, (ii) a transmembrane domain, and (iii) an extracellular that contains a protease domain and a stem region that links the transmembrane domain with the protease domain. The stem region contains a combination of modular structural domains of six different types: a SEA (sea urchin sperm protein/enteropeptidase/agrin) domain, a group A scavenger receptor domain, a LDLA (low-density lipoprotein receptor class A) domain, a CUB (Cls/Clr urchin embryonic growth factor, bone morphogenetic protein-1) domain, a MAM (meprin/A5 antigen/receptor protein phosphatase mu) domain, and a frizzled domain. See review by Bugge et al. (2009), supra. For example, TMPRSS2 and TMPRSS4, both of which belong to the hepsin/TMPRSS subfamily, have a group A scavenger receptor domain, preceded by a single LDLA domain in the stem region. TMPRSS1 ID, also known as "HAT" for human airway trypsin-like protease that belongs to the HAT/DESC subfamily, has a single SEA domain. See the first figure of Bugge et al. (2009), supra.

[00132] Type II transmembrane serine proteases are produced initially as inactive proenzymes that require activation by cleavage following a basic amino acid residue in a consensus activation motif preceding the protease domain. Some of the activated proteases remain membrane bound as a result of a disulfide bond between the prodomain and the protease domain. The extracellular domains are considered to be critical for cellular localization, activation, inhibition, and/or substrate specificity of these proteases (Bugge et al. (2009), supra; Szabo et al., hit. J. Biochem. Cell Biol. 40: 1297-1316 (2008)).

[00133] Various biochemical and pathophysiological information has been documented for members of the type II transmembrane serine proteases. TMPRSS2, TMPRSS4 and TMPRSS1 ID have been shown in vitro to cleave influenza A hemagglutinin (HA), which is the first essential step in the influenza viral life cycle. TMPRSS2 and TMPRSS4 appears to be involved in cleaving priming the S protein of coronavirus, e.g., SARS-CoV-2, to initiate viral infection. Following receptor engagement, SARS-CoV-2 S is processed by TMPRSS2 to allow release the viral contents into the host cell cytosol. Hoffman et al. (2020) Cell 181 : 271-80; Matsuyama et al. (2020) Proc. Natl. Acad. Sci. USA 17:7001-3. It has also been shown that TMPRSS2 and TMPRSS4 serine proteases mediate SARS-CoV-2 infection of human mature enterocytes from the apical surface by inducing cleavage of the S protein and enhancing membrane fusion. Zhang et al. recently showed that TMPRSS11A also cleaved the SARS-CoV-2 protein in cell culture-derived medium, suggesting that other TTSPs may participate in SARS-CoV-2 infection. Id. (2020) J. Biol. Chem. [00134] Accordingly, to increase the efficacy of the non-human animals and non-human animal cell described herein as models to evaluate the course of human coronavirus infection and/or therapies therefor, a non-human animal as described herein may comprise multiple humanized loci. In some non-limiting embodiments, a non-human animal herein (e.g., rodent, e.g., rat or mouse) comprising a human or humanized ACE2 locus that expresses a recombinant ACE2 protein may also further comprise a humanized TMPRSS, e.g., a TMPRSS2, TMPRSS4 and/or TMPRSS 11 locus. See, e.g., U.S. Patent Nos. 10,070,631 and 10,070,632, each of which is incorporated herein by reference in its entirety.

[00135] In some embodiments, the rodent disclosed herein contains a humanized TMPRSS gene in the genome that includes a nucleotide sequence of an endogenous non- human TMPRSS gene and a nucleotide sequence of a human TMPRSS gene, wherein the nucleotide sequence of the endogenous non-human TMPRSS gene and the nucleotide sequence of the human TMPRSS gene are operably linked to each other such that the humanized TMPRSS gene encodes a TMPRSS protein and is under control of a 5 ’regulatory element(s), such as the promoter and/or enhancer(s), of the endogenous non-human TMPRSS gene.

[00136] The present disclosure is particularly directed to like-for-like humanization; in other words, a nucleotide sequence of an endogenous non-human TMPRSS gene is operably linked to a nucleotide sequence of an orthologous, e.g., corresponding, human TMPRSS gene to form a humanized gene. For example, in some embodiments, a nucleotide sequence of an endogenous non-human TMPRSS2 gene is operably linked to a nucleotide sequence of a human TMPRSS2 gene to form a humanized TMPRSS2 gene. In other embodiments, a nucleotide sequence of an endogenous non-human TMPRSS4 gene is operably linked to a nucleotide sequence of a human TMPRSS4 gene to form a humanized TMPRSS4 gene. In still other embodiments, a nucleotide sequence of an endogenous non-human TMPRSS 1 Id gene is operably linked to a nucleotide sequence of a human TMPRSS1 ID gene to form a humanized TMPRSS1 Id gene.

[00137] In some embodiments, a genetically modified rodent of this invention contains a humanized TMPRSS gene in its genome, wherein the humanized TMPRSS gene encodes a recombinant TMPRSS protein that contains an extracellular portion that is substantially identical with the extracellular portion of a human TMPRSS protein. The term "extracellular" refers to the portion of a transmembrane protein that extends outside of the cell membrane, i.e., the extracellular portion of a transmembrane protein. The extracellular portion of a TMPRSS molecule includes a protease domain and a stem region that links the transmembrane domain with the protease domain. By an extracellular or polypeptide that is "substantially identical with the extracellular of a human TMPRSS protein", it is meant in some embodiments, a polypeptide that is at least 85%, 90%, 95%, 95%, 99% or 100% identical in sequence with the extracellular portion of a human TMPRSS protein; in some embodiments, a polypeptide that differs from the extracellular of a human TMPRSS protein by not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 amino acid(s); in some embodiments, a polypeptide that differs from the extracellular portion of a human TMPRSS protein only at the N- or C-terminus of the extracellular portion, e.g., by lacking amino acids or having additional amino acids at the at the N- or C-terminus of the extracellular portion; and in some embodiments, a polypeptide that is substantially the extracellular portion of a human TMPRSS protein. By "substantially the extracellular" of a human TMPRSS protein, it is meant a polypeptide that is identical with the extracellular portion, or differs from the extracellular portion by lacking 1-5 (i.e., 1, 2, 3, 4 or 5) amino acids or having additional 1-5 amino acids at the N- or C-terminus.

[00138] In some embodiments, the humanized TMPRSS gene encodes a recombinant TMPRSS protein that further contains a cytoplasmic and transmembrane portion that is substantially identical with the cytoplasmic and transmembrane portion of an endogenous non-human TMPRSS protein. By a cytoplasmic and transmembrane portion or polypeptide that is "substantially identical with the cytoplasmic and transmembrane portion of an endogenous non-human TMPRSS protein", it is meant in some embodiments, a polypeptide that is at least 85%, 90%, 95%, 95%, 99% or 100% identical in sequence with the cytoplasmic and transmembrane portion of an endogenous non-human TMPRSS protein; in some embodiments, a polypeptide that differs from the cytoplasmic and transmembrane portion of an endogenous non-human TMPRSS protein by not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 amino acid(s); in some embodiments, a polypeptide that differs from the cytoplasmic and transmembrane portion of an endogenous non-human TMPRSS protein only at the C-terminus, e.g., by lacking amino acids or having additional amino acids at the at the C-terminus of the transmembrane domain; and in some embodiments, a polypeptide composed of the cytoplasmic domain and substantially the transmembrane domain of an endogenous non-human TMPRSS protein. By "substantially the transmembrane domain" of an endogenous non-human TMPRSS protein, it is meant a polypeptide that is identical with the transmembrane domain, or differs from the transmembrane domain by lacking 1-5 amino acids or having additional 1-5 amino acids at the C-terminus. [00139] In some embodiments, the humanized TMPRSS gene in the genome of a genetically modified rodent includes a nucleotide sequence of an endogenous non-human TMPRSS gene and a nucleotide sequence of an orthologous human TMPRSS gene, wherein the nucleotide sequence of the orthologous human TMPRSS gene encodes a polypeptide substantially identical to the extracellular of the human TMPRSS protein encoded by the human TMPRSS gene. In certain embodiments, the nucleotide sequence of an orthologous human TMPRSS gene in a humanized TMPRSS gene encodes the extracellular of the human TMPRSS protein encoded by the human TMPRSS gene.

[00140] In some embodiments, the humanized TMPRSS gene in the genome of a genetically modified rodent includes a nucleotide sequence of an endogenous non-human TMPRSS gene and a nucleotide sequence of an orthologous human TMPRSS gene, wherein the nucleotide sequence of an endogenous non-human TMPRSS gene encodes a polypeptide substantially identical to the cytoplasmic and transmembrane portion of the endogenous non- human TMPRSS protein encoded by the non-human TMPRSS gene. In specific embodiments, the nucleotide sequence of an endogenous non-human TMPRSS gene present in a humanized TMPRSS gene encodes the cytoplasmic and transmembrane domains of the endogenous non-human TMPRSS protein encoded by the endogenous non-human TMPRSS gene.

[00141] In some embodiments, a humanized TMPRSS gene results from a replacement of a nucleotide sequence of an endogenous non-human TMPRSS gene at an endogenous non- human TMPRSS locus with a nucleotide sequence of an orthologous human TMPRSS gene. [00142] In some embodiments, a contiguous genomic sequence of a non-human TMPRSS gene at an endogenous non-human TMPRSS locus has been replaced with a contiguous genomic sequence of an orthologous human TMPRSS gene to form a humanized TMPRSS gene.

[00143] In specific embodiments, a contiguous genomic sequence of a human TMPRSS gene inserted into an endogenous non-human TMPRSS gene includes exons, in full or in part, of a human TMPRSS gene, that encode an extracellular that is substantially identical with the extracellular of the human TMPRSS protein encoded by the human TMPRSS gene.

[00144] In certain embodiments, the genomic sequence of an endogenous non-human TMPRSS gene that remains at an endogenous non-human TMPRSS locus after the humanization replacement and is operably linked to the inserted contiguous human TMPRSS genomic sequence encodes a cytoplasmic and transmembrane portion that is substantially identical with the cytoplasmic and transmembrane portion of the endogenous non-human TMPRSS protein encoded by the endogenous non-human TMPRSS gene.

[00145] In circumstances where an endogenous TMPRSS protein and a human TMPRSS protein share common amino acids near the junction between the transmembrane domain and the extracellular domain, it may not be necessary to insert a human TMPRSS genomic sequence that encodes precisely the extracellular domain of the human TMPRSS protein. It is possible to insert a slightly longer or shorter genomic sequence of a human TMPRSS gene, which encodes substantially the extracellular domain of the human TMPRSS protein, in operable linkage to a genomic sequence of an endogenous non-human TMPRSS gene that encodes the cytoplasmic domain and substantially the transmembrane domain of the endogenous non-human TMPRSS protein, such that the recombinant TMPRSS protein encoded by the resulting humanized TMPRSS gene includes an extracellular domain that is identical with the extracellular domain of the human TMPRSS protein and a transmembrane domain that is identical with the transmembrane domain of the endogenous non-human TMPRSS protein.

[00146] In some embodiments, the nucleotide sequence of a human TMPRSS gene included in a humanized TMPRSS gene also includes the 3’ untranslated region ("UTR") of the human TMPRSS gene. In certain embodiments, in addition to the 3’ UTR of a human TMPRSS gene, a humanized TMPRSS gene also includes an additional human genomic sequence from the human TMPRSS gene locus, following the human TMPRSS 3’ UTR. The additional human genomic sequence can consist of at least 10-200 bp, e.g., 50 bp, 75 bp, 100 bp, 125 bp, 150 bp, 175 bp, 200 bp, or more, found in the human TMPRSS gene locus immediately downstream of the 3’ UTR of the human TMPRSS gene. In other embodiments, the nucleotide sequence of a human TMPRSS gene present in a humanized TMPRSS gene does not include a human 3’ UTR; instead, the 3’ UTR of an endogenous non-human TMPRSS gene is included and immediately follows the stop codon of the humanized TMPRSS gene. For example, a humanized TMPRSS gene can include a nucleotide sequence of an endogenous non-human TMPRSS gene containing exon sequences encoding the cytoplasmic and transmembrane domains of the endogenous non-human TMPRSS protein, followed by a nucleotide sequence of a human TMPRSS gene containing exon sequences encoding the extracellular through the stop codon of the human TMPRSS protein, with the 3’ UTR of the endogenous non-human TMPRSS gene following immediately after the stop codon. [00147] In some embodiments, a humanized TMPRSS gene results in an expression of the encoded recombinant TMPRSS protein in a rodent. In some embodiments, a recombinant TMPRSS protein is expressed in a pattern comparable with, or substantially the same as, a counterpart non-human TMPRSS protein in a control rodent (e.g., a rodent without the humanized TMPRSS gene). In some embodiments, a recombinant TMPRSS protein is expressed at a level comparable with, or substantially the same as, a counterpart non-human TMPRSS protein in a control rodent (e.g., a rodent without the humanized TMPRSS gene). In certain embodiments, a recombinant TMPRSS protein is expressed and detected at the cell surface. In certain embodiments, a recombinant TMPRSS protein or a soluble form (e.g., a shed extracellular form) is expressed and detected in serum of a rodent, e.g., at a level comparable with, or substantially the same as, a counterpart non-human TMPRSS protein or a soluble form thereof in a control rodent. In the context of comparing a humanized gene or protein in a humanized rodent with an endogenous rodent gene or protein in a control rodent, the term "comparable" means that the molecules or levels being compared may not be identical to one another but are sufficiently similar to permit comparison there between so that conclusions may reasonably be drawn based on differences or similarities observed; and the term "substantially the same" in referring to expression levels means that the levels being compared are not different from one another by more than 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1%.

C. Non-Human Cells and Non-Human Animals Comprising a Humanized ACE2 and AAS Loci

[00148] Non-human animal cells and non-human animals comprising a humanized ACE2 locus and a humanized TMPRSS as described elsewhere herein are provided. The cells or non-human animals can be heterozygous or homozygous for the humanized ACE2 locus and/or for the TMRPSS locus. A diploid organism has two alleles at each genetic locus. Each pair of alleles represents the genotype of a specific genetic locus. Genotypes are described as homozygous if there are two identical alleles at a particular locus and as heterozygous if the two alleles differ.

[00149] The non-human animal cells provided herein can be, for example, any non- human cell comprising a humanized ACE2 locus or a genomic locus homologous or orthologous to the human ACE2 locus, and a humanized TMPRSS locus or a genomic locus homologous or orthologous to a human TMPRSS locus. The cells can be eukaryotic cells, which include, for example, fungal cells (e.g., yeast), plant cells, animal cells, mammalian cells, non-human mammalian cells, and human cells. The term “animal” includes mammals, fishes, and birds. A mammalian cell can be, for example, a non-human mammalian cell, a rodent cell, a rat cell, a mouse cell, or a hamster cell. Other non-human mammals include, for example, non-human primates, monkeys, apes, orangutans, cats, dogs, rabbits, horses, bulls, deer, bison, livestock (e.g., bovine species such as cows, steer, and so forth; ovine species such as sheep, goats, and so forth; and porcine species such as pigs and boars). Birds include, for example, chickens, turkeys, ostrich, geese, ducks, and so forth. Domesticated animals and agricultural animals are also included. The term “non-human” excludes humans. [00150] The cells can also be any type of undifferentiated or differentiated state. For example, a cell can be a totipotent cell, a pluripotent cell (e.g., a human pluripotent cell or a non-human pluripotent cell such as a mouse embryonic stem (ES) cell or a rat ES cell), or a non-pluripotent cell. Totipotent cells include undifferentiated cells that can give rise to any cell type, and pluripotent cells include undifferentiated cells that possess the ability to develop into more than one differentiated cell types. Such pluripotent and/or totipotent cells can be, for example, ES cells or ES-like cells, such as an induced pluripotent stem (iPS) cells. ES cells include embryo-derived totipotent or pluripotent cells that are capable of contributing to any tissue of the developing embryo upon introduction into an embryo. ES cells can be derived from the inner cell mass of a blastocyst and are capable of differentiating into cells of any of the three vertebrate germ layers (endoderm, ectoderm, and mesoderm). [00151] The cells provided herein can also be germ cells (e.g., sperm or oocytes). The cells can be mitotically competent cells or mitotically-inactive cells, meiotically competent cells or meiotically-inactive cells. Similarly, the cells can also be primary somatic cells or cells that are not a primary somatic cell. Somatic cells include any cell that is not a gamete, germ cell, gametocyte, or undifferentiated stem cell. For example, the cells can be liver cells, such as hepatoblasts or hepatocytes.

[00152] Suitable cells provided herein also include primary cells. Primary cells include cells or cultures of cells that have been isolated directly from an organism, organ, or tissue. Primary cells include cells that are neither transformed nor immortal. They include any cell obtained from an organism, organ, or tissue which was not previously passed in tissue culture or has been previously passed in tissue culture but is incapable of being indefinitely passed in tissue culture. Such cells can be isolated by conventional techniques and include, for example, hepatocytes.

[00153] Other suitable cells provided herein include immortalized cells. Immortalized cells include cells from a multicellular organism that would normally not proliferate indefinitely but, due to mutation or alteration, have evaded normal cellular senescence and instead can keep undergoing division. Such mutations or alterations can occur naturally or be intentionally induced. A specific example of an immortalized cell line is the HepG2 human liver cancer cell line. Numerous types of immortalized cells are well known. Immortalized or primary cells include cells that are typically used for culturing or for expressing recombinant genes or proteins.

[00154] The cells provided herein also include one-cell stage embryos (i.e., fertilized oocytes or zygotes). Such one-cell stage embryos can be from any genetic background (e.g., BALB/c, C57BL/6, 129, or a combination thereof for mice), can be fresh or frozen, and can be derived from natural breeding or in vitro fertilization.

[00155] The cells provided herein can be normal, healthy cells, or can be diseased or mutant-bearing cells.

[00156] Non-human animals comprising a humanized ACE2 locus and a humanized TMPRSS locus as described herein can be made by the methods described elsewhere herein. The term “animal” includes mammals, fishes, and birds. Non-human mammals include, for example, non-human primates, monkeys, apes, orangutans, cats, dogs, horses, bulls, deer, bison, sheep, rabbits, rodents (e.g., mice, rats, hamsters, and guinea pigs), and livestock (e.g., bovine species such as cows and steer; ovine species such as sheep and goats; and porcine species such as pigs and boars). Birds include, for example, chickens, turkeys, ostrich, geese, and ducks. Domesticated animals and agricultural animals are also included. The term “non- human animal” excludes humans. Preferred non-human animals include, for example, rodents, such as mice and rats.

[00157] The non-human animals can be from any genetic background. For example, suitable mice can be from a 129 strain, a C57BL/6 strain, a mix of 129 and C57BL/6, a BALB/c strain, or a Swiss Webster strain. Examples of 129 strains include 129P1, 129P2, 129P3, 129X1, 129S1 (e.g., 129S1/SV, 129Sl/Svlm), 129S2, 129S4, 129S5, 129S9/SvEvH, 129S6 (129/SvEvTac), 129S7, 129S8, 129T1, and 129T2. See, e.g., & n etal. (1999) Mammalian Genome 10:836, herein incorporated by reference in its entirety for all purposes. Examples of C57BL strains include C57BL/A, C57BL/An, C57BL/GrFa, C57BL/Kal_wN, C57BL/6, C57BL/6J, C57BL/6ByJ, C57BL/6NJ, C57BL/10, C57BL/10ScSn, C57BL/10Cr, and C57BL/01a. Suitable mice can also be from a mix of an aforementioned 129 strain and an aforementioned C57BL/6 strain (e.g., 50% 129 and 50% C57BL/6). Likewise, suitable mice can be from a mix of aforementioned 129 strains or a mix of aforementioned BL/6 strains (e.g., the 129S6 (129/SvEvTac) strain). [00158] Similarly, rats can be from any rat strain, including, for example, an ACI rat strain, a Dark Agouti (DA) rat strain, a Wistar rat strain, a LEA rat strain, a Sprague Dawley (SD) rat strain, or a Fischer rat strain such as Fisher F344 or Fisher F6. Rats can also be obtained from a strain derived from a mix of two or more strains recited above. For example, a suitable rat can be from a DA strain or an ACI strain. The ACI rat strain is characterized as having black agouti, with white belly and feet and an RTl avl haplotype. Such strains are available from a variety of sources including Harlan Laboratories. The Dark Agouti (DA) rat strain is characterized as having an agouti coat and an RTl avl haplotype. Such rats are available from a variety of sources including Charles River and Harlan Laboratories. Some suitable rats can be from an inbred rat strain. See, e.g., US 2014/0235933, herein incorporated by reference in its entirety for all purposes.

[00159] In some embodiments, a non-human animal comprising a genetically modified endogenous ACE2 locus as described herein expresses a recombinant ACE2 protein in an organ selected from the group consisting of colon, duodenum, kidney, heart, liver, lung, trachea, and any combination thereof. In some embodiments, the expression pattern of a recombinant ACE2 protein in a genetically modified non-human animal as described herein follows the expression pattern of a non-human animal ACE2 protein in a control non-human animal comprising a wildtype endogenous ACE2 locus.

[00160] In some embodiments, the recombinant ACE2 protein is expressed on epithelial cells. Accordingly, also described herein is a non-human animal cell expressing a recombinant ACE2 protein, optionally wherein the non-human animal cell (e.g., rat cell or mouse cell) is a somatic cell, optionally wherein the somatic cell is an epithelial cell. Nonlimiting exmaples of epithethial cells that may express a recombination ACE2 protein as described herein include respiratory and/or gastrointestinal epithelial cells, e.g., an alveolar cell of the lung, an esophagus upper and stratified epithelial cell, an absorptive enterocyte from the ileum or colon, etc. In some embodiments, a non-human animal cell as described hererein expresses the recombinant ACE2 protein in the epithelium of small intestine villi, surface epithelium of the large intestine (colon), the epithelium of large to small bronchioles and bronchi of the lung, respiratory epithelium of the trachea, proximal tubular epithelium of the kidney, respiratory epithelium of the nasal cavity, and/or the stratum granulosum and/or stratum spinosum of oral mucosa/tongue in the oral cavity. III. METHODS OF USING NON-HUMAN ANIMALS COMPRISING A HUMANIZED ACE2 LOCUS FOR ASSESSING CORONA VIRUS INFECTION AND/OR ANTI-CORONA VIRUS THERAPIES

[00161] Various methods are provided for using the non-human animals comprising a human or humanized ACE2 locus and a IMPRSS locus for assessing coronavirus infection and/or the in vivo efficacy of human anti-coronavirus treatments. Because the non-human animals comprise a human or humanized A CE2 locus and a human or humanized IMPRSS locus, the non-human animals will more accurately reflect coronavirus infection mediated by human ACE2 or human anti-coronavirus therapies than non-human animals with a nonhumanized ACE2 locus with or without a humanized TMPRSS locus. As one example, the methods can monitor coronavirus infection comprising infecting a non-human animal as described herein with a coronavirus that utilizes a human ACE2 and/or TMPRSS protein for infection. In some embodiments, a non-human animal as described herein may be infected by intranasal inhalation of the coronoavirus, e.g., SARS-CoV-2. In some embodiments, a non-human animal as desribed herein may be infected by intragastric injection.

[00162] In some embodiments, the method further comprises assessing the non-human animal for coronavirus related disorders and/or diseases, e.g., lung capacity, gastrointestinal disorders and/or clotting related disorders, e.g., ischemia, and/or disease progression.

Disease progression may be monitored by obvious clinical signs, e.g., respiratory distress, neurological symptoms, death, etc. Disease progression may also be monitored by measuring the amount of replicating viruses of the coronoavirus, e.g., SARS-CoV-2, that may be isolated from organs (e.g., lungs, brain) of of the infected animal, e.g., by well-known plaque assays.

[00163] In some embodiments, a non-human animal comprising a modified endogenous ACE2 locus and a modified TMPRSS locus as described herein is infected with a SARS-CoV-2 strain, e.g., the non-human animal further comprises replicating SARS-CoV-2. In some embodiments, a non-human animal comprising a modified endogenous A CE2 locus and a modified IMPRSS locus as described herein as described herein and replicating SARS- CoV-2 exhibits COVID-19 symptoms for at least one, at least two, at least three, at least four, or at least five days post infection. In some embodiments, the COVID-19 symptom is selected from the group consisting of viral replication in an organ, minimal to severe inflammation (perivascular, vascular, peribronchiolar, septa and alveoli), minimal to severe necrosis (vascular, bronchioles, septa and alveoli), minimal to severe syncytia (vascular endothelium, bronchiolar epithelium and alveolar epithelium), minimal to severe hypertrophy/hyperplasia (vascular endothelium, bronchiolar and alveolar epithelium), minimal to severe hemorrhage (bronchioles and alveoli), minimal to severe edema (bronchioles and alveoli), minimal to severe fibrin (alveoli) and/or minimal to severe hyaline membranes (alveoli), and any combination thereof. In some embodiments, the COVID-19 symptom is selected from the group consisting of viral replication in an organ, necrosis of epithelium, e.g., necrosis of bronchiolar epithelium, vasculitis, endothelialitis, alveolar hyperplasia and/or syncytia, bronchiolar hyperplasia and syncytia, alveolar hemorrhage, perivascular edema, and any combination thereof. In some embodiments, the amount of replicating virus isolated from an organ of an infected non-human animal comprising a genetically modified endogenous is directly correlated with the severity of at least one symptom selected from the group consisting of inflammation (perivascular, vascular, peribronchiolar, septa and alveoli), necrosis (vascular, bronchioles, septa and alveoli), syncytia (vascular endothelium, bronchiolar epithelium and alveolar epithelium), hypertrophy/hyperplasia (vascular endothelium, bronchiolar and alveolar epithelium), hemorrhage (bronchioles and alveoli), minimal to severe edema (bronchioles and alveoli), fibrin (alveoli) and/or hyaline membranes (alveoli), necrosis of epithelium, e.g., necrosis of bronchiolar epithelium, vasculitis, endothelialitis, alveolar hyperplasia and/or syncytia, bronchiolar hyperplasia and syncytia, alveolar hemorrhage, perivascular edema, and any combination thereof. In some embodiments, severity of a COVID19 symptom is scored on a scale of 0 to 4 (0- within normal limits, 1-minimal, 2-mild, 3-moderate and 4- severe). [00164] In some embodiments, a method as described herein assesses or identifies a candidate agent capable of preventing, reducing or otherwise treating coronavirus infection and related disorders, (e.g., preventing, reducing or eliminating binding of the coronavirus (ligand of a human ACE2 protein) to the human ACE2 protein), the method comprising administering an antigen binding protein specific for a coronavirus to a non-human animal comprising a humanized A CE2 locus and a humanized TMPRSS locus and monitoring the non-human animal for coronavirus related disorders and/or diseases, e.g., lung capacity, gastrointestinal disorders and/or clotting related disorders, e.g., ischemia, wherein the non- human animal is infected with the coronavirus before, simultaneously with, or after the administration, and wherein a reduction of the coronavirus related disorders and/or diseases compared to that of a control animal identifies the candidate agent as capable of preventing, reducing or otherwise treating coronavirus infection and related disorders. , e.g., identifies the candidate as capable of preventing, reducing or eliminating binding of the coronavirus (ligand of a human ACE2 protein) to human ACE2 protein. [00165] In some embodiments, a non-human animal comprising a modified endogenous ACE2 locus and a modified TMPRSS locus as described herein is infected with a SARS-CoV-2 strain, e.g., the non-human animal further comprises replicating SARS-CoV-2, before, after, or simultaneously with an antigen binding protein (e.g., an antibody) specific for SARS-CoV-2 (e.g., the spike protein of SARS-CoV-2). Accordingly, in some embodiments, a non-human animal as described herein comprises a modified endogenous ACE2 locus, a modified endogenous TMPRSS locus, replicating SARS-CoV-2, and an antigen binding protein that binds SARS-CoV-2. In some embodiments, a method described herein comprises administering an antigen-binding protein that binds SARS-CoV-2 and SARS-CoV-2 to a non-human animal comprising a genetically modified endogenous ACE2 locus and a modified endogenous TMPRSS locus as described herein as described herein and monitoring the non-human animal for COVID-19 related symptoms, wherein the antigenbinding protein that binds SARS-CoV-2 may be administered prior to, simultaneously with, or after the administration of SARS-CoV-2. In some embodiments, the non-human animal is monitored within one week of adminstration of (infection with) SARS-CoV-2. In some embodiments, the non-human animal is monitored for at least 3 days after adminstration of (infection with) SARS-CoV-2. In some embodiments, the non-human animal is monitored for at least 3 days after adminstration of (infection with) SARS-CoV-2. In some embodiments, the non-human animal is monitored for COVID-19 related symptoms 1 to 2 days after adminstration of (infection with) SARS-CoV-2.

[00166] In some embodiments, the COVID-19 symptom is selected from the group consisting of viral replication in an organ, minimal to severe inflammation (perivascular, vascular, peribronchiolar, septa and alveoli), minimal to severe necrosis (vascular, bronchioles, septa and alveoli), minimal to severe syncytia (vascular endothelium, bronchiolar epithelium and alveolar epithelium), minimal to severe hypertrophy/hyperplasia (vascular endothelium, bronchiolar and alveolar epithelium), minimal to severe hemorrhage (bronchioles and alveoli), minimal to severe edema (bronchioles and alveoli), minimal to severe fibrin (alveoli) and/or minimal to severe hyaline membranes (alveoli), and any combination thereof. In some embodiments, the COVID-19 symptom is selected from the group consisting of viral replication in an organ, necrosis of epithelium, e.g., necrosis of bronchiolar epithelium, vasculitis, endothelialitis, alveolar hyperplasia and/or syncytia, bronchiolar hyperplasia and syncytia, alveolar hemorrhage, perivascular edema, and any combination thereof. [00167] In some embodiments, the dose of the antigen-binding protein is inversely correlated with the amount of replicating virus isolated from an organ of an infected nonhuman animal comprising a genetically modified endogenous and with the severity of at least one symptom selected from the group consisting of inflammation (perivascular, vascular, peribronchiolar, septa and alveoli), necrosis (vascular, bronchioles, septa and alveoli), syncytia (vascular endothelium, bronchiolar epithelium and alveolar epithelium), hypertrophy/hyperplasia (vascular endothelium, bronchiolar and alveolar epithelium), hemorrhage (bronchioles and alveoli), minimal to severe edema (bronchioles and alveoli), fibrin (alveoli) and/or hyaline membranes (alveoli), necrosis of epithelium, e.g., necrosis of bronchiolar epithelium, vasculitis, endothelialitis, alveolar hyperplasia and/or syncytia, bronchiolar hyperplasia and syncytia, alveolar hemorrhage, perivascular edema, and any combination thereof. In some embodiments, severity of a COVID19 symptom is scored on a scale of 0 to 4 (0- within normal limits, 1-minimal, 2-mild, 3-moderate and 4- severe).

IV. METHODS OF MAKING NON-HUMAN ANIMALS COMPRISING HUMANIZED ACE2 AND TMPRSS2 LOCI

[00168] Various methods are provided for making a non-human animal comprising a humanized ACE2 locus as disclosed elsewhere herein. Any convenient method or protocol for producing a genetically modified organism is suitable for producing such a genetically modified non-human animal. See, e.g., Cho et al. (2009) Current Protocols in Cell Biology 42: 19.11 : 19.11.1-19.11.22 and Gama Sosa et al. (2010) Brain Struct. Funct. 214(2-3):91- 109, each of which is herein incorporated by reference in its entirety for all purposes. Such genetically modified non-human animals can be generated, for example, through gene knock- in at a targeted ACE2 locus.

[00169] For example, the method of producing a non-human animal comprising a humanized ACE2 locus and a humanized IMPRSS locus can comprise first generating a pluripotent cell (e.g., a non-human animal ES cell) to comprise a humanized A CE2 locus and then further modifying the modified pluripotent cell to further comprise a humanized TMPRSS locus, and generating a non-human animal with the pluripotent cell modified to comprise both the humanized ACE2 locus and the humanized IMPRSS locus. Modifying the pluripotent cell (e.g., non-human ES cell) to comprise a humanized ACE2 locus may comprise (1) modifying the genome of a pluripotent cell to comprise the humanized A CE2 locus and (2) identifying or selecting the genetically modified pluripotent cell comprising the humanized ACE2 locus. Modifying the pluripotent cell (e.g., non-human ES cell) to comprise a humanized TMPRSS locus may comprise (1) modifying the genome of a pluripotent cell to comprise the humanized TMPRSS locus and (2) identifying or selecting the genetically modified pluripotent cell comprising the humanized TMPRSS locus. Modified ES cells may then be introduced into a non-human animal host embryo; and the host embryo implanted in a surrogate mother for gestation. Optionally, the host embryo comprising modified pluripotent cell (e.g., a non-human ES cell) can be incubated until the blastocyst stage before being implanted into and gestated in the surrogate mother to produce an F0 non-human animal. The surrogate mother can then produce an F0 generation non-human animal comprising the humanized ACE2 locus and the humanized TMPRSS locus.

[00170] The methods can further comprise identifying a cell or animal having a modified target genomic locus. Various methods can be used to identify cells and animals having a targeted genetic modification.

[00171] The screening step can comprise, for example, a quantitative assay for assessing modification of allele (MO A) of a parental chromosome. For example, the quantitative assay can be carried out via a quantitative PCR, such as a real-time PCR (qPCR). The real-time PCR can utilize a first primer set that recognizes the target locus and a second primer set that recognizes a non-targeted reference locus. The primer set can comprise a fluorescent probe that recognizes the amplified sequence.

[00172] Other examples of suitable quantitative assays include fluorescence-mediated in situ hybridization (FISH), comparative genomic hybridization, isothermic DNA amplification, quantitative hybridization to an immobilized probe(s), INVADER® Probes, TAQMAN® Molecular Beacon probes, or ECLIPSE™ probe technology (see, e.g., US 2005/0144655, incorporated herein by reference in its entirety for all purposes).

[00173] An example of a suitable pluripotent cell is an embryonic stem (ES) cell (e.g., a mouse ES cell or a rat ES cell). The modified pluripotent cell can be generated, for example, through recombination by (a) introducing into the cell, in any order, a first and second targeting vector, each comprising a distinct insert nucleic acid flanked by 5’ and 3’ homology arms corresponding to 5’ and 3’ target sites, wherein one insert nucleic acid comprises a humanized ACE2 locus and an other insert nucleic acid comprises a humanized TMPRSS locus; and (b) identifying at least one cell comprising in its genome both insert nucleic acids integrated at the target genomic locus. Alternatively, the modified pluripotent cell can be generated by (a) introducing into the cell: (i) a nuclease agent, wherein the nuclease agent induces a nick or double-strand break at a recognition site within the target genomic locus; and (ii) one or more targeting vectors comprising an insert nucleic acid flanked by 5’ and 3’ homology arms corresponding to 5’ and 3’ target sites located in sufficient proximity to the recognition site, wherein the insert nucleic acid(s) comprises the humanized A CE2 locus and/or the humanized TMPRSS locus; and (b) identifying at least one cell comprising a modification (e.g., integration of the insert nucleic acid) at the target genomic locus. Any nuclease agent that induces a nick or double-strand break into a desired recognition site can be used. Examples of suitable nucleases include a Transcription Activator-Like Effector Nuclease (TALEN), a zinc-finger nuclease (ZFN), a meganuclease, and Clustered Regularly Interspersed Short Palindromic Repeats (CRISPR)/CRISPR-associated (Cas) systems or components of such systems (e.g., CRISPR/Cas9). See, e.g., US 2013/0309670 and US 2015/0159175, each of which is herein incorporated by reference in its entirety for all purposes.

[00174] The donor cell can be introduced into a host embryo at any stage, such as the blastocyst stage or the pre-morula stage (i.e., the 4 cell stage or the 8 cell stage). Progeny that are capable of transmitting the genetic modification though the germline are generated. See, e.g., US Patent No. 7,294,754, herein incorporated by reference in its entirety for all purposes.

[00175] Alternatively, the method of producing the non-human animals described elsewhere herein can comprise: (1) modifying the genome of a one-cell stage embryo to comprise the humanized A CE2 locus using the methods described above for modifying pluripotent cells; (2) selecting the genetically modified embryo; and (3) implanting and gestating the genetically modified embryo into a surrogate mother. Progeny that are capable of transmitting the genetic modification though the germline are generated.

[00176] Nuclear transfer techniques can also be used to generate the non-human mammalian animals. Briefly, methods for nuclear transfer can include the steps of: (1) enucleating an oocyte or providing an enucleated oocyte; (2) isolating or providing a donor cell or nucleus to be combined with the enucleated oocyte; (3) inserting the cell or nucleus into the enucleated oocyte to form a reconstituted cell; (4) implanting the reconstituted cell into the womb of an animal to form an embryo; and (5) allowing the embryo to develop. In such methods, oocytes are generally retrieved from deceased animals, although they may be isolated also from either oviducts and/or ovaries of live animals. Oocytes can be matured in a variety of well-known media prior to enucleation. Enucleation of the oocyte can be performed in a number of well-known manners. Insertion of the donor cell or nucleus into the enucleated oocyte to form a reconstituted cell can be by microinjection of a donor cell under the zona pellucida prior to fusion. Fusion may be induced by application of a DC electrical pulse across the contact/fusion plane (electrofusion), by exposure of the cells to fusion-promoting chemicals, such as polyethylene glycol, or by way of an inactivated virus, such as the Sendai virus. A reconstituted cell can be activated by electrical and/or nonelectrical means before, during, and/or after fusion of the nuclear donor and recipient oocyte. Activation methods include electric pulses, chemically induced shock, penetration by sperm, increasing levels of divalent cations in the oocyte, and reducing phosphorylation of cellular proteins (as by way of kinase inhibitors) in the oocyte. The activated reconstituted cells, or embryos, can be cultured in well-known media and then transferred to the womb of an animal. See, e.g., US 2008/0092249, WO 1999/005266, US 2004/0177390, WO 2008/017234, and US Patent No. 7,612,250, each of which is herein incorporated by reference in its entirety for all purposes.

[00177] The various methods provided herein allow for the generation of a genetically modified non-human F0 animal wherein the cells of the genetically modified F0 animal comprise the humanized ACE2 locus and the humanized TMPRSS locus. It is recognized that depending on the method used to generate the F0 animal, the number of cells within the F0 animal that have the humanized ACE2 locus and the humanized TMPRSS locus will vary. The introduction of the donor ES cells into a pre-morula stage embryo from a corresponding organism (e.g., an 8-cell stage mouse embryo) via for example, the VELOCIMOUSE® method allows for a greater percentage of the cell population of the F0 animal to comprise cells having the nucleotide sequence of interest comprising the targeted genetic modification. For example, at least 50%, 60%, 65%, 70%, 75%, 85%, 86%, 87%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% of the cellular contribution of the non-human F0 animal can comprise a cell population having the targeted modification.

[00178] The cells of the genetically modified F0 animal can be heterozygous for the humanized ACE2 locus and/or the humanized TMPRSS locus or can be homozygous for the humanized ACE2 locus and/or the humanized TMPRSS locus.

[00179] All patent filings, websites, other publications, accession numbers and the like cited above or below are incorporated by reference in their entirety for all purposes to the same extent as if each individual item were specifically and individually indicated to be so incorporated by reference. If different versions of a sequence are associated with an accession number at different times, the version associated with the accession number at the effective filing date of this application is meant. The effective filing date means the earlier of the actual filing date or filing date of a priority application referring to the accession number if applicable. Likewise, if different versions of a publication, website or the like are published at different times, the version most recently published at the effective filing date of the application is meant unless otherwise indicated. Any feature, step, element, embodiment, or aspect of the invention can be used in combination with any other unless specifically indicated otherwise. Although the present invention has been described in some detail by way of illustration and example for purposes of clarity and understanding, it will be apparent that certain changes and modifications may be practiced within the scope of the appended claims.

BRIEF DESCRIPTION OF THE SEQUENCES

[00180] The nucleotide and amino acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases, and three-letter code for amino acids. The nucleotide sequences follow the standard convention of beginning at the 5’ end of the sequence and proceeding forward (i.e., from left to right in each line) to the 3’ end. Only one strand of each nucleotide sequence is shown, but the complementary strand is understood to be included by any reference to the displayed strand. The amino acid sequences follow the standard convention of beginning at the amino terminus of the sequence and proceeding forward (i.e., from left to right in each line) to the carboxy terminus.

Table 3. Description of Sequences.

[00181] While the invention has been particularly shown and described with reference to a number of embodiments, it would be understood by those skilled in the art that changes in the form and details may be made to the various embodiments disclosed herein without departing from the spirit and scope of the invention and that the various embodiments disclosed herein are not intended to act as limitations on the scope of the claims.

EXAMPLES

[00182] The following examples are provided for illustrative purposes only and are not intended to limit the scope of the invention.

Example 1 : Generation of Mice Comprising a Humanized 4(7/2 locus

[00183] Described in this example is the generation of mice comprising a humanized ACE2 locus for use as a model useful in understanding coronavirus infection, particularly SARS-CoV and SARS-CoV-2 function, and validation of vaccination and/or treatment protocols therefor.

[00184] Illustrative (not-to- scale) diagrams of the human and mouse ACE2 gene are provided in Figure 1A. NCBI Accession number NP-001123985.1 provides an exemplary non-limiting mouse ACE2 protein amino acid sequence, which is set forth as SEQ ID NO:2. An exemplary non-limiting example of a nucleotide sequence that encodes the exemplary non-limiting mouse ACE2 protein is set forth as SEQ ID NO: 1. Nucleotides 1-51 of SEQ ID NO: 1 encodes the mouse ACE2 signal peptide (set forth as amino acids 1-17 of SEQ ID NO:2), nucleotides 52-2220 of SEQ ID NO: 1 encode the mouse ACE2 extracellular domain set forth as amino acids 18-740 of SEQ ID NO:2), nucleotides 2221-2283 of SEQ ID NO: 1 encode the mouse ACE2 transmembrane domain (set forth as amino acids 741-761 of SEQ ID NO:2), and nucleotides 2284-2418 of SEQ ID NO: 1 encode the mouse ACE2 cytoplasmic domain (set forth as amino acids 762-805 of SEQ ID NO:2). NCBI Accession number NP- 068576.1 provides an exemplary non-limiting human ACE2 protein amino acid sequence, which is set forth as SEQ ID NO:4. Amino acids 1-17 of SEQ ID NO:4 sets forth the amino acid sequence of the signal peptide of human ACE2 protein, amino acids 18-740 of SEQ ID NO:4 sets forth the amino acid sequence of the extracellular domain of human ACE2 protein, amino acids 741-761 of SEQ ID NO:4 sets forth the amino acid sequence of the transmembrane domain of human ACE2 protein, and amino acids 762-805 of SEQ ID NO:4 sets forth the amino acid sequence of the cytoplasmic domain of human ACE2 protein. An exemplary non-limiting example of a nucleotide sequence that encodes the exemplary nonlimiting mouse ACE2 protein is set forth as SEQ ID NO:3. Nucleotides 1-51 of SEQ ID NO:3 encode a signal peptide amino acid sequence of human ACE2 protein (set forth as amino acids 1-17 of SEQ ID NO:4), nucleotides 52-2220 of SEQ ID NO:3 encode an extracellular domain amino acid sequence of human ACE2 protein (set forth as amino acids 18-740 of SEQ ID NO:4), nucleotides 2221-2283 of SEQ ID NO:3 encode a transmembrane domain amino acid sequence of human ACE2 protein (set forth as amino acids 741-761 of SEQ ID NO:4), and nucleotides 2284-2418 of SEQ ID NO:3 encode a cytoplasmic domain amino acid sequence of human ACE2 protein (set forth as amino acids 762-805 of SEQ ID NO:4).

[00185] A large targeting vector comprising a 5’ homology arm comprising 14.9 kb from RP23-244L14 and 3’ homology arm comprising 126 kb from RP23-244L14 was generated to replace part of coding exon 1 through part of coding exon 17 (e.g., part of coding exon 1, intron 1, exons 2-16 and intervening introns, intron 16, and part of coding exon 17) of mouse ACE2 with the corresponding human sequence of ACE2. See, e.g., Figures 1A and IB. The targeting vector is designed to replace 45,019 bp of the mouse sequence with 36,742 bp of the human sequence, which also is modified to comprise a self-deleting floxed neo cassette (loxP-mPrml-Crei-pA-hUbl-em7-Neo-pA-loxP) inserted into the human intron 16. See, e.g., Figure IB.

[00186] Since the ACE2 locus is found on the X-chromosome, CRISPR/Cas9 components were introduced into male F1H4 mouse embryonic stem cells together with the large targeting vector described herein. Loss-of-allele assays using the primers and probes set forth in Table 4 were performed to detect loss of the endogenous mouse allele, and gain-of- allele assays using the primers and probes set forth in Table 5 were performed to detect gain of the humanized allele. Loss-of-allele and gain-of-allele assays are described, for example, in US 2014/0178879; US 2016/0145646; WO 2016/081923; and Frendewey et al. (2010) Methods Enzymol. 476:295-307, each of which is herein incorporated by reference in its entirety for all purposes.

Table 4. Mouse TAQMAN® Loss-of-Allele Assays

Table 5. Human TAQMAN® Gain-of-Allele Assays

[00187] The resulting humanized mouse ACE2 allele comprising the self-deleting floxed neo cassette comprises a nucleotide sequence as set forth in SEQ ID NO: 5 (referred to as the 7878 allele). Figure IB. Figure IB also provides the sequences at various junctions (A, B, C, D) of the 7878 allele. The nucleotide sequence of (A) the 5’ mouse/human junction is set forth as SEQ ID NO: 18. Figure IB. The nucleotide sequence of (B) the 5’ human/cassette junction is set forth as SEQ ID NO: 19. Figure IB. The nucleotide sequence of (C) the 3’ cassette/human junction is set forth as SEQ ID NO:20. Figure IB. The nucleotide sequence of (D) the 3’ human/mouse junction is set forth as SEQ ID NO:21. [00188] F0 mice were then generated using the VELOCIMOUSE® method. See, e.g., US 7,576,259; US 7,659,442; US 7,294,754; US 2008/007800; and Poueymirou et al. (2007) Nature Biotech. 25(l):91-99, each of which is herein incorporated by reference in its entirety for all purposes.

[00189] Upon removal of the self-deleting neomycin cassette with Cre recombinase, the loxP and cloning sites (77bp) remain inserted in human intron 16. Figure 1C. The resulting humanized mouse, cassette-deleted, ACE2 allele is set forth in SEQ ID NO:22 (referred to as the 7879 allele). Figure 1C. Figure 1C also provides the sequence at (E) cloning and loxp site after recombination, which sequence is set forth as SEQ ID NO:23.

[00190] The modified endogenous mouse ACE2 7879 allele encodes a chimeric human/mouse ACE2 protein under the regulatory control of endogenous mouse ACE2 promoter and other regulatory elements. The amino acid sequence of the encoded chimeric human/mouse ACE2 protein is set forth as SEQ ID NO:24. An exemplary nucleotide sequence, e.g., CDS, that encodes the chimeric human/mouse ACE protein is set forth as SEQ ID NO:25.

[00191] The chimeric human/mouse ACE2 protein comprises a mouse ACE2 signal sequence at amino acids 1-17 of SEQ ID NO:24 (the amino acid sequence of the signal sequence of the chimeric human/mouse ACE2 protein is set forth as SEQ ID NO:26 and may be encoded by nucleotides 1-51 of SEQ ID NO:25), a human ACE2 extracellular domain at amino acids 18-740 of SEQ ID NO:24 (the amino acid sequence of the extracellular domain of the chimeric human/mouse ACE2 protein is set forth as SEQ ID NO:27 and may be encoded by nucleotides 52-2220 of SEQ ID NO: 25), a mouse ACE2 transmembrane domain at amino acids 741-761 of SEQ ID NO:24 (the amino acid sequence of the transmembrane domain of the chimeric human/mouse ACE2 protein is set forth as SEQ ID NO:28 and may be encoded by nucleotides 2221-2283 of SEQ ID NO:25), and a mouse ACE2 cytoplasmic domain at amino acids 762-805 of SEQ ID NO:24 (the amino acid sequence of the cytoplasmic domain of the chimeric human/mouse ACE2 protein is set forth as SEQ ID NO:29 and may be encoded by nucleotides 2284-2418 of SEQ ID NO:25). The mouse portion(s) of the chimeric human/mouse ACE2 protein spans amino acids 1-19 and 741-805 of SEQ ID NO:24 (and may be encoded by nucleotides 1-57 and 2221-2418 of SEQ ID NO:25, respectively). The human portion of the chimeric human/mouse ACE2 protein spans amino acids 20-740 of SEQ ID NO:24 (and may be encoded by nucleotides 58-2220 of SEQ ID NO:25). Example 2: Characterization of Humanized ACE2 Mouse

[00192] Humanized ACE2 mice were compared to ACE2-null mice. ACE2-null mice lack an ACE coding sequence at an endogenous ACE2 allele (see, e.g., Figures 2A and 2B). Expression levels of chimeric human/mouse ACE2 messenger RNA (mRNA)

[00193] The expression levels of the chimeric human/mouse ACE2 mRNA were analyzed by TaqMan qRT-PCR analysis as described herein. Messenger RNA levels in this Example were analyzed by Real-Time Quantitative Polymerase Chain Reaction (RT-qPCR). Total RNA from each sample was extracted and reverse transcribed using primers that amplify across intronic boundaries of mouse and human ACE2 genes. RT-qPCR was performed using probes and primers of readily available kits.

[00194] Specifically, 1cm pieces of small intestine, colon, kidney, and liver were isolated directly into RNALater (Ambion by Life Technologies), and chilled to 4°C. Tissues were homogenized in TRIzol, and chloroform was used for phase separation. The aqueous phase, containing total RNA, was purified using the MagMAX™-96 for Microarrays Total RNA Isolation Kit (Ambion by Life Technologies) according to manufacturer’s specifications. Genomic DNA was removed using RNase-Free DNase Set (Qiagen). mRNA was reverse-transcribed into cDNA using SuperScript® VILO™ Master Mix (Invitrogen by Life Technologies). cDNA was amplified with the SensiFAST Probe Lo-ROX (Meridian) using the 12K Flex System (Thermofisher). A housekeeping control gene (Gapdh) was used to normalize any cDNA input differences. Data were reported as the comparative CT method using delta delta CT.

[00195] Negative control mRNAs were extracted from mice that are deleted in the ACE2 gene (aka ACE2-null, or knockout, or KO). Positive control mRNAs were extracted from wild-type (WT) mice. Human control mRNAs from heart (Cat# 1H30-50), kidney (Cat# 1H50-50), lung (Cat# 1H40-50), small intestine (Cat# 1H24-50), and adult human tracheal epithelial cells (Cat#504-R25a) were purchased from Cell Applications Inc. (San Diego, CA). The sequences of the primers and probes used in the analysis are provided in Table 6 below. Note that exons denoted in Table 6 are numbered counting coding exons only.

Table 6.

[00196] Figure 4 shows that post natal (P4-P7) mice comprising an endogenous ACE2 locus modified to encode a humanized ACE2 protein expresses levels of the humanized ACE2 similar to the levels of wild-type ACE2 expressed by wild-type mice. Panel A and B provide data from the two human-specific assays (hEx3-4 and hExl6-17), which amplify only humanized ACE2 allele and normal human RNA. No amplification was detected in ACE2-KO tissues. Zooming into relative levels shows that humanized ACE2 allele is expressed at lower, but detectable, levels compared with human tissues. The mExl7-18 assay is shown in panel C. This assay amplifies both mouse and humanized ACE2 allele, but does not amplify the human samples. KO mice are negative controls. The mExl 1-12 assay amplifies both human and mouse genes due to identity of the base pairs in this region between these species. Here it fails to amplify ACE2-KO tissues. The levels of humanized ACE2 mRNA in aged mice (P40) is shown in Figure 5. As shown in Figure 5, as mice age, the levels of humanized ACE2 mRNA in animal genetically expressing a humanized endogenous ACE2 locus remained low when compared to the levels of murine ACE2 mRNA in wildtype mice. Notably, the levels of murine ACE2 mRNA in wildtype mice increase in the colon, duodenum, and lungs as the mice age (Figures 4 and 5).

Expression levels of chimeric human/mouse ACE2 protein in the Respiratory and Gastrointestinal tracts

[00197] Pieces of tissues (lung, trachea, duodenum) from the same mouse cohort as qPCR experiment were harvested directly into chilled 4% paraformaldehyde and post-fixed overnight with mild rocking. Tissues were rinsed 3x with PBS, then serially dehydrated in 70%, 85%, 95%, and 100% ethanol, defatted with xylene, and embedded in paraffin.

[00198] Automated staining was performed on a Ventana Ultra autostainer (Roche Diagnostics) using Universal protocol. Four micrometer paraffin sections of different tissues were used.

[00199] Slides were deparaffinized with EZPrep (Roche Diagnostics) at 69 °C for 24 minutes; antigen retrieval was performed with CC1 buffer (Roche Diagnostics) at 100 °C for 56 minutes, followed by blocking with 10% normal horse serum (Vector Labs) for 32 minutes.

Anti-ACE2 antibody recognizing both mouse and human ectodomain (AF933, R&D Systems) or anti-ACE2 antibody recognizing only mouse ectodomain (AF3437 R&D Systems) were added at concentration of 0.5 pg/ml followed by 5-hour incubation at room temperature. Secondary horse anti-goat IgG antibodies (Vector Labs) at 1 :200 dilution were incubated for 1 hour at room temperature, followed by detection with DABMap kit (Roche diagnostics) according to the preset protocol parameters. Slides were counterstained with Hematoxylin (Roche Diagnostics) for 32 minutes followed by incubation with Bluing Reagent (Roche Diagnostics) for 8 minutes, washed with soap and tap water, dehydrated in increasing concentrations of ethyl alcohol and xylenes and mounted with Cytoseal-60 (Thermo Scientific). All images were obtained at 20x magnification.

[00200] Figures 6 and 7 are representative images that show that a mouse comprising an endogenous ACE2 locus modified to encode a ACE2 protein expresses the ACE2 protein on respiratory and gastrointestinal epithelial cells. Provided in Table 7 is a summary of ACE2 expression as observed by immunohistochemistry in various organs of an endogenous ACE2 locus modified to encode a humanized ACE2 protein (hACE2), and newborn (P7) ACE2 knockout (KO) mice.

Table 7

[00201] The expression of hACE2 in hACE2 mice was similar to ACE2 expression in WT mice and humans with few exceptions. Staining of sinusoidal staining and muscular junctions in the heart observed in WT mice was absent in hACE2 mice. Staining in the nasal mucosa, oral mucosa and tongue was present in hACE2 mice but was absent in WT mice. These exceptions were likely due to human ACE2 gene expression in hACE2 mice.

Example 3: Humanized ACE2 Mice as Models of SARS-CoV-2 Infection and Use in Assessing Anti-SARS-CoV-2 Therapies

[00202] Humanized ACE2 mice allow for the study of not only non-mouse adapted SARS-CoV-2 infection, but also the efficacy of antibodies for improving SARS-CoV-2- mediated pathologies.

[00203] Generally, mice expressing humanized ACE2 as described in Example 1 are treated with antibodies 1 day prior to infection with SARS-CoV-2. Mice are then intranasally inoculated with SARS-CoV-2 and monitored daily for weight loss and signs of clinical disease. At 2 days post-infection, mice are euthanized and lungs were harvested for lung pathology by histological analysis and qRT-PCR for viral titers.

SARS-CoV-2 Model

[00204] Humanized ACE2 mice as described in Example 1 were infected with a human SARS-CoV-2 isolate, WAI at doses of 10 2 (e.g., 10E2) PFU, 10 3 (e.g., 10E3) PFU, 10 4 (e.g., 10E4) PFU, or 10 5 (e.g., 10E5)PFU. Mice were monitored and weighed daily for 7 days. No significant weight loss was observed among mice infected with SARS-CoV-2 and control mice exposed to PBS only (Figure 8). [00205] Both plaque assay and qPCR for the presence of the N gene were performed on lung homogenates collected on days 2, 4. 7 and 10 post-infection. As shown in Figure 9, replicating virus could be isolated at day 2 at all inoculums, however, all groups cleared the virus to below the limit of detection by day 4 post-infection. This trend is confirmed by qPCR. At day 2, mice at all inoculums had detectable virus which exhibited a dose dependent pattern. At day 4 post-infection, only those mice in the 10 5 group had appreciable expression of the nucleocapsid gene of SARS-CoV-2, Figure 10.

[00206] H&E stained lung sections were examined for pathological changes and providing with a pathology score. Lung sections showed low levels of inflammatory inflitrate and damage (data not shown), demonstrating that the humanized ACE2 mice as described in Example 1 are feasible for use in antibody testing experiments and use of IxlO 5 PFU of SARS-COV-2 WAI is an appropriate inoculum.

Antibody prophylaxis against SARS-CoV-2 in SARS-CoV-2 Model

[00207] Humanized ACE2 mice were prophylactically treated at 2 days prior to infection via intraperitoneal injection with either one of two single monoclonal antibody specific for SARS-CoV-2 spike protein at 50 mg/kg, 5 mg/kg or 0.5 mg/kg or a combination of both antibodies, each at 25 mg/kg, 2.5 mg/kg, 0.25 mg/kg or 0.25 mg/kg diluted in PBS. On day 0, mice were anesthetized and intranasally infected with IxlO 5 PFU of WAI SARS-CoV-2. [00208] Figure 11 shows that 50 pg of either a single anti-spike protein antibody or 25 pg each of a combination of two anti-spike antibodies were able to reduce weight loss of mice infected with SARS-CoV-2 over the 2 days of observation.

[00209] On day 2 post-infection, mice were sacrificed, and lungs were harvested for analysis of lung pathology and viral titer. Both by plaque assay and by qPCR for the presence of the N gene, a reduction in viral titer was observed in a dose dependent fashion for both the single monoclonal antibodies as well as the combination. (Figure 12 and Figure 13). Lung sections of all infected control animals that received placebo showed SARS-CoV-2 induced inflammation characterized by minimal to mild infiltration of lymphocytes, macrophages, and neutrophils, most commonly in the peribronchiolar and perivascular areas, and less commonly in the alveolar septa and vascular wall. Necrosis was most prominently observed in the bronchiolar epithelium. Inflammation in the vascular walls (vasculitis/endothelitis) was accompanied by endothelial hypertrophy /hyperplasia, endothelial syncytia, and vascular necrosis (Figure 14). Other changes include alveolar and bronchiolar hyperplasia and syncytia, alveolar hemorrhage, and perivascular edema. Histopathological evaluation show that the the monotherapies and combination therapy reduced the lung pathology and total pathology score in a dose-dependent manner; significant reduction was observed only at 50 mg/kg. (Figure 15). This was correlated well with virus load data, which also showed a greatest reduction at 50 mg/kg (Figure 15).

Experimental Materials and Methodologies for Examples 1-3

Viruses and Cells

[00210] SARS-CoV-2 WA-1 was obtained from the CDC following isolation from a patient in Washington State (WA-1 strain - BEI #NR-52281). All virus stocks were stored at -80 °C until ready to use. VeroE6 cells from ATCC (catalog# CRL-1586) (Manassas, VA) were used for growing SAR-CoV-2 virus as well as in plaque assays. VeroE6 cells were grown in DMEM (Invitrogen, Carlsbad, CA) with 10% FBS (Atlanta Biologicals, Lawerenceville, GA), 1% penicillin/streptomycin (Gemini Bioproducts, West Sacramento, CA) and 1% L-glutamine (Gibco).

In vivo Mouse Infections

[00211] All infections were performed in an animal biosafety level 3 facility using appropriate practices including HEPA filtered BCON caging system, HEPA filtered PAPR respirators and Tyvek suiting. Animals were anesthetized using a mixture of xylazine (0.38 mg/mouse) and ketamine (1.3 mg/mouse) in 50 pL total volume by intraperitoneal injection. Mice were inoculated intranasally with 50 pL of either PBS or 1X10 5 PFU of WAI SARS- CoV-2 after which all animals were monitored daily for weight loss. Mice were euthanized at day 2, 4 and 7 post-infection and lung tissue was harvested for further analysis. All animals were housed and used in accordance with appropriate Institutional Animal Care and Use Committee guidelines.

Antibody Treatments

[00212] Mice were given monoclonal antibodies via intraperitoneal injection at 2 days prior to infection with SARS-CoV-2. Mice received 50 mg/kg, 5 mg/kg or 0.5 mg/kg of a single monoclonal antibody or a combination of 2 monoclonal antibodies each at 25 mg/kg, 2.5 mg/kg or 0.25 mg/kg (totalling 50 mg/kg, 25 mg/kg and 0.5 mg/kg combined) diluted in sterile PBS (Quality Biological) to a total volume of lOOuL.

Histology.

[00213] Lung sections were fixed in 4% paraformaldehyde (PF A) in phosphate-buffered saline (PBS) for a minimum of 48 h, after which they were sent to the Histology Core at the University of Maryland, Baltimore, for paraffin embedding and sectioning. Five-micrometer sections were prepared and used for hematoxylin and eosin (H&E) staining by the Histology Core Services. Sections were imaged at 10 X magnification and figures were put together using Adobe Photoshop and Illustrator software.

[00214] Histopathological evaluation was done by a board-certified veterinary pathologist. The following parameters were evaluated: inflammation (perivascular, vascular, peribronchiolar, septa and alveoli), necrosis (vascular, bronchioles, septa and alveoli), syncytia (vascular endothelium, bronchiolar epithelium and alveolar epithelium), hypertrophy/hyperplasia (vascular endothelium, bronchiolar and alveolar epithelium), hemorrhage (bronchioles and alveoli), edema (bronchioles and alveoli), fibrin (alveoli) and hyaline membranes (alveoli). A 0-4 severity scoring scale was used (0- within normal limits, 1-minimal, 2-mild, 3-moderate and 4- severe) to score the above 21 parameters. A total pathology score was calculated for each mouse by adding the individual histopathological feature scores, and a maximum pathology score of 84 was possible for an individual animal. Statistical analysis was performed using one-way analysis of variance followed by the Tukey’s HSD test and a P value of <0.05 was considered significant.

Plaque Assay on Lung Homogenate

[00215] SARS-CoV-2 lung titers were quantified by homogenizing mouse lungs in 1 mL phosphate buffered saline (PBS; Quality Biological Inc.) using 1.0mm glass beads (Sigma Aldrich) and a Beadruptor (Omni International Inc). VeroE6 cells are plated in 6 well plates with IxlO 5 cells per well. SARS-CoV-2 virus titer in plaque forming units was determined by plaque assay. In the plaque assay, 25 pl of the lung homogenate is added to 225ul of PBS and diluted 10 fold across a 6 point dilution curve with 200 pl of diluent added to each well. After 1 hour, a 3ml agar overlay containing DMEM is added to each well. Plates are incubated for 3 days at 37°C (5% CO2) before plaques are counted.

Example 4: Humanization of an Endogenous TMPRSS2 Gene

[00216] This example illustrates exemplary methods of humanizing an endogenous gene encoding TMPRSS2 in a rodent (e.g., a mouse). The methods described in this example can be employed to humanize an endogenous TMPRSS2 gene of a rodent using any human sequence, or combination of human sequences (or sequence fragments) as desired.

[00217] A targeting vector for humanization of an endogenous TMPRSS2 gene was constructed using bacterial artificial chromosome (BAC) clones and VELOCIGENE.RTM. technology (see, e.g., U.S. Pat. No. 6,586,251 and Valenzuela et al. (2003) High-throughput engineering of the mouse genome coupled with high-resolution expression analysis, Nature Biotech. 21(6):652-659; incorporated herein by reference). [00218] Briefly, mouse bacterial artificial chromosome (BAC) clone bMQ-264A15 containing a mouse TMPRSS2 gene was used and modified as follows. A DNA fragment was generated to include a 5’ mouse homology nucleotide sequence, a human TMPRSS2 genomic DNA of about 25,091 bp (containing the last 7 bp of coding exon 3, intron 3, and coding exon 4 through coding exon 13 (including the 3’ UTR which is part of coding exon 13), of a human TMPRSS2 gene), a self-del eting neomycin cassette of about 2,691 bp, and a 3’ mouse homology sequence. This DNA fragment was used to modify BAC clone bMQ-264A15 through homologous recombination in bacterial cells. As a result, an extracellular-encoding mouse TMPRSS2 genomic fragment (of about 25,291 bp) in the BAC clone was replaced with the human TMPRSS2 genomic fragment of about 25,091 bp, followed by a self-deleting neomycin cassette of about 2691 bp. Specifically, the mouse TMPRSS2 genomic fragment that was replaced included the last 7 bp of coding exon 3, intron 3, and coding exon 4 through the stop codon in coding exon 13 of the mouse TMPRSS2 gene (FIGS. 16A-16B). The human TMPRSS2 genomic fragment that was inserted included the last 7 bp of coding exon 3, intron 3, and coding exon 4 through coding exon 13 of a human TMPRSS2 gene (including the 3’ UTR of human TMPRSS2), and a human 3’ genomic sequence of 131 bp downstream of the 3’ UTR of human TMPRSS2 (FIGS. 16A-16B). The resulting modified BAC clone included, from 5’to 3', (i) a 5’mouse homology arm containing about 12 kb of mouse genomic DNA including a mouse TMPRSS2 5’ UTR, mouse TMPRSS2 exon 1 (noncoding), coding exons 1-3 (except the last 7 bp of coding exon 3); (ii) a human TMPRSS2 genomic fragment of about 25,091 bp including the last 7 bp of human coding exon 3, intron 3, human coding exons 4 through 13 (including the 3’ UTR of human TMPRSS2), and a human 3’ genomic sequence; (iii) a self-deleting neomycin cassette of about 2691 bp, followed by (iv) a 3’ mouse homology arm of 45 kb containing the mouse TMPRSS2 3'UTR and the remaining mouse genomic DNA in the original BAC clone. See FIGS. 16A-16B. The junction sequences are also set forth at the bottom of FIG. 16B. The part of the modified BAC clone containing the human TMPRSS2 genomic fragment and the neomycin cassette, as well as the upstream and downstream insertion junctions, is set forth in SEQ ID NO:72. The amino acid sequence of the protein encoded by the humanized TMPRSS2 gene is set forth in SEQ ID NO:74. An alignment of this humanized TMPRSS2 protein ("7010 mutant protein"), a mouse TMPRSS2 protein (SEQ ID NO: 69), and a human TMPRSS2 protein (SEQ ID NO:71), is provided in FIG. 16D.

[00219] The modified BAC clone containing the humanized TMPRSS2 gene, as described above, was used to electroporate mouse embryonic stem (ES) cells to create modified ES cells comprising a humanized TMPRSS2 gene. Positively targeted ES cells containing a humanized TMPRSS2 gene were identified by an assay (Valenzuela et al., supra) that detected the presence of the human TMPRSS2 sequences (e.g., coding exons 4-13 of human TMPRSS2) and confirmed the loss and/or retention of mouse TMPRSS2 sequences (e.g., loss of coding exons 4-13 of mouse TMPRSS2). Table 8 sets forth the primers and probes that were used to confirm humanization of an endogenous TMPRSS2 gene as described above (FIGS. 16A-16B). Once a correctly targeted ES cell clone has been selected, the neomycin selection cassette can be excised by introducing a Cre recombinase, e.g., via electroporation. Alternatively, the neomycin selection cassette can be removed by crossing the progeny generated from the ES clone with a deletor rodent strain that expresses a Cre recombinase. The humanized TMPRSS2 locus after the deletion of the cassette is depicted in FIG. 16C, with the junction sequences shown at the bottom of FIG. 16C.

[00220] Selected ES cell clones (with or without the cassette) were used to implant female mice using the VELOCIMOUSE®. method (see, e.g., U.S. Pat. No. 7,294,754 and Poueymirou et al., F0 generation mice that are essentially fully derived from the donor gene- targeted ES cells allowing immediate phenotypic analyses, 2007, Nature Biotech. 25(1):91- 99) to generate a litter of pups containing a humanized TMPRSS2 allele in the genome. Mice bearing a humanized TMPRSS2 allele can be again confirmed and identified by genotyping of DNA isolated from tail snips using a modification of allele assay (Valenzuela et al., supra) that detects the presence of the human TMPRSS2 gene sequences. Pups are genotyped and cohorts of animals heterozygous for the humanized TMPRSS2 locus are selected for characterization. Animals homozygous for the humanized TMPRSS2 locus are made by crossing heterozygous animals.

Table 8.

Example 5. Humanization of an Endogenous TMPRSS4 Gene

[00221] This example illustrates exemplary methods of humanizing an endogenous gene encoding TMPRSS4 in a rodent (e.g., a mouse). The methods described in this example can be employed to humanize an endogenous TMPRSS4 gene of a rodent using any human sequence, or combination of human sequences (or sequence fragments) as desired.

[00222] A targeting vector for humanization of an endogenous TMPRSS4 gene was constructed using bacterial artificial chromosome (BAC) clones and VELOCIGENE.RTM. technology (see, e.g., U.S. Pat. No. 6,586,251 and Valenzuela et al. (2003), supra).

[00223] Briefly, mouse bacterial artificial chromosome (BAC) clone RP23-71M15 containing a mouse TMPRSS4 gene was used and modified as follows. A DNA fragment was generated to include a 5’ mouse homology nucleotide sequence, a self-del eting neomycin cassette of about 4,996 bp, a human genomic DNA of about 14,963 bp (containing coding exon 4 through the stop codon in coding exon 13 of a human TMPRSS4 gene), and a 3’ mouse homology sequence. This DNA fragment was used to modify BAC clone RP23- 71M15 through homologous recombination in bacterial cells. As a result, an extracellular- encoding mouse genomic fragment (of about 11,074 bp) in the BAC clone was replaced with a self-deleting neomycin cassette of about 4,996 bp, followed by the human genomic DNA of about 14,963 bp. Specifically, the mouse genomic fragment that was deleted and replaced included the 3’ 130 bp of mouse intron 3, coding exon 4 through the stop codon in coding exon 13 of the mouse TMPRSS4 gene (FIGS. 17A-17B). The human genomic fragment that was inserted included a 3’ portion of human TMPRSS4 intron 3 of about 150 bp, and human TMPRSS4 coding exon 4 through the stop codon in coding exon 13 (FIGS. 17A-17B). The resulting modified BAC clone included, from 5’ to 3', a 5’ mouse homology arm containing about 44.8 kb of mouse genomic DNA (including a mouse TMPRSS4 5’ UTR, mouse TMPRSS4 coding exons 1 through 3, mouse TMPRSS4 intron 3 in part (without the 3’ 130 bp), a self-deleting neomycin cassette of about 4996 bp, a 3’ portion of human TMPRSS4 intron 3 of about 150 bp, human TMPRSS4 coding exons 4 through the stop codon in coding exon 13, followed directly by the mouse TMPRSS4 3’ UTR and the remaining mouse genomic DNA in the original BAC clone (a 3’ mouse homology arm of about 118 kb in total). See FIGS. 17A-17B. The junction sequences are also set forth at the bottom of FIG. 17B. The part of the modified BAC clone containing the neomycin cassette and the human TMPRSS4 genomic fragment, as well as the upstream and downstream insertion junctions, is set forth in SEQ ID NO:79. The amino acid sequence of the protein encoded by the humanized TMPRSS4 gene is set forth in SEQ ID NO:81. An alignment of this humanized TMPRSS4 protein ("7224 mutant pro"), a mouse TMPRSS4 protein (SEQ ID NO:76), and a human TMPRSS4 protein (SEQ ID NO:78), is provided in FIG. 17D.

[00224] The modified BAC clone containing the humanized TMPRSS4 gene, as described above, was used to electroporate mouse embryonic stem (ES) cells to create modified ES cells comprising a humanized TMPRSS4 gene. Positively targeted ES cells containing a humanized TMPRSS4 gene were identified by an assay (Valenzuela et al., supra) that detected the presence of the human TMPRSS4 sequences (e.g., coding exons 4-13 of human TMPRSS4) and confirmed the loss and/or retention of mouse TMPRSS4 sequences (e.g., loss of coding exons 4-13 of mouse TMPRSS4). Table 9 sets forth the primers and probes that were used to confirm humanization of an endogenous TMPRSS4 gene as described above (FIGS.17A-17B). Once a correctly targeted ES cell clone has been selected, the neomycin selection cassette can be excised by introducing a Cre recombinase, e.g., via electroporation. Alternatively, the neomycin selection cassette can be removed by crossing the progeny generated from the ES clone with a deletor rodent strain that expresses a Cre recombinase. The humanized TMPRSS4 locus after the deletion of the cassette is depicted in FIG. 17C, with the junction sequences shown at the bottom of FIG. 17C.

[00225] Selected ES cell clones (with or without the cassette) were used to implant female mice using the VELOCIMOUSE.RTM. method (see, e.g., U.S. Pat. No. 7,294,754 and Poueymirou et al. (2007), supra) to generate a litter of pups containing a humanized TMPRSS4 allele in the genome. Mice bearing a humanized TMPRSS4 allele were again confirmed and identified by genotyping of DNA isolated from tail snips using a modification of allele assay (Valenzuela et al., supra) that detected the presence of the human TMPRSS4 gene sequences. Pups were genotyped and cohorts of animals heterozygous for the humanized TMPRSS4 locus were selected for characterization. Animals homozygous for the humanized TMPRSS4 locus were made by crossing heterozygous animals. Table 9

Example 6. Humanization of an Endogenous TMPRSS1 Id Gene

[00226] This example illustrates exemplary methods of humanizing an endogenous gene encoding TMPRSS1 Id in a rodent (e.g., a mouse). The methods described in this example can be employed to humanize an endogenous TMPRSS1 Id gene of a rodent using any human sequence, or combination of human sequences (or sequence fragments) as desired.

[00227] A targeting vector for humanization of an endogenous TMPRSS1 Id gene was constructed using bacterial artificial chromosome (BAC) clones and VELOCIGENE.RTM. technology (see, e.g., U.S. Pat. No. 6,586,251 and Valenzuela et al. (2003), supra).

[00228] Briefly, mouse bacterial artificial chromosome (BAC) clone RP23-95N22 containing a mouse TMPRSS1 Id gene was used and modified as follows. A DNA fragment was generated to include a 5’ mouse homology nucleotide sequence, a human TMPRSS1 ID genomic DNA of about 33,927 bp (containing 444 bp at the 3’ end of intron 2, and coding exon 3 through coding exon 10 (including the 3’ UTR which is part of coding exon 10), of a human TMPRSS1 ID gene), a self-deleting neomycin cassette of about 4,996 bp, and a 3’ mouse homology sequence. This DNA fragment was used to modify BAC clone RP23- 95N22 through homologous recombination in bacterial cells. As a result, an extracellular- encoding mouse TMPRSS1 Id genomic fragment (of about 35,667 bp) in the BAC clone was replaced with the human TMPRSS1 ID genomic fragment of about 33,927 bp, followed by a self-deleting neomycin cassette of about 4,996 bp. Specifically, the mouse TMPRSS1 Id genomic fragment that was replaced included a 3’ portion of intron 2, and coding exon 3 through the stop codon in coding exon 10 of the mouse TMPRSS1 Id gene (FIGS. 18A-18B). The human TMPRSS1 ID genomic fragment that was inserted included 444 bp at the 3’ end of intron 2, and coding exon 3 through coding exon 10 of a human TMPRSS1 ID gene (including the 3’ UTR of human TMPRSS1 ID), and a human 3’ genomic sequence of about 172 bp downstream of the 3’ UTR of human TMPRSS1 ID (FIGS. 18A-18B). The resulting modified BAC clone included, from 5’to 3', (i) a 5’ mouse homology arm containing about 143 kb of mouse genomic DNA including a mouse TMPRSS1 Id 5’ UTR, mouse TMPRSS1 Id coding exons 1-2 and a 5’ portion of intron 2; (ii) a human TMPRSS1 ID genomic fragment including a 3’ portion of intron 2 and coding exons 3 through 10 (including the 3’ UTR) of human TMPRSS1 ID, and a human 3’ genomic sequence; (iii) a self-deleting neomycin cassette of about 4,996 bp, followed by (iv) a 3’ mouse homology arm of 10 kb containing the mouse TMPRSS1 Id 3 TR and the remaining mouse genomic DNA in the original BAC clone. See FIGS. 18A-18B. The junction sequences are also set forth at the bottom of FIG. 18B. The part of the modified BAC clone containing the human TMPRSS1 ID genomic fragment and the neomycin cassette, as well as the upstream and downstream insertion junctions, is set forth in SEQ ID NO:86. The amino acid sequence of the protein encoded by the humanized TMPRSS1 Id gene is set forth in SEQ ID NO:88. An alignment of this humanized TMPRSS1 Id protein ("7226 mutant pro"), a mouse

TMPRSS1 Id protein (SEQ ID NO:83), and a human TMPRSS1 ID protein (SEQ ID NO:85), is provided in FIG. 18D.

[00229] The modified BAC clone containing the humanized TMPRSS1 Id gene, as described above, is used to electroporate mouse embryonic stem (ES) cells to create modified ES cells comprising a humanized TMPRSS1 Id gene. Positively targeted ES cells containing a humanized TMPRSS1 Id gene are identified by an assay (Valenzuela et al., supra) that detects the presence of the human TMPRSS1 ID sequences (e.g., coding exons 3-10 of human TMPRSS1 ID) and confirms the loss and/or retention of mouse TMPRSS1 Id sequences (e.g., loss of coding exons 3-10 of mouse TMPRSS1 Id). Table 10 sets forth the primers and probes that were used to confirm humanization of an endogenous TMPRSS1 Id gene as described above (FIGS. 18A-18B). Once a correctly targeted ES cell clone has been selected, the neomycin selection cassette can be excised by introducing a Cre recombinase, e.g., via electroporation. Alternatively, the neomycin selection cassette can be removed by crossing the progeny generated from the ES clone with a deletor rodent strain that expresses a Cre recombinase. The humanized TMPRSS1 Id locus after the deletion of the cassette is depicted in FIG. 18C, with the junction sequences shown at the bottom of FIG.18C.

[00230] Selected ES cell clones (with or without the cassette) are used to implant female mice using the VELOCIMOUSE.RTM. method (see, e.g., U.S. Pat. No. 7,294,754 and Poueymirou et al. (2007), supra) to generate a litter of pups containing a humanized TMPRSS1 Id allele in the genome. Mice bearing a humanized TMPRSS1 Id allele are again confirmed and identified by genotyping of DNA isolated from tail snips using a modification of allele assay (Valenzuela et al., supra) that detects the presence of the human TMPRSS1 ID gene sequences. Pups are genotyped and cohorts of animals heterozygous for the humanized TMPRSS1 Id locus are selected for characterization. Animals homozygous for the humanized TMPRSS1 Id locus are made by crossing heterozygous animals.

Table 10

Example 6. Characterization of Humanized ACE2 and TMPRSS2 Mouse

[00231] Mice comprising a humanized ACE2 gene at an endogenous ACE2 locus were generated as described above. Briefly, in the ACE2 mouse, a nucleotide sequence comprising part of coding exon 1 through part of coding exon 17 of an endogneous ACE2 gene was replaced with the correspondence sequence of a human ACE2 gene such that the human ACE2 nucleotide sequence was operably inked to endogenous nucleotide sequences encoding endogenous ACE2 transmembrane and cytoplasmic domains.

[00232] Mice comprising a genetically engineered TMPRSS2 gene were generated as described above. Briefly, in the TMPRSS2 mouse, a human TMPRSS2 genomic DNA of about 25,091 bp (containing the last 7 bp of coding exon 3, intron 3, and coding exon 4 through coding exon 13 (including the 3’ UTR which is part of coding exon 13), of a human TMPRSS2 gene replaced the orthologous sequence at an endogenous TMPRSS2 locus.

[00233] Homozygous ACE2 mice described above were bred to homozygous TMPRSS2 mice to produce a mouse heterozygous for the ACE2 allele and the TMPRSS2 allele. Fl heterozygous mice generated from this cross were bred to each other to obtain mice homozygous for each allele. Such mice express chimeric human ACE2 and TMPRSS.

[00234] The presence of the genetically modified alleles in the endogenous ACE2 and TMPRSS2 loci was confirmed by TAQMAN™ screening and karyotyping using specific probes and primers described above.

[00235] Alternatively, to generate mice comprising both ACE2 allele and TMPRSS2 alleles, ES cells harboring an ACE2 modification or ES cells harboring a TMPRSS2 modification are targeted with the TMPRSS2 or ACE2 targeting vector, respectively. Mice are generated from ES cells harboring both modifications by introducing ES cells into an 8 stage mouse embryo by VELOCIMMUNE ® method and screening as described above. Fl heterozygous mice are bred to obtain homozygous mice.

[00236] All mice were housed and bred in specific pathogen-free conditions. Five week old mice comprising a humanized ACE2 locus alone or both a humanized ACE2 locus and a humanized TMPRSS2 locus were compared to ACE2-null mice. ACE2-null mice lack an ACE coding sequence at an endogenous ACE2 allele (see, e.g., Figure 1C).

[00237] Expression levels of human ACE2 and mouse or human TMPRSS2 messenger RNA (mRNA) [00238] The expression levels of human ACE2, mouse TMPRSS2, or human TMPRSS2 mRNA were analyzed by TaqMan qRT-PCR analysis as described herein. Messenger RNA levels in this Example were analyzed by Real-Time Quantitative Polymerase Chain Reaction (RT-qPCR). Total RNA from each sample was extracted and reverse transcribed using primers that amplify human ACE2, mouse TMPRSS2, and human TMPRSS2 genes. RT- qPCR was performed using probes and primers of readily available kits.

[00239] Specifically, 1cm pieces of duodenum, colon, liver, kidney, heart, lung, and trachea were isolated directly into RNALater (Ambion by Life Technologies), and chilled to 4’C. Tissues were homogenized in TRIzol, and chloroform was used for phase separation. The aqueous phase, containing total RNA, was purified using the MagMAX™-96 for Microarrays Total RNA Isolation Kit (Ambion by Life Technologies) according to manufacturer’s specifications. Genomic DNA was removed using RNase-Free DNase Set (Qiagen). mRNA was reverse-transcribed into cDNA using SuperScript® VILO™ Master Mix (Invitrogen by Life Technologies). cDNA was amplified with the SensiFAST Probe Lo- ROX (Meridian) using the 12K Flex System (Thermofisher). A housekeeping control gene (Gapdh) was used to normalize any cDNA input differences and relative levels were plotted against kidney levels in the ACE2/TMPRSS2 mouse. Negative control mRNAs were extracted from mice that are deleted in the ACE2 gene (aka ACE2-null, or knockout, or KO). Positive control mRNAs were extracted from wild-type (WT) mice.

[00240] As shown in Figures 19A and 19B, two human-specific assays (hEx3-4 and hExl6-17) amplified only the humanized Ace 2 allele. No amplification was detected in Ace2-KO tissues. Human-specific TMPRSS2 assay detected variable levels of human mRNA in double ACE2/TMPRSS2 mice (Figure 19C). KO and hACE2 mice are negative controls. The Mouse Tmprss2 assay amplified at variable levels in all mice, since only one allele of the gene is humanized in the ACE2/TMPRSS2 mice tested.

[00241] The sequences of the primers and probes used in the analysis are provided in

Table 11 below.