PSIBLAST 2.11.0+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Stephen F. Altschul, John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: uniprot_sprot.fasta 571,282 sequences; 206,678,396 total letters Results from round 1 Query= sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute respiratory syndrome coronavirus OX=694009 GN=ORF14 PE=1 SV=1 Length=70 Score E Sequences producing significant alignments: (Bits) Value sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute res... 140 8e-45 sp|Q3I5I5|Y14_BCRP3 Uncharacterized protein 14 OS=Bat coronavirus... 127 1e-39 sp|P0DTD3|ORF9C_SARS2 Putative ORF9c protein OS=Severe acute resp... 100 1e-28 sp|O60449|LY75_HUMAN Lymphocyte antigen 75 OS=Homo sapiens OX=960... 29.6 1.6 sp|Q20849|KNTC1_CAEEL Kinetochore-associated protein rod-1 OS=Cae... 29.6 1.8 sp|Q5KUF0|SYR_GEOKA Arginine--tRNA ligase OS=Geobacillus kaustoph... 29.6 1.9 sp|Q60767|LY75_MOUSE Lymphocyte antigen 75 OS=Mus musculus OX=100... 28.9 3.2 sp|Q920P9|LY75_MESAU Lymphocyte antigen 75 OS=Mesocricetus auratu... 28.9 3.6 sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47 OS=Haemo... 28.5 3.7 sp|B5DG67|WDR12_SALSA Ribosome biogenesis protein wdr12 OS=Salmo ... 28.1 6.6 sp|Q7PZ36|U518_ANOGA FHIP family protein AGAP011705 OS=Anopheles ... 27.7 7.6 sp|Q1B7P4|LYSX_MYCSS Lysylphosphatidylglycerol biosynthesis bifun... 27.7 7.7 sp|A1UHB3|LYSX_MYCSK Lysylphosphatidylglycerol biosynthesis bifun... 27.7 7.7 >sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute respiratory syndrome coronavirus OX=694009 GN=ORF14 PE=1 SV=1 Length=70 Score = 140 bits (354), Expect = 8e-45, Method: Compositional matrix adjust. Identities = 70/70 (100%), Positives = 70/70 (100%), Gaps = 0/70 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE Sbjct 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 Query 61 VVKLPSRYCC 70 VVKLPSRYCC Sbjct 61 VVKLPSRYCC 70 >sp|Q3I5I5|Y14_BCRP3 Uncharacterized protein 14 OS=Bat coronavirus Rp3/2004 OX=349344 GN=ORF14 PE=4 SV=1 Length=70 Score = 127 bits (320), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 64/70 (91%), Positives = 65/70 (93%), Gaps = 0/70 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 MLP CYNFLKEQHCQKASTQ+ AEAAVKPLLA HHVVAVIQEIQLLAAVGEIL LEWLAE Sbjct 1 MLPSCYNFLKEQHCQKASTQKGAEAAVKPLLALHHVVAVIQEIQLLAAVGEILQLEWLAE 60 Query 61 VVKLPSRYCC 70 VKLPSRYCC Sbjct 61 AVKLPSRYCC 70 >sp|P0DTD3|ORF9C_SARS2 Putative ORF9c protein OS=Severe acute respiratory syndrome coronavirus 2 OX=2697049 GN=9c PE=5 SV=1 Length=73 Score = 100 bits (248), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 48/57 (84%), Positives = 51/57 (89%), Gaps = 0/57 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEW 57 ML CYNFLKEQHCQKASTQ+ AEAAVKPLL PHHVVA +QEIQL AAVGE+LLLEW Sbjct 1 MLQSCYNFLKEQHCQKASTQKGAEAAVKPLLVPHHVVATVQEIQLQAAVGELLLLEW 57 >sp|O60449|LY75_HUMAN Lymphocyte antigen 75 OS=Homo sapiens OX=9606 GN=LY75 PE=1 SV=3 Length=1722 Score = 29.6 bits (65), Expect = 1.6, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 0/64 (0%) Query 5 CYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVKL 64 CYNF+ ++ A+TQ E + L H++++ E + + ++L ++A V L Sbjct 1255 CYNFIITKNRHMATTQDEVHTKCQKLNPKSHILSIRDEKENNFVLEQLLYFNYMASWVML 1314 Query 65 PSRY 68 Y Sbjct 1315 GITY 1318 >sp|Q20849|KNTC1_CAEEL Kinetochore-associated protein rod-1 OS=Caenorhabditis elegans OX=6239 GN=rod-1 PE=1 SV=1 Length=2049 Score = 29.6 bits (65), Expect = 1.8, Method: Composition-based stats. Identities = 21/77 (27%), Positives = 33/77 (43%), Gaps = 11/77 (14%) Query 5 CYNFLKEQHCQKASTQREAEAAVKPLLAPHHV---VAVIQEIQLL--AAVGEILLLEWLA 59 C+ L+ T + E VKP +A H+ ++ IQ++ AAV L W Sbjct 714 CHKILQNALANPNMTHAKIEKFVKPFMAERHLDQEQTIVNYIQMMSGAAVTNANLFGWEK 773 Query 60 EVVKL------PSRYCC 70 + V+L +R CC Sbjct 774 QCVQLCASLMDETRRCC 790 >sp|Q5KUF0|SYR_GEOKA Arginine--tRNA ligase OS=Geobacillus kaustophilus (strain HTA426) OX=235909 GN=argS PE=3 SV=1 Length=557 Score = 29.6 bits (65), Expect = 1.9, Method: Composition-based stats. Identities = 20/67 (30%), Positives = 36/67 (54%), Gaps = 4/67 (6%) Query 4 PCYNFLKEQHCQKASTQREAEA---AVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 P Y +++ H + +S R+AE + LA HH+V +EI+LL +G+ + A Sbjct 433 PVY-YVQYAHARVSSILRQAEEQHISYDGDLALHHLVETEKEIELLKVLGDFPDVVAEAA 491 Query 61 VVKLPSR 67 + ++P R Sbjct 492 LKRMPHR 498 >sp|Q60767|LY75_MOUSE Lymphocyte antigen 75 OS=Mus musculus OX=10090 GN=Ly75 PE=1 SV=2 Length=1723 Score = 28.9 bits (63), Expect = 3.2, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 30/64 (47%), Gaps = 0/64 (0%) Query 5 CYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVKL 64 CYNF+ + K T E ++ + L + H +++ E + V ++L ++A V L Sbjct 1256 CYNFMITNNRHKTVTPEEVQSTCEKLHSKAHSLSIRNEEENTFVVEQLLYFNYIASWVML 1315 Query 65 PSRY 68 Y Sbjct 1316 GITY 1319 >sp|Q920P9|LY75_MESAU Lymphocyte antigen 75 OS=Mesocricetus auratus OX=10036 GN=LY75 PE=2 SV=1 Length=1722 Score = 28.9 bits (63), Expect = 3.6, Method: Composition-based stats. Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 0/64 (0%) Query 5 CYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVKL 64 CYNF+ ++ + TQ+E + + L + ++++ E + V ++L ++A V L Sbjct 1255 CYNFMITKNRHRTITQKEVHSLCQKLHSKAQILSIRNEEENNFVVEQLLYFNYIASWVML 1314 Query 65 PSRY 68 Y Sbjct 1315 GVTY 1318 >sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_1520 PE=3 SV=1 Length=355 Score = 28.5 bits (62), Expect = 3.7, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 19/34 (56%), Gaps = 0/34 (0%) Query 29 PLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVV 62 P L H+V+ VI I + GE + L+WLA + Sbjct 24 PTLKRHNVIGVINRICAALSAGEHMHLDWLARQI 57 >sp|B5DG67|WDR12_SALSA Ribosome biogenesis protein wdr12 OS=Salmo salar OX=8030 GN=wdr12 PE=2 SV=1 Length=423 Score = 28.1 bits (61), Expect = 6.6, Method: Composition-based stats. Identities = 11/19 (58%), Positives = 14/19 (74%), Gaps = 0/19 (0%) Query 52 ILLLEWLAEVVKLPSRYCC 70 ILL EW +E KL +R+CC Sbjct 167 ILLWEWNSERNKLKARHCC 185 >sp|Q7PZ36|U518_ANOGA FHIP family protein AGAP011705 OS=Anopheles gambiae OX=7165 GN=AGAP011705 PE=3 SV=4 Length=1023 Score = 27.7 bits (60), Expect = 7.6, Method: Composition-based stats. Identities = 17/46 (37%), Positives = 23/46 (50%), Gaps = 6/46 (13%) Query 30 LLAPHHV------VAVIQEIQLLAAVGEILLLEWLAEVVKLPSRYC 69 LL PH + +V E QL A+G +LL EWL E+ + C Sbjct 957 LLHPHSLEHGLTSYSVGSERQLNLAIGAVLLDEWLRELSAVTQEQC 1002 >sp|Q1B7P4|LYSX_MYCSS Lysylphosphatidylglycerol biosynthesis bifunctional protein LysX OS=Mycobacterium sp. (strain MCS) OX=164756 GN=lysX PE=3 SV=1 Length=1112 Score = 27.7 bits (60), Expect = 7.7, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 21/45 (47%), Gaps = 0/45 (0%) Query 6 YNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVG 50 Y L E QK + ++ +V PL PH +A + E L A G Sbjct 985 YEHLVEDRTQKPTFYKDFPTSVSPLTRPHRSIAGVAERWDLVAWG 1029 >sp|A1UHB3|LYSX_MYCSK Lysylphosphatidylglycerol biosynthesis bifunctional protein LysX OS=Mycobacterium sp. (strain KMS) OX=189918 GN=lysX PE=3 SV=1 Length=1112 Score = 27.7 bits (60), Expect = 7.7, Method: Composition-based stats. Identities = 15/45 (33%), Positives = 21/45 (47%), Gaps = 0/45 (0%) Query 6 YNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVG 50 Y L E QK + ++ +V PL PH +A + E L A G Sbjct 985 YEHLVEDRTQKPTFYKDFPTSVSPLTRPHRSIAGVAERWDLVAWG 1029 Lambda K H a alpha 0.324 0.135 0.422 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 5115167456 Results from round 2 Query= sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute respiratory syndrome coronavirus OX=694009 GN=ORF14 PE=1 SV=1 Length=70 Score E Sequences producing significant alignments: (Bits) Value Sequences used in model and found again: sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute res... 109 2e-32 sp|Q3I5I5|Y14_BCRP3 Uncharacterized protein 14 OS=Bat coronavirus... 108 4e-32 sp|P0DTD3|ORF9C_SARS2 Putative ORF9c protein OS=Severe acute resp... 90.1 1e-24 Sequences not found previously or not previously below threshold: sp|Q920P9|LY75_MESAU Lymphocyte antigen 75 OS=Mesocricetus auratu... 29.2 2.1 sp|Q5KUF0|SYR_GEOKA Arginine--tRNA ligase OS=Geobacillus kaustoph... 28.8 2.8 sp|O60449|LY75_HUMAN Lymphocyte antigen 75 OS=Homo sapiens OX=960... 28.8 3.6 sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47 OS=Haemo... 28.4 4.2 sp|Q60767|LY75_MOUSE Lymphocyte antigen 75 OS=Mus musculus OX=100... 28.4 4.5 >sp|Q7TLC7|Y14_SARS Uncharacterized protein 14 OS=Severe acute respiratory syndrome coronavirus OX=694009 GN=ORF14 PE=1 SV=1 Length=70 Score = 109 bits (273), Expect = 2e-32, Method: Composition-based stats. Identities = 70/70 (100%), Positives = 70/70 (100%), Gaps = 0/70 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE Sbjct 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 Query 61 VVKLPSRYCC 70 VVKLPSRYCC Sbjct 61 VVKLPSRYCC 70 >sp|Q3I5I5|Y14_BCRP3 Uncharacterized protein 14 OS=Bat coronavirus Rp3/2004 OX=349344 GN=ORF14 PE=4 SV=1 Length=70 Score = 108 bits (271), Expect = 4e-32, Method: Composition-based stats. Identities = 64/70 (91%), Positives = 65/70 (93%), Gaps = 0/70 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAE 60 MLP CYNFLKEQHCQKASTQ+ AEAAVKPLLA HHVVAVIQEIQLLAAVGEIL LEWLAE Sbjct 1 MLPSCYNFLKEQHCQKASTQKGAEAAVKPLLALHHVVAVIQEIQLLAAVGEILQLEWLAE 60 Query 61 VVKLPSRYCC 70 VKLPSRYCC Sbjct 61 AVKLPSRYCC 70 >sp|P0DTD3|ORF9C_SARS2 Putative ORF9c protein OS=Severe acute respiratory syndrome coronavirus 2 OX=2697049 GN=9c PE=5 SV=1 Length=73 Score = 90.1 bits (222), Expect = 1e-24, Method: Composition-based stats. Identities = 48/57 (84%), Positives = 51/57 (89%), Gaps = 0/57 (0%) Query 1 MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEW 57 ML CYNFLKEQHCQKASTQ+ AEAAVKPLL PHHVVA +QEIQL AAVGE+LLLEW Sbjct 1 MLQSCYNFLKEQHCQKASTQKGAEAAVKPLLVPHHVVATVQEIQLQAAVGELLLLEW 57 >sp|Q920P9|LY75_MESAU Lymphocyte antigen 75 OS=Mesocricetus auratus OX=10036 GN=LY75 PE=2 SV=1 Length=1722 Score = 29.2 bits (64), Expect = 2.1, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 32/65 (49%), Gaps = 0/65 (0%) Query 4 PCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVK 63 CYNF+ ++ + TQ+E + + L + ++++ E + V ++L ++A V Sbjct 1254 SCYNFMITKNRHRTITQKEVHSLCQKLHSKAQILSIRNEEENNFVVEQLLYFNYIASWVM 1313 Query 64 LPSRY 68 L Y Sbjct 1314 LGVTY 1318 >sp|Q5KUF0|SYR_GEOKA Arginine--tRNA ligase OS=Geobacillus kaustophilus (strain HTA426) OX=235909 GN=argS PE=3 SV=1 Length=557 Score = 28.8 bits (63), Expect = 2.8, Method: Composition-based stats. Identities = 18/63 (29%), Positives = 34/63 (54%), Gaps = 3/63 (5%) Query 8 FLKEQHCQKASTQREAEA---AVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVKL 64 +++ H + +S R+AE + LA HH+V +EI+LL +G+ + A + ++ Sbjct 436 YVQYAHARVSSILRQAEEQHISYDGDLALHHLVETEKEIELLKVLGDFPDVVAEAALKRM 495 Query 65 PSR 67 P R Sbjct 496 PHR 498 >sp|O60449|LY75_HUMAN Lymphocyte antigen 75 OS=Homo sapiens OX=9606 GN=LY75 PE=1 SV=3 Length=1722 Score = 28.8 bits (63), Expect = 3.6, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 0/64 (0%) Query 5 CYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVKL 64 CYNF+ ++ A+TQ E + L H++++ E + + ++L ++A V L Sbjct 1255 CYNFIITKNRHMATTQDEVHTKCQKLNPKSHILSIRDEKENNFVLEQLLYFNYMASWVML 1314 Query 65 PSRY 68 Y Sbjct 1315 GITY 1318 >sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_1520 PE=3 SV=1 Length=355 Score = 28.4 bits (62), Expect = 4.2, Method: Composition-based stats. Identities = 13/34 (38%), Positives = 19/34 (56%), Gaps = 0/34 (0%) Query 29 PLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVV 62 P L H+V+ VI I + GE + L+WLA + Sbjct 24 PTLKRHNVIGVINRICAALSAGEHMHLDWLARQI 57 >sp|Q60767|LY75_MOUSE Lymphocyte antigen 75 OS=Mus musculus OX=10090 GN=Ly75 PE=1 SV=2 Length=1723 Score = 28.4 bits (62), Expect = 4.5, Method: Composition-based stats. Identities = 16/65 (25%), Positives = 30/65 (46%), Gaps = 0/65 (0%) Query 4 PCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQLLAAVGEILLLEWLAEVVK 63 CYNF+ + K T E ++ + L + H +++ E + V ++L ++A V Sbjct 1255 SCYNFMITNNRHKTVTPEEVQSTCEKLHSKAHSLSIRNEEENTFVVEQLLYFNYIASWVM 1314 Query 64 LPSRY 68 L Y Sbjct 1315 LGITY 1319 Lambda K H a alpha 0.311 0.136 0.450 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0415 0.140 1.90 42.6 43.6 Effective search space used: 5115167456 Search has CONVERGED! Database: uniprot_sprot.fasta Posted date: Apr 3, 2024 12:05 PM Number of letters in database: 206,678,396 Number of sequences in database: 571,282 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40