FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4189, 755 aa 1>>>pF1KB4189 755 - 755 aa - 755 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 10.8764+/-0.000517; mu= -4.9429+/- 0.032 mean_var=431.3298+/-85.580, 0's: 0 Z-trim(119.7): 59 B-trim: 38 in 1/55 Lambda= 0.061755 statistics sampled from 33950 (34010) to 33950 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.71), E-opt: 0.2 (0.399), width: 16 Scan time: 12.000 The best scores are: opt bits E(85289) XP_005262461 (OMIM: 300346) PREDICTED: HIV Tat-spe ( 755) 5068 466.8 1.4e-130 NP_001156752 (OMIM: 300346) HIV Tat-specific facto ( 755) 5068 466.8 1.4e-130 NP_055315 (OMIM: 300346) HIV Tat-specific factor 1 ( 755) 5068 466.8 1.4e-130 NP_001030025 (OMIM: 300029,300455,300834,304020,31 (1152) 380 49.4 0.0001 >>XP_005262461 (OMIM: 300346) PREDICTED: HIV Tat-specifi (755 aa) initn: 5068 init1: 5068 opt: 5068 Z-score: 2465.1 bits: 466.8 E(85289): 1.4e-130 Smith-Waterman score: 5068; 100.0% identity (100.0% similar) in 755 aa overlap (1-755:1-755) 10 20 30 40 50 60 pF1KB4 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB4 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB4 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB4 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB4 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD 670 680 690 700 710 720 730 740 750 pF1KB4 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI ::::::::::::::::::::::::::::::::::: XP_005 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI 730 740 750 >>NP_001156752 (OMIM: 300346) HIV Tat-specific factor 1 (755 aa) initn: 5068 init1: 5068 opt: 5068 Z-score: 2465.1 bits: 466.8 E(85289): 1.4e-130 Smith-Waterman score: 5068; 100.0% identity (100.0% similar) in 755 aa overlap (1-755:1-755) 10 20 30 40 50 60 pF1KB4 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB4 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB4 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB4 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB4 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD 670 680 690 700 710 720 730 740 750 pF1KB4 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI ::::::::::::::::::::::::::::::::::: NP_001 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI 730 740 750 >>NP_055315 (OMIM: 300346) HIV Tat-specific factor 1 [Ho (755 aa) initn: 5068 init1: 5068 opt: 5068 Z-score: 2465.1 bits: 466.8 E(85289): 1.4e-130 Smith-Waterman score: 5068; 100.0% identity (100.0% similar) in 755 aa overlap (1-755:1-755) 10 20 30 40 50 60 pF1KB4 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 MSGTNLDGNDEFDEQLRMQELYGDGKDGDTQTDAGGEPDSLGQQPTDTPYEWDLDKKAWF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 PKITEDFIATYQANYGFSNDGASSSTANVEDVHARTAEEPPQEKAPEPTDARKKGEKRKA 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 ESGWFHVEEDRNTNVYVSGLPPDITVDEFIQLMSKFGIIMRDPQTEEFKVKLYKDNQGNL 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 KGDGLCCYLKRESVELALKLLDEDEIRGYKLHVEVAKFQLKGEYDASKKKKKCKDYKKKL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 SMQQKQLDWRPERRAGPSRMRHERVVIIKNMFHPMDFEDDPLVLNEIREDLRVECSKFGQ 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 IRKLLLFDRHPDGVASVSFRDPEEADYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 EREERLRGWEAFLNAPEANRGLRRSDSVSASERAGPSRARHFSEHPSTSKMNAQETATGM 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 AFEEPIDEKKFEKTEDGGEFEEGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSC 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB4 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 SQKESEEGNPVRGSEEDSPKKESKKKTLKNDCEENGLAKESEDDLNKESEEEVGPTKESE 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB4 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 EDDSEKESDEDCSEKQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENS 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB4 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 EFEDDGSEKVLDEEGSEREFDEDSDEKEEEEDTYEKVFDDESDEKEDEEYADEKGLEAAD 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB4 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 KKAEEGDADEKLFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDD 670 680 690 700 710 720 730 740 750 pF1KB4 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI ::::::::::::::::::::::::::::::::::: NP_055 SDERGTLGGFGSVEEGPLSTGSSFILSSDDDDDDI 730 740 750 >>NP_001030025 (OMIM: 300029,300455,300834,304020,312610 (1152 aa) initn: 251 init1: 251 opt: 380 Z-score: 205.6 bits: 49.4 E(85289): 0.0001 Smith-Waterman score: 384; 30.5% identity (56.9% similar) in 364 aa overlap (356-705:741-1086) 330 340 350 360 370 380 pF1KB4 DYCIQTLDGRWFGGRQITAQAWDGTTDYQVEETSREREERLRGWEAFLNAPEANRGLRRS :: .::.::. .: . : .: :.. NP_001 EQKEREQGHQKERNQEMEEGGEEEHGEGEEEEGDREEEEEKEGEGKEEGEGEEVEGEREK 720 730 740 750 760 770 390 400 410 420 430 440 pF1KB4 DSVSASERAGPSRARHFSEHPSTSKMNAQETATGMAFEEPIDEKKFEKTE----DGGEFE . .:: :: . . . .: : . :: . . :: : .::: : NP_001 EE---GERKKEERA-------GKEEKGEEEGDQGEGEEEETEGRGEEKEEGGEVEGGEVE 780 790 800 810 820 450 460 470 480 490 500 pF1KB4 EGASENNAKESSPEKEAEEGCPEKESEEGCPKRGFEGSCSQKESEEGNPVRGSEEDSPKK :: .: . .: : : ::: :.: :: ..: ::. ...:.:::. . .:: . NP_001 EGKGEREEEEEEGEGEEEEGEGEEEEGEGEEEEG-EGK-GEEEGEEGEGEEEGEEGEGEG 830 840 850 860 870 510 520 530 540 550 pF1KB4 ESKKKTLKNDCEENGLAKE------SEDDLNKESEEEVGPTKESEE-DDSEKESDEDCSE : .. ... : .: ..: .:.. . :.::: : : :: ...: :..:. .: NP_001 EEEEGEGEGEEEGEGEGEEEEGEGEGEEEGEGEGEEEEGEGKGEEEGEEGEGEGEEEEGE 880 890 900 910 920 930 560 570 580 590 600 610 pF1KB4 KQSEDGSEREFEENGLEKDLDEEGSEKELHENVLDKELEENDSENSEFEDDGSEKVLDEE ..::: . ::.: . .::: . .:. . : :...:. : : .: :. .:: NP_001 GEGEDGEGEGEEEEGEWEGEEEEGEGEGEEEGEGEGEEGEGEGEEEEGEGEGEEEEGEEE 940 950 960 970 980 990 620 630 640 650 660 670 pF1KB4 GSEREFDEDSDE---KEEEEDTYEKVFDDESDEKEDEEYADEKGLEAADKKAEEGDADEK : :. :. : .:::: : . : : : :: :.: : .... .::...:. NP_001 GEEEGEGEEEGEGEGEEEEEGEVEGEVEGEEGEGEGEE---EEGEEEGEEREKEGEGEEN 1000 1010 1020 1030 1040 1050 680 690 700 710 720 730 pF1KB4 LFEESDDKEDEDADGKEVEDADEKLFEDDDSNEKLFDEEEDSSEKLFDDSDERGTLGGFG . . .:.:. .:: : ..:. ..: . : NP_001 ---RRNREEEEEEEGKYQETGEEENERQDGEEYKKVSKIKGSVKYGKHKTYQKKSVTNTQ 1060 1070 1080 1090 1100 1110 740 750 pF1KB4 SVEEGPLSTGSSFILSSDDDDDDI NP_001 GNGKEQRSKMPVQSKRLLKNGPSGSKKFWNNVLPHYLELK 1120 1130 1140 1150 755 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 04:56:05 2016 done: Tue Nov 8 04:56:06 2016 Total Scan time: 12.000 Total Display time: 0.040 Function used was FASTA [36.3.4 Apr, 2011]