FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE2175, 124 aa 1>>>pF1KE2175 124 - 124 aa - 124 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.3650+/-0.000366; mu= 4.7809+/- 0.022 mean_var=137.6763+/-29.214, 0's: 0 Z-trim(117.4): 200 B-trim: 308 in 1/50 Lambda= 0.109306 statistics sampled from 29179 (29434) to 29179 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.737), E-opt: 0.2 (0.345), width: 16 Scan time: 4.910 The best scores are: opt bits E(85289) NP_055419 (OMIM: 616484) tax1-binding protein 3 is ( 124) 804 137.5 5.4e-33 NP_001191627 (OMIM: 616484) tax1-binding protein 3 ( 98) 355 66.6 9.5e-12 NP_001158067 (OMIM: 609730) PDZ domain-containing (1036) 213 45.2 0.00027 XP_016865126 (OMIM: 606944) PREDICTED: protein LAP (1298) 204 43.9 0.00085 NP_001006600 (OMIM: 606944) erbin isoform 7 [Homo (1302) 204 43.9 0.00085 NP_001240630 (OMIM: 606944) erbin isoform 9 [Homo (1367) 198 43.0 0.0017 NP_061165 (OMIM: 606944) erbin isoform 2 [Homo sap (1371) 198 43.0 0.0017 NP_001240626 (OMIM: 606944) erbin isoform 1 [Homo (1412) 198 43.0 0.0017 XP_016865125 (OMIM: 606944) PREDICTED: protein LAP (1415) 198 43.0 0.0017 NP_001240628 (OMIM: 606944) erbin isoform 8 [Homo (1419) 198 43.0 0.0017 XP_005248612 (OMIM: 606944) PREDICTED: protein LAP (1456) 198 43.0 0.0018 XP_016865124 (OMIM: 606944) PREDICTED: protein LAP (1460) 198 43.0 0.0018 XP_005248611 (OMIM: 606944) PREDICTED: protein LAP (1460) 198 43.0 0.0018 XP_016857375 (OMIM: 614453) PREDICTED: leucine-ric (1594) 195 42.6 0.0026 NP_001317564 (OMIM: 614453) leucine-rich repeat-co (1495) 191 41.9 0.0039 XP_016857379 (OMIM: 614453) PREDICTED: leucine-ric (1528) 191 41.9 0.0039 NP_065845 (OMIM: 614453) leucine-rich repeat-conta (1537) 191 41.9 0.0039 XP_016857376 (OMIM: 614453) PREDICTED: leucine-ric (1547) 191 41.9 0.004 XP_016857374 (OMIM: 614453) PREDICTED: leucine-ric (1594) 191 41.9 0.004 NP_060843 (OMIM: 609411) synaptojanin-2-binding pr ( 145) 172 37.9 0.0061 >>NP_055419 (OMIM: 616484) tax1-binding protein 3 isofor (124 aa) initn: 804 init1: 804 opt: 804 Z-score: 713.3 bits: 137.5 E(85289): 5.4e-33 Smith-Waterman score: 804; 100.0% identity (100.0% similar) in 124 aa overlap (1-124:1-124) 10 20 30 40 50 60 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE2 SEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_055 SEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQ 70 80 90 100 110 120 pF1KE2 SMLS :::: NP_055 SMLS >>NP_001191627 (OMIM: 616484) tax1-binding protein 3 iso (98 aa) initn: 355 init1: 355 opt: 355 Z-score: 331.9 bits: 66.6 E(85289): 9.5e-12 Smith-Waterman score: 575; 79.0% identity (79.0% similar) in 124 aa overlap (1-124:1-98) 10 20 30 40 50 60 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRV ::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGGIDQDPSQNPFSEDKTDK------- 10 20 30 40 50 70 80 90 100 110 120 pF1KE2 SEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQ ::::::::::::::::::::::::::::::::::::::::: NP_001 -------------------VNGWDMTMVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQ 60 70 80 90 pF1KE2 SMLS :::: NP_001 SMLS >>NP_001158067 (OMIM: 609730) PDZ domain-containing RING (1036 aa) initn: 203 init1: 98 opt: 213 Z-score: 198.1 bits: 45.2 E(85289): 0.00027 Smith-Waterman score: 215; 38.7% identity (64.9% similar) in 111 aa overlap (7-116:221-318) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIGGG .: : :..: :: :::.: :: NP_001 RYQEKFTQYMAHVRNFVGDLGGGHRRDGEHKPFTIVLER---------ENDTLGFNIIGG 200 210 220 230 240 40 50 60 70 80 90 pF1KE2 IDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIA-GLQIGDKIMQVNGWDMTMVTHDQARK :.:: .: . .::::... :.:::. : ::.: ::::.::: :.. .::..: . NP_001 ---RPNQNN-QEGTSTEGIYVSKILENGPADRADGLEIHDKIMEVNGKDLSKATHEEAVE 250 260 270 280 290 100 110 120 pF1KE2 RLTKRSEEVVRLLVTRQSLQKAVQQSMLS . . .: .: .. : :.. NP_001 AFRNAKEPIVVQVLRRTPLSRPAYGMASEVQLMNASTQTDITFEHIMALAKLRPPTPPVP 300 310 320 330 340 350 >>XP_016865126 (OMIM: 606944) PREDICTED: protein LAP2 is (1298 aa) initn: 171 init1: 83 opt: 204 Z-score: 189.2 bits: 43.9 E(85289): 0.00085 Smith-Waterman score: 204; 37.9% identity (63.8% similar) in 116 aa overlap (2-112:1186-1295) 10 20 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLI--- : .: . :....: .::.. . . XP_016 SDFNYSRTSPSKRPNARVGSEHSLLDPPGKSKVPRDWREQVLRHIEAKKLEKIRVRVEKD 1160 1170 1180 1190 1200 1210 30 40 50 60 70 80 pF1KE2 --LGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMT :::::.::. ::: : : ::.::::. ::: :: ::::.:.::... XP_016 PELGFSISGGVG--GRGNPFRPD--DDGIFVTRVQPEGPAS-KLLQPGDKIIQANGYSFI 1220 1230 1240 1250 1260 1270 90 100 110 120 pF1KE2 MVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : :: . : : ...:.:...:. XP_016 NIEHGQAVS-LLKTFQNTVELIIVREVSS 1280 1290 >>NP_001006600 (OMIM: 606944) erbin isoform 7 [Homo sapi (1302 aa) initn: 171 init1: 83 opt: 204 Z-score: 189.2 bits: 43.9 E(85289): 0.00085 Smith-Waterman score: 204; 37.9% identity (63.8% similar) in 116 aa overlap (2-112:1190-1299) 10 20 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLI--- : .: . :....: .::.. . . NP_001 SDFNYSRTSPSKRPNARVGSEHSLLDPPGKSKVPRDWREQVLRHIEAKKLEKIRVRVEKD 1160 1170 1180 1190 1200 1210 30 40 50 60 70 80 pF1KE2 --LGFSIGGGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMT :::::.::. ::: : : ::.::::. ::: :: ::::.:.::... NP_001 PELGFSISGGVG--GRGNPFRPD--DDGIFVTRVQPEGPAS-KLLQPGDKIIQANGYSFI 1220 1230 1240 1250 1260 1270 90 100 110 120 pF1KE2 MVTHDQARKRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : :: . : : ...:.:...:. NP_001 NIEHGQAVS-LLKTFQNTVELIIVREVSS 1280 1290 1300 >>NP_001240630 (OMIM: 606944) erbin isoform 9 [Homo sapi (1367 aa) initn: 171 init1: 83 opt: 198 Z-score: 183.8 bits: 43.0 E(85289): 0.0017 Smith-Waterman score: 198; 38.9% identity (63.9% similar) in 108 aa overlap (5-112:1266-1364) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIG : : . :..... .. : :::::. NP_001 VARHPSREQLIDYLMLKVAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPE---LGFSIS 1240 1250 1260 1270 1280 1290 40 50 60 70 80 90 pF1KE2 GGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR ::. ::: : : ::.::::. ::: :: ::::.:.::... . : :: NP_001 GGVG--GRGNPFRPD--DDGIFVTRVQPEGPASKL-LQPGDKIIQANGYSFINIEHGQAV 1300 1310 1320 1330 1340 100 110 120 pF1KE2 KRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : : ...:.:...:. NP_001 S-LLKTFQNTVELIIVREVSS 1350 1360 >>NP_061165 (OMIM: 606944) erbin isoform 2 [Homo sapiens (1371 aa) initn: 171 init1: 83 opt: 198 Z-score: 183.8 bits: 43.0 E(85289): 0.0017 Smith-Waterman score: 198; 38.9% identity (63.9% similar) in 108 aa overlap (5-112:1270-1368) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIG : : . :..... .. : :::::. NP_061 VARHPSREQLIDYLMLKVAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPE---LGFSIS 1240 1250 1260 1270 1280 1290 40 50 60 70 80 90 pF1KE2 GGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR ::. ::: : : ::.::::. ::: :: ::::.:.::... . : :: NP_061 GGVG--GRGNPFRPD--DDGIFVTRVQPEGPASKL-LQPGDKIIQANGYSFINIEHGQAV 1300 1310 1320 1330 1340 1350 100 110 120 pF1KE2 KRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : : ...:.:...:. NP_061 S-LLKTFQNTVELIIVREVSS 1360 1370 >>NP_001240626 (OMIM: 606944) erbin isoform 1 [Homo sapi (1412 aa) initn: 171 init1: 83 opt: 198 Z-score: 183.6 bits: 43.0 E(85289): 0.0017 Smith-Waterman score: 198; 38.9% identity (63.9% similar) in 108 aa overlap (5-112:1311-1409) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIG : : . :..... .. : :::::. NP_001 VARHPSREQLIDYLMLKVAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPE---LGFSIS 1290 1300 1310 1320 1330 40 50 60 70 80 90 pF1KE2 GGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR ::. ::: : : ::.::::. ::: :: ::::.:.::... . : :: NP_001 GGVG--GRGNPFRPD--DDGIFVTRVQPEGPASKL-LQPGDKIIQANGYSFINIEHGQAV 1340 1350 1360 1370 1380 1390 100 110 120 pF1KE2 KRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : : ...:.:...:. NP_001 S-LLKTFQNTVELIIVREVSS 1400 1410 >>XP_016865125 (OMIM: 606944) PREDICTED: protein LAP2 is (1415 aa) initn: 171 init1: 83 opt: 198 Z-score: 183.6 bits: 43.0 E(85289): 0.0017 Smith-Waterman score: 198; 38.9% identity (63.9% similar) in 108 aa overlap (5-112:1314-1412) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIG : : . :..... .. : :::::. XP_016 VARHPSREQLIDYLMLKVAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPE---LGFSIS 1290 1300 1310 1320 1330 1340 40 50 60 70 80 90 pF1KE2 GGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR ::. ::: : : ::.::::. ::: :: ::::.:.::... . : :: XP_016 GGVG--GRGNPFRPD--DDGIFVTRVQPEGPASKL-LQPGDKIIQANGYSFINIEHGQAV 1350 1360 1370 1380 1390 100 110 120 pF1KE2 KRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : : ...:.:...:. XP_016 S-LLKTFQNTVELIIVREVSS 1400 1410 >>NP_001240628 (OMIM: 606944) erbin isoform 8 [Homo sapi (1419 aa) initn: 171 init1: 83 opt: 198 Z-score: 183.6 bits: 43.0 E(85289): 0.0017 Smith-Waterman score: 198; 38.9% identity (63.9% similar) in 108 aa overlap (5-112:1318-1416) 10 20 30 pF1KE2 MSYIPGQPVTAVVQRVEIHKLRQGENLILGFSIG : : . :..... .. : :::::. NP_001 VARHPSREQLIDYLMLKVAHQPPYTQPHCSPRQGHELAKQEIRVRVEKDPE---LGFSIS 1290 1300 1310 1320 1330 1340 40 50 60 70 80 90 pF1KE2 GGIDQDPSQNPFSEDKTDKGIYVTRVSEGGPAEIAGLQIGDKIMQVNGWDMTMVTHDQAR ::. ::: : : ::.::::. ::: :: ::::.:.::... . : :: NP_001 GGVG--GRGNPFRPD--DDGIFVTRVQPEGPASKL-LQPGDKIIQANGYSFINIEHGQAV 1350 1360 1370 1380 1390 100 110 120 pF1KE2 KRLTKRSEEVVRLLVTRQSLQKAVQQSMLS . : : ...:.:...:. NP_001 S-LLKTFQNTVELIIVREVSS 1400 1410 124 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sun Nov 6 14:00:49 2016 done: Sun Nov 6 14:00:50 2016 Total Scan time: 4.910 Total Display time: -0.020 Function used was FASTA [36.3.4 Apr, 2011]