FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE6567, 181 aa 1>>>pF1KE6567 181 - 181 aa - 181 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.0820+/-0.000298; mu= 3.0703+/- 0.019 mean_var=231.5074+/-47.352, 0's: 0 Z-trim(125.3): 7 B-trim: 2742 in 1/60 Lambda= 0.084293 statistics sampled from 48703 (48710) to 48703 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.85), E-opt: 0.2 (0.571), width: 16 Scan time: 4.200 The best scores are: opt bits E(85289) NP_001135392 (OMIM: 300879) glycoprotein Xg isofor ( 181) 1333 173.3 1.9e-43 NP_780778 (OMIM: 300879) glycoprotein Xg isoform 1 ( 180) 1316 171.2 8.1e-43 XP_005274644 (OMIM: 300879) PREDICTED: glycoprotei ( 196) 984 130.9 1.2e-30 NP_001135391 (OMIM: 300879) glycoprotein Xg isofor ( 195) 967 128.8 5.1e-30 XP_011543877 (OMIM: 300879) PREDICTED: glycoprotei ( 130) 960 127.8 7e-30 XP_016885276 (OMIM: 300879) PREDICTED: glycoprotei ( 182) 722 99.0 4.6e-21 >>NP_001135392 (OMIM: 300879) glycoprotein Xg isoform 3 (181 aa) initn: 1333 init1: 1333 opt: 1333 Z-score: 900.8 bits: 173.3 E(85289): 1.9e-43 Smith-Waterman score: 1333; 100.0% identity (100.0% similar) in 181 aa overlap (1-181:1-181) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SDNTHGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHEPEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 SDNTHGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHEPEN 130 140 150 160 170 180 pF1KE6 V : NP_001 V >>NP_780778 (OMIM: 300879) glycoprotein Xg isoform 1 pre (180 aa) initn: 1028 init1: 705 opt: 1316 Z-score: 889.7 bits: 171.2 E(85289): 8.1e-43 Smith-Waterman score: 1316; 99.4% identity (99.4% similar) in 181 aa overlap (1-181:1-180) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_780 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN ::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::: NP_780 SGGNIYPRPKPRPQPQPGNSGNSGG-YFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 70 80 90 100 110 130 140 150 160 170 180 pF1KE6 SDNTHGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHEPEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_780 SDNTHGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHEPEN 120 130 140 150 160 170 pF1KE6 V : NP_780 V 180 >>XP_005274644 (OMIM: 300879) PREDICTED: glycoprotein Xg (196 aa) initn: 968 init1: 968 opt: 984 Z-score: 671.0 bits: 130.9 E(85289): 1.2e-30 Smith-Waterman score: 1293; 92.3% identity (92.3% similar) in 196 aa overlap (1-181:1-196) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_005 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 70 80 90 100 110 120 130 140 150 160 pF1KE6 SDNTHG---------------GDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK :::::: ::::::::::::::::::::::::::::::::::::::: XP_005 SDNTHGRGGYRLNSRYGNTYGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK 130 140 150 160 170 180 170 180 pF1KE6 LNNRRNCFRTHEPENV :::::::::::::::: XP_005 LNNRRNCFRTHEPENV 190 >>NP_001135391 (OMIM: 300879) glycoprotein Xg isoform 2 (195 aa) initn: 792 init1: 684 opt: 967 Z-score: 659.9 bits: 128.8 E(85289): 5.1e-30 Smith-Waterman score: 1276; 91.8% identity (91.8% similar) in 196 aa overlap (1-181:1-195) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN ::::::::::::::::::::::::: :::::::::::::::::::::::::::::::::: NP_001 SGGNIYPRPKPRPQPQPGNSGNSGG-YFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 70 80 90 100 110 130 140 150 160 pF1KE6 SDNTHG---------------GDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK :::::: ::::::::::::::::::::::::::::::::::::::: NP_001 SDNTHGRGGYRLNSRYGNTYGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK 120 130 140 150 160 170 170 180 pF1KE6 LNNRRNCFRTHEPENV :::::::::::::::: NP_001 LNNRRNCFRTHEPENV 180 190 >>XP_011543877 (OMIM: 300879) PREDICTED: glycoprotein Xg (130 aa) initn: 960 init1: 960 opt: 960 Z-score: 657.4 bits: 127.8 E(85289): 7e-30 Smith-Waterman score: 960; 100.0% identity (100.0% similar) in 125 aa overlap (1-125:1-125) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_011 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE6 SDNTHGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFKLNNRRNCFRTHEPEN ::::: XP_011 SDNTHEPENV 130 >>XP_016885276 (OMIM: 300879) PREDICTED: glycoprotein Xg (182 aa) initn: 706 init1: 706 opt: 722 Z-score: 499.2 bits: 99.0 E(85289): 4.6e-21 Smith-Waterman score: 1155; 85.2% identity (85.2% similar) in 196 aa overlap (1-181:1-182) 10 20 30 40 50 60 pF1KE6 MESWWGLPCLAFLCFLMHARGQRDFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPD :::::::::::::::::::: :::::::::::::::::::::::::: XP_016 MESWWGLPCLAFLCFLMHAR--------------EPTKKPNSDIYPKPKPPYYPQPENPD 10 20 30 40 70 80 90 100 110 120 pF1KE6 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: XP_016 SGGNIYPRPKPRPQPQPGNSGNSGGSYFNDVDRDDGRYPPRPRPRPPAGGGGGGYSSYGN 50 60 70 80 90 100 130 140 150 160 pF1KE6 SDNTHG---------------GDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK :::::: ::::::::::::::::::::::::::::::::::::::: XP_016 SDNTHGRGGYRLNSRYGNTYGGDHHSTYGNPEGNMVAKIVSPIVSVVVVTLLGAAASYFK 110 120 130 140 150 160 170 180 pF1KE6 LNNRRNCFRTHEPENV :::::::::::::::: XP_016 LNNRRNCFRTHEPENV 170 180 181 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Tue Nov 8 14:28:32 2016 done: Tue Nov 8 14:28:33 2016 Total Scan time: 4.200 Total Display time: -0.040 Function used was FASTA [36.3.4 Apr, 2011]