GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:19:25 Sequence gi568815584f:75422683_75646668 : 223986 bp : 46.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4241 4280 40 -2.96 1.01 Init + 5326 5570 245 0 2 32 76 131 0.483 1.51 1.02 Intr + 5891 5924 34 2 1 131 84 19 0.723 4.03 1.03 Intr + 9436 9584 149 0 2 109 24 183 0.252 13.03 1.04 Intr + 13588 13727 140 1 2 5 103 37 0.052 -2.99 1.05 Intr + 15216 15434 219 1 0 109 59 232 0.337 20.67 1.06 Intr + 18723 18888 166 1 1 52 113 85 0.505 6.42 1.07 Intr + 31342 31458 117 1 0 31 68 118 0.351 3.68 1.08 Intr + 35105 35192 88 2 1 66 60 31 0.109 -1.93 1.09 Intr + 38744 38848 105 1 0 96 88 183 0.944 19.51 1.10 Term + 46608 46793 186 2 0 71 47 424 0.820 34.09 1.11 PlyA + 49996 50001 6 1.05 2.00 Prom + 51128 51167 40 -0.46 2.01 Init + 67020 67136 117 2 0 83 -7 135 0.114 3.70 2.02 Intr + 72439 72558 120 0 0 62 94 16 0.063 0.29 2.03 Intr + 91332 91440 109 2 1 72 15 84 0.186 -0.64 2.04 Intr + 92467 92735 269 2 2 18 94 132 0.259 4.05 2.05 Intr + 102402 102506 105 2 0 114 119 114 0.996 17.51 2.06 Term + 123780 123989 210 2 0 83 49 404 0.898 33.29 2.07 PlyA + 124291 124296 6 1.05 3.06 PlyA - 124305 124300 6 -6.84 3.05 Term - 124620 124338 283 2 1 -9 48 257 0.188 6.90 3.04 Intr - 126221 125929 293 2 2 71 85 50 0.074 -1.07 3.03 Intr - 128156 127948 209 0 2 53 51 92 0.057 0.80 3.02 Intr - 136608 136467 142 0 1 -40 4 337 0.364 12.43 3.01 Init - 139274 139146 129 2 0 78 89 72 0.825 6.45 3.00 Prom - 141502 141463 40 -3.96 4.00 Prom + 143630 143669 40 -2.26 4.01 Init + 149375 149470 96 1 0 65 81 38 0.165 0.32 4.02 Intr + 154992 155118 127 2 1 75 78 92 0.312 7.15 4.03 Intr + 156241 156959 719 2 2 13 96 718 0.328 56.25 4.04 Term + 157176 157193 18 2 0 64 42 8 0.276 -7.88 4.05 PlyA + 158568 158573 6 -1.95 5.04 PlyA - 159540 159535 6 1.05 5.03 Term - 160424 160230 195 2 0 55 33 168 0.747 5.41 5.02 Intr - 162065 161932 134 0 2 107 80 71 0.988 8.56 5.01 Init - 170914 170833 82 1 1 79 92 53 0.879 5.93 5.00 Prom - 171992 171953 40 -7.16 6.02 PlyA - 172212 172207 6 1.05 6.01 Sngl - 173524 173204 321 1 0 87 54 248 0.992 17.29 6.00 Prom - 175764 175725 40 -2.76 7.00 Prom + 177810 177849 40 -5.66 7.01 Init + 183227 183312 86 1 2 83 83 64 0.087 3.71 7.02 Intr + 187303 187378 76 1 1 58 46 66 0.010 -1.08 7.03 Intr + 199397 199538 142 1 1 78 87 134 0.993 12.23 7.04 Intr + 201930 202070 141 1 0 106 91 70 0.958 9.42 7.05 Intr + 210947 211014 68 0 2 125 107 32 0.962 7.42 7.06 Intr + 212228 212331 104 1 2 52 108 87 0.918 6.07 7.07 Intr + 216670 216780 111 1 0 75 98 60 0.897 5.19 7.08 Intr + 218273 218378 106 2 1 78 80 93 0.990 7.72 7.09 Intr + 218500 218611 112 0 1 124 105 114 0.962 16.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100063 63 1 0 83 80 45 0.828 4.35 S.002 Term + 105247 105306 60 0 0 82 44 92 0.967 2.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_1|482_aa MQPVPPPATACGRDARRPRRGALAAADAAAAAGLAPRRLPGRDRPGHRAELRGRAAAARS GHGACSRAPAPGAPPPPAPSAREVLSSGGGEGGSWGQPSEEGLLLLAFTLSLGAFPIILL LAFWISSTFGSRVTYVIQIPRERPLLQETESPRTMWPSEYINCPTEKYCPMDIRSSLFCQ IHLSPWEETGWPATPPAMMPGQIPDPSVTTGSLPGLGPLTGLPSSALTVEELKYADIRNL GAMIAPLHFLEVKLGKRPQPVKIPLFKPYLLSIYYVLSTAPGTGNRATDKAVKALHLSAP YIAVTASKSKQINTRYISYVFENAYEKSEFKITSGLLFVTVEMPRHLQTGPEKHGNGQCA SREGRLPALLAYYKQREEATPAEKAPLDEEEERRKRRREKNKVAAARCRNKKKERTEFLQ RESERLELMNAELKTQIEELKQERQQLILMLNRHRPTCIVRTDSVKTPESEGNPLLEQLE KK >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_1|1449_bp atgcaacccgtcccgccgcccgccacagcctgcgggagggacgctcggcggccgcgacgg ggggcgctggcggcggcggacgctgcagcggcggcggggctggcgccgcggcggctcccg ggccgggacaggcctgggcaccgggcggagctccgcggccgggcggcagcggcgcggagc gggcacggcgcctgcagccgggccccggccccgggggcgccgcctccccccgcaccttct gcacgggaggtgctctcctcgggcggcggggaaggcgggagctggggacagccatctgaa gagggccttttgctgctcgccttcactctcagtcttggggccttccccatcatcttgctt ctcgccttctggatctcctcgacttttggaagccgcgtgacctacgtcattcagattcca agagagaggccactcctgcaggagacagagagcccaagaaccatgtggccatctgagtac attaactgcccaacggagaagtattgtccgatggacatcagatcttccctcttctgccaa atacatctctctccatgggaagagacaggctggcctgccactcctcctgctatgatgcct gggcagatcccggacccttcggtgaccacaggctccctgccagggcttggccccctgacc gggctccccagctcggccctgactgtggaggagctgaaatacgctgacatccgcaacctc ggggccatgattgcacccttgcacttcctggaggtgaaactgggcaagaggccccagccc gtgaaaatccctttgttcaagccatatttgttaagcatctactatgtgctgagcactgca ccaggcactgggaacagagctacagataaagcagtcaaagccttacatctctcggcaccg tatattgcagtgacggcatccaaaagtaagcaaattaatacacgatatataagctatgtt tttgaaaacgcttatgaaaagtcagaattcaaaatcacatctgggctgctgtttgtcaca gtggaaatgccgcgccacctgcagacggggcctgagaaacacggaaatggccagtgtgct tctagggaaggcaggctcccagctttgctggcctactacaaacagagagaagaggccacc ccggcagagaaagctcccctagatgaggaagaggagcgaaggaaaaggcgccgggagaag aacaaagtcgcagcagcccgatgccggaacaagaagaaggagcgcacggagtttctgcag cgggaatccgagcggctggaactcatgaacgcagagctgaagacccagattgaggagctg aagcaggagcggcagcagctcatcctgatgctgaaccgacaccgccccacctgcatcgtc cggaccgacagtgtcaagacccccgagtcagaaggcaacccactgctcgagcagctcgag aagaagtga >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_2|309_aa MQRMVFLTESEEAVCSVPLQNAKKSWLVLEPSTPEHLKTDMAILMMAEAAINQVHPSCKF MESSKQLHDTYTGNIFIFQVQEEEALQEAALMPTPAAASGWLDVLSAWPYPQVEVEGKQP TVAPKDPDRQRPRAAAEGTGSHFLHFTSSCLACIDDLNWGVNPKRSSASSRRMMRTDSGL VNLRERESVGANEAKGARLLSAPRQDSSDDVRRVQRREKNRIAAQKSRQRQTQKADTLHL ESEDLEKQNAALRKEIKQLTEELKYFTSVLNSHEPLCSVLAASTPSPPEVVYSAHAFHQP HVSSPRFQP >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_2|930_bp atgcaaaggatggtgtttctcacggagtcagaggaggccgtctgctccgtgcccttacag aatgccaagaagtcatggctggtgctggagccctcaaccccagaacaccttaagactgat atggccatattaatgatggcagaggctgccattaaccaagtgcacccaagctgtaaattc atggaatcctccaaacagctccatgatacatatactggtaatatcttcatttttcaggtg caggaagaggaggccctgcaggaagcagctctgatgcccactccggctgctgccagtggc tggctggatgtgctatcagcatggccatacccacaggttgaagtagagggaaagcaaccc actgtggccccgaaagacccagaccgccagcggccccgggcagccgccgagggcaccggc tcacacttcctgcatttcacaagctcttgcttggcatgcattgatgacttgaactggggg gttaacccaaaaagatcaagtgcgagctcaaggcggatgatgagaacagattcggggctg gttaacctcagagagcgggagtccgttggtgctaatgaggccaagggagcccggctcctt tctgcccccaggcaggactcatctgatgatgtgagaagagttcagaggagggagaaaaat cgtattgccgcccagaagagccgacagaggcagacacagaaggccgacaccctgcacctg gagagcgaagacctggagaaacagaacgcggctctacgcaaggagatcaagcagctcaca gaggaactgaagtacttcacgtcggtgctgaacagccacgagcccctgtgctcggtgctg gccgccagcacgccctcgccccccgaggtggtgtacagcgcccacgcattccaccaacct catgtcagctccccgcgcttccagccctga >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_3|351_aa MEYYAAIKRNKIMSFAATWMELETITLSELTQKWKNKYCIFSLKEKEEEEEEKGEEEEEK GEEEEEEKGEEEEDEEEEEEKKEKEKKWILGAEASSLDTSSRVWTRPLVCASLDPRDGSC GGECGRNLVFQAGMSAGMNWRLLWYRKNQGRARKRQAAVRRGGKELDRALATKIQVLKEI KKAPPHDTSAQSTWRLCTERGACFATFRWETKGYNARFCAGRNASTQGTSSGLSSQALNY GAQPSVHILGSHGHQLLCLSQDLRNSVKKKEEEEEKGEKRRRRRKEEEEEEKEKEKEKET PREGIESFPSTSINHGKEELLVESLELFEGTRVQKTKKVPTSQNLECQGTG >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_3|1056_bp atggaatactatgctgccataaaaaggaacaagatcatgtcctttgcagcaacatggatg gagctggagaccattaccctgagcgaactaacacagaaatggaaaaacaaatactgcata ttctcacttaaggagaaagaggaggaggaggaggagaaaggggaggaggaggaggagaaa ggggaggaggaggaggaggagaaaggggaggaggaggaggacgaagaggaggaggaggag aagaaggagaaggagaagaaatggattctgggagctgaagcctcttccctggatacttct tcccgagtgtggaccagaccactggtatgtgcatctctggacccaagggatgggtcctgt gggggtgaatgtggacggaatttggtcttccaggctggcatgtccgccggcatgaactgg aggctgctgtggtacaggaagaaccaggggagggcaagaaagcggcaagctgcagtccgg aggggtgggaaagagctagacagagcgttagctaccaaaatccaggttctgaaggaaatt aagaaagctcctcctcatgacacatctgcccagtccacatggaggctctgcactgaaagg ggagcctgttttgccacgtttcgttgggagaccaaaggttacaatgcaagattctgtgct gggagaaatgccagcactcagggaacgtcttctgggctgtcttcccaggctctgaattac ggggctcagcccagtgtccacatcctaggctcccatgggcaccagcttctctgcctgagc caagatctgagaaattctgtcaagaagaaggaggaggaggaggagaagggggagaagagg aggaggaggagaaaggaggaggaggaggaggagaaggagaaggagaaggagaaggagaca cctagagaaggaattgaaagcttcccaagtacatcaataaaccatggaaaagaggagctg ttagtggagtccctagagctctttgagggtactagagtccaaaaaacaaagaaggtccca acatcgcagaatttagagtgtcaagggacagggtga >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_4|319_aa MGEGFLEEGTLALHCEARLWGAAGGSTCLEQKMKKLKRREVEFLGLQWHLIVAKLVFELR ENLGSLPGSLAKDRDCRSPHHFSRIPEETVAMVNEGPNQEESDDTPVPESALQADPSVSV HPSVSVHPSVSINPSVSVHPSSSAHPSALAQPSGLAHPSSSGPEDLSVIKVSRRRWAVVL VFSCYSMCNSFQWIQYGSINNIFMHFYGVSAFAIDWLSMCYMLTYIPLLLPVAWLLEKFG LRTIALTGSALNCLGAWVKLGSLKPHLFPVTVVGQLICSVAQVFILGMPSRIASVWFGAN EVSTACSVAVFGNQPKACF >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_4|960_bp atgggggaaggcttcctggaggaggggacgcttgccctgcattgtgaagcacgcttatgg ggagctgcaggtggctccacctgtctggagcaaaagatgaagaaactgaagcgcagagag gtggagttccttggtctacagtggcacctcattgtagctaagctggtgtttgaactcagg gagaatttgggctccttaccaggatccctcgccaaggacagagactgccggagtcctcac cacttctccaggattccagaggagactgtggcgatggtgaatgaaggtcccaaccaggaa gagagcgatgacacccctgtgccggagtccgcactccaagcggaccccagcgtctcggtc catcccagcgtctcggtccatcccagcgtctccatcaaccccagcgtctctgtccacccc agcagttcggcccaccccagtgccttagcccaacccagtggcttggctcaccccagtagc tcgggccctgaggacctcagcgtgatcaaggtgagcaggcgccgttgggccgtggtcctg gtgtttagctgctactccatgtgcaactcctttcagtggatccagtacggctccatcaat aacatcttcatgcacttctacggtgtcagtgcctttgccattgactggctgtccatgtgc tacatgctgacttacatccctctgctcctgccagtggcttggctgctggagaagttcggc ctgcgcaccattgctctcactggctcggctctcaactgcctgggggcctgggtgaagctg ggcagcctgaagccgcatctctttccggtcaccgtggtgggccagctcatctgctctgtg gcccaggttttcatcctgggcatgccctcccgcatcgcttccgtctggttcggggctaat gaggtttcaacagcctgctccgtggctgtctttggcaatcagccaaaagcatgcttctag >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_5|136_aa MGEDPPPADLMEADSHSRKTRLVPAEMDERERVFSLAQSHADNCRLHEPDLQEALEQFPK RTPSGTFRQIPQPLFAFTWTDPDAHQPQQLIWAVLPQGFKDSAHYFSQAQTSSSSVTYLG IILNENNVLSLLIVSS >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_5|411_bp atgggggaggatcctccaccagctgatctcatggaagcagattctcactctcgcaagact agattagttcctgcagaaatggatgaacgggaaagagttttttctctagcccaatctcac gctgataactgccggcttcacgagccagacctccaggaagcattagagcagttccccaag aggacccccagtggaactttcaggcagattccccagcctctcttcgctttcacttggact gaccctgatgcccatcagcctcagcaacttatctgggctgtactgccacaaggcttcaag gacagcgcccattacttcagtcaagcccaaacttcgtcttcatctgttacctatcttggc ataattcttaatgaaaacaatgtgctctccctgctgatcgtgtccagctaa >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_6|106_aa MNDTVTIHTRKFMINRRLQRKQTVIDILHPGKATVPKTEIWEKLAKMYKTTLDVIFVFAF KTHFGGGKTTGFGMIYDPLDYAKKNEPKHRLARHGLYEKKKTSRKQ >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_6|321_bp atgaacgacacagtaactatccacactagaaagttcatgatcaaccgacgacttcagagg aaacaaacagtcattgatatccttcaccctgggaaggcaacagtgcctaagacagaaatt tgggaaaaactagccaaaatgtacaagaccacactggatgtcatctttgtatttgcattc aaaactcattttggtggtggcaagacaaccggctttggcatgatttatgatcccctggat tatgcaaagaaaaatgaacccaaacatagacttgcaagacatggcctgtatgagaagaaa aagacctcaagaaagcaatga >gi568815584f:75422683_75646668|GENSCAN_predicted_peptide_7|316_aa MAGSPLAALEVAVCMQVYLLCSSTGKHRRIGGQSVPKHLNGALHIAYFKCENFGLGIAIG FLVPPVLVPNIEDRDELAYHISIMFYIIGGVATLLLILVIIVFKEKPKYPPSRAQSLSYA LTSPDASYLGSIARLFKNLNFVLLVITYGLNAGAFYALSTLLNRMVIWHYPGEEVNAGRI GLTIVIAGMLGAVISGIWLDRSKTYKETTLVVYIMTLVGMVVYTFTLNLGHLWVVFITAG TMGFFMTGYLPLGFEFAVELTYPESEGISSGLLNISAQVFGIIFTISQGQIIDNYGTKPG NIFLCVFLTLGAALTX >gi568815584f:75422683_75646668|GENSCAN_predicted_CDS_7|948_bp atggctgggtctcctttggcggctctagaggtagctgtctgtatgcaagtctatttgctg tgttcatctactggaaaacataggagaattggtgggcaatcagtgccaaagcatttgaat ggcgctttgcacatcgcctacttcaaatgtgaaaactttggacttggaattgcgattggg ttcttggtccctcctgttttggtacccaacattgaagaccgggacgagcttgcctaccac atcagcatcatgttctatataataggaggtgtggccactctcctcctcatccttgtcatc attgtgttcaaggagaaacctaaatatccccccagcagggcccaatccctgagctatgcc ttgacctctcctgatgcctcatacttaggttccatcgcccggctcttcaaaaatctcaac tttgtgctgcttgtcatcacctatggtctgaatgctggtgctttttatgccttgtccact cttctgaatcgcatggtgatctggcactacccgggggaagaagtgaatgctggaagaatt ggcctgacgatcgtcattgcaggaatgcttggggctgtgatctcaggaatctggctggat aggtccaaaacctacaaagagacaaccctggtagtctatatcatgacactggtgggcatg gtggtgtacacgtttaccttgaacctgggacacctgtgggtagtgttcatcactgctggc acaatgggcttctttatgactggctatctcccactgggatttgagtttgctgtggagctc acgtacccagaatcagaaggcatctcctccggcctcctcaacatatctgcacaggtattt gggatcatctttaccatctcccagggccagattattgacaactatggaaccaagcctggg aacatcttcctgtgtgtgttccttactcttggagcagccctcactgnn