GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:10:52 Sequence gi568815597r:150633049_150864763 : 231715 bp : 40.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 39 34 6 1.05 1.03 Term - 1262 207 1056 2 0 1 42 357 0.392 13.53 1.02 Intr - 2791 2544 248 2 2 77 98 117 0.207 7.86 1.01 Init - 7483 7336 148 1 1 93 35 35 0.179 -1.00 1.00 Prom - 13809 13770 40 -3.95 2.46 PlyA - 14038 14033 6 1.05 2.45 Term - 15700 15273 428 2 2 117 42 502 0.756 42.98 2.44 Intr - 17689 17632 58 1 1 61 68 8 0.166 -6.36 2.43 Intr - 25175 24883 293 0 2 99 33 174 0.316 8.93 2.42 Intr - 25662 25498 165 1 0 46 26 117 0.418 0.31 2.41 Intr - 28880 28766 115 2 1 82 99 89 0.994 8.50 2.40 Intr - 30715 30584 132 1 0 121 79 126 0.999 14.92 2.39 Intr - 61802 61608 195 2 0 67 80 206 0.638 16.39 2.38 Intr - 64727 64617 111 2 0 50 92 42 0.500 0.46 2.37 Intr - 70345 70262 84 1 0 94 54 71 0.878 3.40 2.36 Intr - 71146 71070 77 2 2 114 75 34 0.963 3.02 2.35 Intr - 71295 71229 67 0 1 38 116 67 0.963 2.06 2.34 Intr - 73761 73505 257 1 2 67 84 172 0.951 10.94 2.33 Intr - 75359 75208 152 1 2 67 74 97 0.438 5.09 2.32 Intr - 114831 114729 103 1 1 78 90 117 0.873 9.21 2.31 Intr - 117123 116958 166 0 1 103 43 170 0.978 12.61 2.30 Intr - 118960 118733 228 1 0 85 74 290 0.994 24.44 2.29 Intr - 122102 121953 150 2 0 87 89 128 0.991 12.24 2.28 Intr - 124932 124810 123 0 0 82 78 57 0.915 3.96 2.27 Intr - 131716 131590 127 0 1 80 71 48 0.441 2.06 2.26 Intr - 166225 166120 106 2 1 52 95 84 0.926 3.85 2.25 Intr - 166661 166496 166 2 1 88 116 165 0.992 17.91 2.24 Intr - 171191 170973 219 2 0 71 94 271 0.985 23.58 2.23 Intr - 172968 172813 156 0 0 107 78 154 0.957 15.59 2.22 Intr - 175277 175221 57 2 0 62 54 79 0.100 0.06 2.21 Intr - 180290 180124 167 0 2 76 110 199 0.996 19.66 2.20 Intr - 181188 181029 160 0 1 89 97 146 0.738 14.24 2.19 Intr - 183358 183211 148 0 1 95 98 150 0.999 16.02 2.18 Intr - 183842 183740 103 0 1 114 99 -15 0.998 0.51 2.17 Intr - 184154 184034 121 2 1 106 79 103 0.999 10.35 2.16 Intr - 184385 184313 73 1 1 26 109 73 0.993 1.69 2.15 Intr - 184982 184872 111 1 0 96 98 97 0.999 10.08 2.14 Intr - 190297 190146 152 1 2 53 47 106 0.813 1.04 2.13 Intr - 193569 193495 75 0 0 91 93 34 0.885 2.99 2.12 Intr - 196179 196045 135 0 0 109 94 114 0.731 13.94 2.11 Intr - 198066 198053 14 1 2 44 110 9 0.416 -7.72 2.10 Intr - 198855 198770 86 2 2 81 110 43 0.677 4.34 2.09 Intr - 199351 199286 66 0 0 92 94 24 0.434 0.50 2.08 Intr - 201592 201490 103 2 1 79 105 50 0.830 4.11 2.07 Intr - 203445 203232 214 0 1 110 94 169 0.807 17.07 2.06 Intr - 206606 206393 214 1 1 101 20 218 0.790 14.10 2.05 Intr - 209420 209376 45 1 0 69 91 87 0.927 3.71 2.04 Intr - 209844 209677 168 2 0 60 89 46 0.502 0.04 2.03 Intr - 213259 213215 45 0 0 118 90 82 0.995 8.01 2.02 Intr - 219758 219714 45 1 0 74 94 42 0.498 0.01 2.01 Intr - 225412 225301 112 2 1 85 96 181 0.678 17.12 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 32761 32723 39 1 0 73 101 24 0.900 2.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:150633049_150864763|GENSCAN_predicted_peptide_1|483_aa MGIYFTWVPKRESAKGDGLSLILIGFGIGGEVESNVLRAGVDLTKYILKGGMRDLRKEGD TETKYRERKVGLGKWRSAYSGPAPAPVSEFPQYLLIIIGRFSERGMWQDNRVIVERRSAG KHVNKRLRIINKEQVRQAFINSGAWQIGLANFVGIIDHHYPKTKIFQFLKLTTWILPKIT RREPLENALTVFTDGSSNGKAAYTGPKEQVIKTQYQLARRAELVAVITVLQDFDQPINIV SDSAYVVQATRDVKTALIKYSMDDQLNQLFNLLQQTVRKRNFPFYITHIRAHTNLPGPLT KANEPADLLVSSAFIKAQKFHALTHVNAAGLKNKFDVTWKQAKDIVRHCTQCQVLHLPTQ EAGVNPRGLCPNALRQMDVTHVPSFGRLLYVHVTVDTYSHFIWATCQTGESTSHVKKHLL PCFAVMRVPEKIKTDNGAGYCSKAFQKFLSQWKISHTTGIPYNSQGQAIVERTNRTLKTQ LVK >gi568815597r:150633049_150864763|GENSCAN_predicted_CDS_1|1452_bp atgggtatttatttcacctgggtgccgaaaagagagtcagcgaagggagatggattatca ttaattcttataggttttgggataggcggtgaagttgagagcaatgttttgcgggcaggg gtggatctcacaaagtacattctcaagggtggaatgagagacttaagaaaagaaggagac acagagacaaagtatagagaaagaaaagtgggcttagggaagtggcgctcagcatacagt ggacccgcaccggcaccagtctctgagttccctcagtatttattgatcattattgggcgt ttctcggagagggggatgtggcaggacaatagggtaatagtggagagaaggtcagcagga aaacatgtgaacaaacgtctccgcatcataaacaaggaacaagttagacaagcctttatc aattctggtgcatggcagattggtcttgctaattttgtgggaattattgatcatcattac ccaaaaacaaaaatcttccagtttttaaaattaactacttggattctacctaaaattacc agacgtgaacctttagaaaatgctctgacagtgtttactgatggttccagcaatggaaaa gcggcttacacagggccaaaagagcaagtaatcaaaactcaatatcaattggctcgaagg gcagagttggttgcagtcattacagtgttacaagattttgatcaacctatcaatattgta tcagattctgcatatgtagtacaggctacaagggatgtaaagacagctctaattaaatat agcatggatgatcagttaaaccagctattcaatttattacaacaaactgtaagaaaaaga aatttcccattttatattactcatattcgagcacacactaatttaccagggcctttgact aaagcaaatgaaccagctgacttactggtatcatctgcattcataaaagcacaaaaattt catgctttgactcatgtaaatgcagcaggattaaaaaacaaatttgatgtcacatggaaa caggcaaaagatattgtacgacattgcacccagtgtcaagtcctacacctgcccactcaa gaggcaggagttaatcccaggggtctgtgtcctaatgcattacggcaaatggatgtcacg catgtaccttcatttggaagattattatatgttcatgtaacagttgatacttattcacat ttcatatgggcaacctgccagacaggagaaagtacttcccatgttaaaaaacatttatta ccttgttttgctgtaatgagagttccagaaaaaattaaaactgacaatggggcaggatac tgtagtaaagctttccaaaaattcttaagtcagtggaaaatttcacatacaacaggaatt ccttataattcccaaggacaggccatagttgaaagaactaatagaacactcaaaactcaa ttagttaaataa >gi568815597r:150633049_150864763|GENSCAN_predicted_peptide_2|2030_aa XMTSDVPSLGPAIASGNSGPGIQGGGAIVQRAIKRRPGLDFDDDGEGNSKFLRCDDDQMS NDKERFARLGAVHENGIPEFPWLVLVQTLYIFGPYRDRLAKCFLCLDHVNLLYIGFAFML TFILSDDEQSSADKERLARENHSEIERRRRNKMTAYITELSDMVPTCSALARKPDKLTIL RMAVSHMKSLRGTGNTSTDGSYKPSFLTDQELKHLILEAADGFLFIVSCETGRVVYVSDS VTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLSTSENALTGRILDLKTGTVKKEGQQSS MRMCMGSRRSFICRMRCGSSSVDPVSVNRLSFVRNRCRNGLGSVKDGEPHFVVVHCTGYI KAWPPAVASPRVTSSPNCTDMSNVCQPTEFISRHNIEGIFTFVDHRCVATVGYQPQELLG KNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFRSKNQEWLWMRTSSFTFQNPYSDE IEYIICTNTNVKNSSQEPRPTLSNTIQRPQLGPTANLPLEMGSGQLAPRQQQQQTELDMV PGRDGLASYNHSQVVQPVTTTGPEHSKPLEKSDGLFAQDRDPRFSEIYHNINADQSKGIS SSTVPATQQLFSQGNTFPPTPRPAENFRNSGLAPPVTIVQPSASAGQMLAQISRHSNPTQ GATPTWTPTTRSGFSAQVATQATAKTRTSQFGVGSFQTPSSFSSMSLPGAPTASPGAAAY PSLTNRGSNFAPETGQTAGQFQTRTAEGVGVWPQWQGQQPHHRSSSSEQHVQQPPAQQPG QPEVFQPITGADFRNPDGINLAPLMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAP DSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDG CGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKALK RAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKN RMKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSV DWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKG CNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLK EAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNS KNQSNESSMLSTDTKKASILLIRKIYILMQNLGPLPNDVCLTMKLFYYDEVTPPDYQPPG FKDGDCEGVIFEGEPMYLNVGEVSTPFHIFKVKVTTERERMENIDSTILSPKQIKTPFQK ILRDKDVEDEQEHYTSDDLDIETKMEEQEKNPASSELEEPSLVCEEDEIMRSKESPDLSI SHSQVEQLVNKTSELDMSESKTRSGKVFQNKMFQQPYSKNIHIHGFVSNKRYRECWYICD GRNVFILFKKQVRMTTLTHRARRTEISKNSEKKMESEEDSNWEKSPDNEDSGDSKDIRLT LMEEVLLLGLKDKEGYTSFWNDCISSGLRGGILIELAMRGRIYLEPPTMRKKRLLDRKVL LKSDSPTGDVLLDETLKHIKATEPTETVQTWIELLTAEHSVAMTRKLRGAIQAKHPLMGP ISNDSIRIIAQHLFLFCNAIFLNKYVHFLWPGGWLCSTGVANDSWGPPQHQSRHGCNRGV LGILPESLPRHLAHDKVSDVLMVSKSAVDSVLEKHKHNLYPKLLFYLFPNFLLPDLSTKP VVLLLSLQLDRFRGRQFFHWGGRRGDGFGETWNPFKLQYQLRNVRERIAKNLVEKGILTT EKQNFLLFDMTTHPVTNTTEKQRLVKKLQDSVLERWVNDPQRMDKRTLALLVLAHSSDVL ENVFSSLTDDKYDVAMNRAKDLVELDPEVEGTKPSATEMIWAVLAAFNKS >gi568815597r:150633049_150864763|GENSCAN_predicted_CDS_2|6093_bp naaatgacatcagatgtaccatcactgggtccagccattgcctctggaaactctggacct ggaattcaaggtggaggagccattgtccagagggctattaagcggcgaccagggctggat tttgatgatgatggagaagggaacagtaaatttttgaggtgtgatgatgatcagatgtct aacgataaggagcggtttgccagattgggagcagtgcatgaaaatggtattcctgaattc ccttggttggttcttgttcagactctgtatatctttggtccctacagagatcgattggca aaatgctttctgtgtttagatcatgttaatttactatatattggctttgcttttatgttg acctttatcttgtcggatgatgagcagagctctgcggataaagagagacttgccagggaa aatcacagtgaaattgaacggcggcgacggaacaagatgacagcctacatcacagaactg tcagatatggtacccacctgtagtgccctggctcgaaaaccagacaagctaaccatctta cgcatggcagtttctcacatgaagtccttgcggggaactggcaacacatccactgatggc tcctataagccgtctttcctcactgatcaggaactgaaacatttgatcttggaggcagca gatggctttctgtttattgtctcatgtgagacaggcagggtggtgtatgtgtctgactcc gtgactcctgttttgaaccagccacagtctgaatggtttggcagcacactctatgatcag gtgcacccagatgatgtggataaacttcgtgagcagctttccacttcagaaaatgccctg acagggcgtatcctggatctaaagactggaacagtgaaaaaggaaggtcagcagtcttcc atgagaatgtgtatgggctcaaggagatcgtttatttgccgaatgaggtgtggcagtagc tctgtggacccagtttctgtgaataggctgagctttgtgaggaacagatgcaggaatgga cttggctctgtaaaggatggggaacctcacttcgtggtggtccactgcacaggctacatc aaggcctggcccccagcagtggcatcacctagggtaactagttctcccaactgtacagac atgagtaatgtttgtcaaccaacagagttcatctcccgacacaacattgagggtatcttc acttttgtggatcaccgctgtgtggctactgttggctaccagccacaggaactcttagga aagaatattgtagaattctgtcatcctgaagaccagcagcttctaagagacagcttccaa caggtagtgaaattaaaaggccaagtgctgtctgtcatgttccggttccggtctaagaac caagaatggctctggatgagaaccagctcctttactttccagaacccttactcagatgaa attgagtacatcatctgtaccaacaccaatgtgaagaactctagccaagaaccacggcct acactctccaacacaatccagaggccacaactaggtcccacagctaatttacccctggag atgggctcaggacagctggcacccaggcagcagcaacagcaaacagaattggacatggta ccaggaagagatggactggccagctacaatcattcccaggtggttcagcctgtgacaacc acaggaccagaacacagcaagccccttgagaagtcagatggtttatttgcccaggataga gatccaagattttcagaaatctatcacaacatcaatgcggatcagagtaaaggcatctcc tccagcactgtccctgccacccaacagctattctcccagggcaacacattccctcctacc ccccggccggcagagaatttcaggaatagtggcctagcccctcctgtaaccattgtccag ccatcagcttctgcaggacagatgttggcccagatttcccgccactccaaccccacccaa ggagcaaccccaacttggacccctactacccgctcaggcttttctgcccaggtggctacc caggctactgctaagactcgtacttcccagtttggtgtgggcagctttcagactccatcc tccttcagctccatgtccctccctggtgccccaactgcatcgcctggtgctgctgcctac cctagtctcaccaatcgtggatctaactttgctcctgagactggacagactgcaggacaa ttccagacacggacagcagagggtgtgggtgtctggccacagtggcagggccagcagcct catcatcgttcaagttctagtgagcaacatgttcaacaaccgccagcacagcaacctggc cagcctgaggtcttccagccgatcactggagctgacttccgcaatcccgatggaataaat ctagcacccctgatgaccagtgaagaggtggttcagaagatgactggactcaaagtaccc ctgtctcattcccgcagtaatgacaccctttatatcccagaatgggaaggtagagcccca gactctgtcgactatcgaaagaaaggatatgttactcctgtcaaaaatcagggtcagtgt ggttcctgttgggcttttagctctgtgggtgccctggagggccaactcaagaagaaaact ggcaaactcttaaatctgagtccccagaacctagtggattgtgtgtctgagaatgatggc tgtggagggggctacatgaccaatgccttccaatatgtgcagaagaaccggggtattgac tctgaagatgcctacccatatgtgggacaggaagagagttgtatgtacaacccaacaggc aaggcagctaaatgcagagggtacagagagatccccgaggggaatgagaaagccctgaag agggcagtggcccgagtgggacctgtctctgtggccattgatgcaagcctgacctccttc cagttttacagcaaaggtgtgtattatgatgaaagctgcaatagcgataatctgaaccat gcagttttggcagtgggatatggaatccagaagggaaacaagcactggataattaaaaac agaatgaaacggctggtttgtgtgctcttggtgtgctcctctgcagtggcacagttgcat aaagatcctaccctggatcaccactggcatctctggaagaaaacctatggcaaacaatac aaggaaaagaatgaagaagcagtacgacgtctcatctgggaaaagaatctaaagtttgtg atgcttcacaacctggagcattcaatgggaatgcactcatacgatctgggcatgaaccac ctgggagacatgaccagtgaagaagtgatgtctttgatgagttccctgagagttcccagc cagtggcagagaaatatcacatataagtcaaaccctaatcggatattgcctgattctgtg gactggagagagaaagggtgtgttactgaagtgaaatatcaaggttcttgtggtgcttgc tgggctttcagtgctgtgggggccctggaagcacagctgaagctgaaaacaggaaagctg gtgtctctcagtgcccagaacctggtggattgctcaactgaaaaatatggaaacaaaggc tgcaatggtggcttcatgacaacggctttccagtacatcattgataacaagggcatcgac tcagacgcttcctatccctacaaagccatggatcagaaatgtcaatatgactcaaaatat cgtgctgccacatgttcaaagtacactgaacttccttatggcagagaagatgtcctgaaa gaagctgtggccaataaaggcccagtgtctgttggtgtagatgcgcgtcatccttctttc ttcctctacagaagtggtgtctactatgaaccatcctgtactcagaatgtgaatcatggt gtacttgtggttggctatggtgatcttaatgggaaagaatactggcttgtgaaaaacagt aaaaaccaaagcaacgaatctagcatgttgtctactgacaccaagaaagcaagcattctc ctcattcgcaagatttatatcctaatgcaaaatctggggcctttacctaatgatgtttgt ttgaccatgaaacttttttactatgatgaagttacacccccagattaccagcctcccggt tttaaggatggtgattgtgaaggagttatatttgaaggggaacctatgtatttaaatgtg ggagaagtctcaacaccttttcacatcttcaaagtaaaagtgaccactgagagagaacga atggaaaatattgactcaactatactatcaccaaaacaaataaaaacaccatttcaaaaa atcctgagggacaaagatgtagaagatgaacaggagcattatacaagtgatgatttggac attgaaactaaaatggaagaacaggaaaaaaaccctgcatcttctgaacttgaagaacca agtttagtttgtgaggaagatgaaattatgaggtctaaagaaagtccagatctttctatt tctcattctcaggttgagcagttagtcaataaaacatctgaacttgatatgtctgaaagc aaaacaagaagtggaaaagtctttcagaataaaatgtttcagcagccatattccaaaaac atacacatccatgggtttgtcagtaataagcgctatagggaatgctggtacatatgcgat ggtagaaatgtattcatcctgtttaagaaacaggtgagaatgaccactttaactcaccgg gcccgtcgcactgaaataagcaagaactctgaaaagaagatggaaagtgaggaagacagt aattgggagaaaagtccagacaatgaagattctggagactctaaggatatccgccttact cttatggaagaagtattgcttctgggactaaaagataaagaggggtacacatctttctgg aatgactgcatatcatcaggcctgcgagggggcatcctgatagagctggccatgcggggt cgaatctatctggaacccccgaccatgcgtaagaagcgactactagacagaaaggtactg ctaaagtcagacagcccaacaggtgatgttttactggatgaaactctgaaacacatcaaa gcaactgaacccacagaaactgtccaaacatggatagagctactcactgctgaacatagt gtggccatgacacgcaaactgagaggtgcaattcaagctaaacatccccttatgggacca atcagtaatgattccataagaatcattgcgcagcacctcttcctgttctgcaatgcaatc ttcctaaacaagtacgttcattttctctggccaggtgggtggctgtgttcaacgggtgtt gctaatgacagttggggtcctcctcagcatcagtctcgacatggctgcaaccggggggtc ctcgggatcctcccggaatctcttcctcggcatctggctcatgataaggtttcagatgtc ttgatggtatccaaatcagctgttgattcggtcctggagaaacacaagcataacctctac cccaagttattattttacctatttcccaactttttgctaccggatctctccaccaaacca gttgttttgcttctgtctttgcagctggacaggtttcgtggaagacagtttttccactgg ggtgggagacggggggatggtttcggtgagacctggaaccccttcaaattacagtaccag ctgagaaatgtacgagagcgcatcgcaaagaacctagtagagaaaggtattctaaccact gagaagcagaatttcctgctatttgacatgactactcatccagtgaccaatacaacagag aaacagcgactagtgaaaaaacttcaagatagtgtactagagcggtgggtaaatgaccct cagcgtatggacaagcgaacactagcactcctggtgctagcccactcctctgatgtgcta gagaatgtcttctcctctctgacagatgacaagtatgatgtggcaatgaatcgagccaag gacttagtagaactggaccctgaagtggaagggacaaagcctagtgccacagaaatgatc tgggctgtgctggcagccttcaataaatcttaa