GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:13:02 Sequence gi568815597f:65048408_65326174 : 277767 bp : 42.15% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 Intr - 706 570 137 0 2 40 74 121 0.190 5.19 1.07 Intr - 1315 1126 190 2 1 36 98 64 0.027 -0.08 1.06 Intr - 10184 10057 128 1 2 -3 87 129 0.008 3.10 1.05 Intr - 17884 17714 171 0 0 55 83 102 0.061 4.54 1.04 Intr - 19777 19500 278 1 2 50 38 164 0.065 2.99 1.03 Intr - 24874 24748 127 0 1 76 29 102 0.023 2.86 1.02 Intr - 44240 44033 208 0 1 -11 59 188 0.048 3.21 1.01 Init - 54127 53941 187 1 1 82 53 140 0.184 9.17 1.00 Prom - 54877 54838 40 -7.75 2.00 Prom + 58899 58938 40 -4.55 2.01 Sngl + 62243 62422 180 1 0 99 49 137 0.463 5.65 2.02 PlyA + 66023 66028 6 1.05 3.02 PlyA - 66962 66957 6 1.05 3.01 Sngl - 75540 74572 969 0 0 44 47 275 0.842 15.46 3.00 Prom - 93754 93715 40 -6.05 4.11 PlyA - 93800 93795 6 1.05 4.10 Term - 100153 99845 309 1 0 101 47 250 0.730 16.28 4.09 Intr - 105468 105373 96 0 0 93 72 22 0.010 0.39 4.08 Intr - 106301 106151 151 1 1 28 10 201 0.035 5.54 4.07 Intr - 117637 117562 76 2 1 92 91 70 0.698 5.35 4.06 Intr - 126946 126729 218 0 2 86 -14 212 0.041 8.02 4.05 Intr - 128359 128224 136 2 1 75 42 68 0.373 -0.39 4.04 Intr - 130000 129967 34 1 1 53 60 46 0.141 -4.82 4.03 Intr - 132579 132419 161 1 2 104 41 141 0.364 9.79 4.02 Intr - 145282 145027 256 1 1 90 48 118 0.109 4.09 4.01 Init - 146835 146785 51 0 0 74 94 18 0.378 2.21 4.00 Prom - 148450 148411 40 -3.75 5.00 Prom + 155094 155133 40 -5.35 5.01 Init + 157521 157646 126 2 0 7 66 150 0.565 4.36 5.02 Intr + 161874 161950 77 2 2 56 121 41 0.622 1.59 5.03 Intr + 169473 169495 23 0 2 47 108 47 0.407 -1.13 5.04 Intr + 170347 170519 173 2 2 83 66 145 0.640 10.54 5.05 Intr + 176345 176463 119 1 2 65 89 218 0.454 17.94 5.06 Term + 186138 186222 85 0 1 116 49 55 0.303 0.55 5.07 PlyA + 187087 187092 6 1.05 6.04 PlyA - 187211 187206 6 1.05 6.03 Term - 199674 199559 116 1 2 77 53 140 0.900 7.15 6.02 Intr - 206471 206375 97 2 1 57 -3 221 0.014 8.56 6.01 Init - 208636 208622 15 1 0 81 65 25 0.614 -0.16 6.00 Prom - 209067 209028 40 -6.35 7.02 PlyA - 209272 209267 6 1.05 7.01 Sngl - 210012 209836 177 0 0 69 41 189 0.296 7.00 7.00 Prom - 210101 210062 40 -6.35 8.00 Prom + 212549 212588 40 -4.75 8.01 Init + 217201 217503 303 0 0 65 5 181 0.008 4.82 8.02 Intr + 229304 229540 237 1 0 102 94 91 0.645 7.99 8.03 Intr + 261306 261531 226 2 1 127 74 241 0.730 23.24 8.04 Term + 270911 271185 275 0 2 6 47 196 0.496 1.95 8.05 PlyA + 271219 271224 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 40901 40763 139 1 1 72 38 128 0.902 2.65 S.002 Init - 44208 44033 176 0 2 65 59 157 0.807 9.37 S.003 Init - 101742 101692 51 0 0 85 87 60 0.815 6.81 S.004 Term - 106301 106145 157 1 1 28 47 214 0.819 7.72 S.005 Init - 201843 201828 16 0 1 99 77 -6 0.896 -0.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_1|476_aa MKKQENITPPKENNNSPAADPNKKEIHKILILKNLGELQEDSEKEYREIRKTTQDMNEKF TRGPAERRVKSKVMAKHLKFIARTVMVQEGNMEGAYRTLSRILTTDRLIEGIKPQRYCEK PCHRRQRESYESKQGKKVAASVAGFNDPSSLDHKPSQQLGGRLLPTVPEKELARWDFSER SPQRYSRPGGKASSGADRRVKAARGGEQADSSPSFRGAWGTRTGDRHNQRVRAGPTGATG LQGGRMAAACPLPEPPFSQQRRESARTKGEGTWRRRRLLAAADPRASPCLPVRGILSLTL LPARQLPPSWRKTTPVLIPALKESPSLHGEPSERVMQPPERMPLTDTVTDRLPWLSYHSA DAVYSKVNMWVMAWNLSFFASAPQKSTLHNPSVDHSSASQQTCAPRSELEYGWYPKGRTR GQSESSCPPRWLQLPLEGADCGCKPPNSRKRLLTVNEGLPILEIDPVALGVDSGQX >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_1|1428_bp atgaaaaagcaagaaaatataacaccaccaaaggaaaacaataattctccagcagcagat cccaataaaaaagaaattcacaaaatactgattttaaagaaccttggtgagctacaagag gattctgaaaaagaatacagagaaatcagaaaaacaactcaggatatgaatgagaaattt accagaggtcctgcagagcggagagttaaatccaaggtcatggcaaaacatctgaagttc atcgccaggactgtgatggtacaggaagggaacatggaaggtgcatacaggaccctaagc agaatcctcactacggataggctgattgagggcattaagcctcaacggtactgtgagaag ccatgccaccggcgacagagggaaagctatgaaagcaagcaaggcaaaaaggtagcagca tctgtggcaggtttcaatgatccatccagtttagatcacaagcccagccagcagctggga ggaaggctcttacccacagtcccagagaaagagctggcaaggtgggacttctctgaacgt agcccacaacggtacagccgcccgggagggaaagcgagctctggggcggacaggcgagtg aaggcagccagaggaggggagcaggctgacagctcccccagcttccgcggcgcgtggggg accaggacgggcgaccgccacaatcagagggtgcgagctggacccaccggggccactggt ctccagggtgggaggatggctgctgcctgtccgctgccggagcctcctttctcccagcag aggagggagagcgcgcgcacgaagggggaggggacttggcggagacgccggctcctggcc gcagctgaccccagggcttctccctgccttcctgtgcgaggaattctttccctcactctg ctacccgcacgccagctgccaccttcctggagaaagaccacacctgtgcttatccctgct ttgaaggagagtccttcacttcatggggagccttcagagagagtaatgcagccaccagaa aggatgccgttgaccgacacagtgactgacaggctgccctggctcagttatcacagtgct gatgctgtctattctaaagtgaatatgtgggtgatggcttggaacttaagcttttttgct agtgccccacaaaagagtaccctccacaacccatctgtggaccacagcagtgcctctcaa caaacatgtgccccaaggagtgagttagaatatggatggtatcccaaaggaaggacccgt ggccagtcagaaagctcatgcccaccaaggtggttacagctgcccctggaaggggcagat tgtggatgcaagccccctaacagcagaaaaaggctattgacagtgaatgagggacttcca attctagaaatagatcccgttgccttaggggtggatagtggacaagnn >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_2|59_aa MAPASAPDEGLRMLPIMVEGEGGAGMSHGERRSKEVVVGEGARPLNNQISCELTNQEFT >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_2|180_bp atggctccagcatctgctcctgatgaagggctcagaatgcttccaatcatggtggaaggt gaagggggagcaggtatgtcacatggcgagagaagaagcaaggaggtggtggtgggggaa ggtgccagaccgttaaacaaccagatctcatgtgaactaactaaccaagaattcacttag >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_3|322_aa MSELPFTIASKRIKYLGIKLTRDVKDLFKENYKSLLNEIKEDTNKWKNIPYSWIGRINIM KMAILPKLIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKTILSKKNKAGDITLP DFKLYYKATVTKTAWYWYQNRDIDQWNRTEASEITPHIYNHLIFDKPDKNKKWGKDSLFN KWCWENWLAICRKLKLGPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMTKTPKTMATKAKIDKWDLIKLKSFCTAKETTIRVDRQPTEWEKIFAIYPSDKGLIPRI YKELKQIYKKKSNNPIKKWQRI >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_3|969_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaataaaactt acaagggatgtgaaggacctcttcaaggagaactacaaatcactactcaatgaaataaaa gaggacacaaacaaatggaagaacattccatactcatggataggaagaatcaatatcatg aaaatggccatactgcccaaactaatttatagattcaatgccatccccattaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagcaaaaagaacaaagctggagacatcacgctacct gacttcaaactatactacaaggctacagtaacaaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagaggcctcagaaataacaccacacatctacaac catctgatttttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagctatatgcagaaagctgaaactgggtcccttcctt acaccttatacaaaaattaattcaagatggattaaagacttaaatgttagaccgaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaagac ttcatgactaaaacaccaaaaacaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaaccaccatcagagtggacaggcaacct acagaatgggagaaaatttttgcaatctacccatctgacaaagggctaatacccagaatc tacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtggcaa aggatatga >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_4|495_aa MDELMGWLSQEWAPDKSGRPWGLSKHGFWYEGVVLEPIPSHTPYCQQILRDNHIVKREAK QKNLENSRHGHVKSGKACSRGNIKCVVKQPIAKEISMARRKPENAESKIPVNELNSLVRP LYPEVSDCMYGTELGEALLTLVSSLRQNITNNGSAHADSAVCRRTQLGLSTQQCISLIVA SSFLICKREVDRSTSAISVVALQLPAFGGLCEGNSGNALLENYSKGSKNVCCSATSGRET GTGRADADTLLTGPPQQSARTSSPLSGPTHNHPSTASKNNDTSGNTYSQLPVPTTCCAEG HQIPPPETSHRTIVVLYWVDALDNPLKEDHLVQLNKTDIVRVLTETLAARIEATFLDQTV PTRYANSHLRAFALAVPSSWNNLFLIITSSLFTPLTGAGLDVLPQEVAAGEMLETKVLGD PLAHGALARAGRPEDDRAQEFGSHCLREEGGGQTRSPDRGPGGSPGNFVSRPPTSGTLRS HWDSPAAPTGGRRDF >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_4|1488_bp atggatgaattaatgggttggttatctcaggagtgggctcctgataaaagcggccgaccg tggggcttgagtaagcatggattttggtatgagggagtggtcctggaaccaatcccttcc cacaccccttactgtcagcagatactgagggataaccatatagttaaaagagaagcaaaa cagaagaatttggaaaattctcggcatggccatgtaaagagtggaaaagcatgttccaga gggaatattaagtgtgtggtcaaacaacctattgctaaagagattagcatggctagaagg aagccagaaaatgctgagagtaagattcctgtaaatgagctcaattcccttgtaagacca ctctaccctgaagtatcagactgcatgtatgggacagaactaggagaagccttgctcacc ctggtatcatcccttcgccaaaacatcactaataatggcagtgcacacgcagactcagcc gtgtgccggagaacacagctcggactctcaacacagcaatgtatttcacttatcgtggcc tcaagtttcctcatctgcaagagagaggtagaccgttcgacctctgctatctctgtggtg gcacttcaactccctgcatttggaggcttgtgtgaaggtaactctggcaatgccctgttg gagaactacagcaagggctccaaaaatgtgtgctgctcagcaacctccggaagggaaacg ggaacagggagagctgatgcggacactcttctgacaggacctcctcagcaatcagcaagg acaagcagtccactgagcgggccaacacacaatcatccttcaacagccagcaagaacaac gataccagtggcaacacatatagccaactccctgtgcctaccacttgctgtgccgaaggc caccagatacctccacctgaaacttctcacagaactatcgtggttttgtattgggtagat gccttggataatcctttgaaggaggatcatttagtccaacttaataaaactgatatcgtt cgcgtactgacggaaacactggcagcacgtattgaggccacatttctggatcagacagtg ccgacacgctatgctaattcccaccttagggcctttgcacttgctgttccttcttcctgg aacaacctgttcctaatcatcacaagctcattgtttacgcccctcaccggtgctggcctt gatgttctcccgcaagaagtggccgctggagagatgctggagaccaaagttctgggcgat cctctggcacacggtgcccttgcccgagccgggcggcccgaggatgaccgcgcgcaggag tttggaagccattgccttcgcgaggaggggggcggtcaaacgcgcagccccgaccgcggc cccggaggaagccccgggaactttgtttctcggccccctacctctggcaccctccgctct cactgggactcgccggccgcccctacagggggaaggagagacttttaa >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_5|200_aa MCHAPVLWALALTVPSTGSALSPDIRVDRWIQIPTQITLPQKDTTLALKVALFEASHFQL CLSFLRGSFIAVYLTSFPRTLGQAEALDKICEVDLVISLNIPFETLKDRLSRRWIHPPSG RVYNLDFNPPHVHGIDDVTGEPLVQQEDDKPEAVAARLRQYKDVAKPVIELYKGTTIKAP FALLRTTRSWRQGALLFESD >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_5|603_bp atgtgccatgcacctgtgctgtgggcccttgcacttactgttccctctaccggcagtgcc ctgtctccagatatccgtgtggatcgctggatacagatccctacacagataacccttcct cagaaggacaccacactcgccctgaaagtggcactctttgaggccagtcacttccaactc tgcctttcatttctgagaggaagctttatcgcggtatacttgacaagttttcctaggaca ttaggacaagccgaagccctggacaaaatctgtgaagtggatctagtgatcagtttgaat attccatttgaaacacttaaagatcgtctcagccgccgttggattcaccctcctagcgga agggtatataacctggacttcaatccacctcatgtacatggtattgatgacgtcactggt gaaccgttagtccagcaggaggatgataaacccgaagcagttgctgccaggctaagacag tacaaagacgtggcaaagccagtcattgaattatacaaagggaccacaatcaaggcacct tttgccttgctcaggaccacaaggagttggagacagggagctctactatttgaaagtgac tag >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_6|75_aa MGPVVSQSNEQGLEERRLCRHQPLRFPSGPGTRDRGSGRSSGRLLPQSLESMGKECGLRE DLPPSGNTQVPEAHS >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_6|228_bp atggggcccgtggtgagccaaagcaacgagcaggggctagaggaaaggagactgtgccga catcagcccctgcgcttccccagcggtcccggtacccgcgaccgagggtcgggcagatcc agtggcaggcttttgccacagtctctggagtccatgggaaaagaatgtggtctgagagaa gatcttccaccaagtggaaacacccaagtgcctgaggcacattcatga >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_7|58_aa MTGVPRSYVPSPLQPPDEKEEGGRKMDAAAEDLLVDPEGLTSSKKPPAVVASQSTLNF >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_7|177_bp atgacaggagtccctagaagctatgttccttctcccctgcagcctcctgatgaaaaagaa gaaggtggtagaaagatggatgctgctgcagaagaccttctggttgacccagaagggtta acttcctcaaagaagcctccagctgtggttgcttcccagagcacactgaacttttaa >gi568815597f:65048408_65326174|GENSCAN_predicted_peptide_8|346_aa MSQPNFKFNNKHSGGFWGFKFGGSLNILPQLGEERWTPRSPEATRVPRSRFQGEPFLPIA LVVLLARVPRKGAAESRARPGGGRRLRAAVPGPCFRNANFRVVELVPSRAELGGKVARVL RYSPSGSEFHHGHTLGYNLLSLEAVSWSKRRLALELLSKQELPSKHTDKLALQGVSGEGK VDYFLFSPGLPMSLLGSYRKKTSNDGYESLQLVDSNGDLSAGSGGVGGKQRVNAGAAARS PARQPPDRASTMDSSGCADAQGSAVEGCVVEAGPVFAWRKSEIMGRNARTEALWLWAESR PKARGQYGVCIGWPLLTGRALDSGSLLVDGGTARIIKHLRNLEDDY >gi568815597f:65048408_65326174|GENSCAN_predicted_CDS_8|1041_bp atgagccaaccaaattttaagttcaacaacaagcattctggaggattctggggttttaaa ttcggaggctccttaaatattctgccgcagttgggggaggagcgatggacccctaggtct ccggaggctacacgtgttccccgatcccgtttccagggggagcccttccttcctattgcg ctggtcgtgctcctggcccgcgtgcccaggaagggcgcagcggagtcgcgtgcccgtccg ggcggaggacgccgtctccgcgcggcggtgccgggcccctgcttccggaacgctaatttt agggtggttgagctggtgccctcgcgggcagaactaggtggtaaggtggccagagtcttg agatactctccaagtgggagtgagtttcatcatgggcatactttaggttataatttgtta tctctcgaggcagtctcctggtcgaagagacgtctggctctggagcttctaagtaagcaa gagcttccaagtaagcacacagataagcttgccctacagggagtgtctggtgaaggaaag gttgattattttctcttttctccgggcttgcccatgagcctcctcgggagctaccggaaa aagaccagcaacgatggttatgaatctttgcagctggtggacagtaacggggacttaagt gcgggaagcggcggggttggcggcaagcagagagtgaacgccggggcagcggcgcggagt cccgcccgacagcctccggaccgcgccagcaccatggacagctcaggctgtgcagatgcc caggggagtgctgtggaggggtgtgtggtagaagcaggccctgtgtttgcctggaggaag agtgagataatgggtagaaatgctagaactgaagctctgtggctctgggctgagtccagg cctaaggcccgaggccagtacggtgtctgtatcgggtggccccttctcactgggagagcc ctagactctggttccttgcttgttgatggaggcacagcaagaataatcaaacacctgaga aacttggaagatgattattga