GENSCAN 1.0    Date run: 3-Nov-116    Time: 23:08:48

Sequence gi568815595r:44346578_44555549 : 208972 bp : 43.64% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    798    837   40  2  1   68  111    61 0.586   6.76
 1.02 Intr +  11262  11299   38  1  2   78   59    21 0.166  -3.82
 1.03 Intr +  14788  14941  154  0  1   96  104    63 0.730   8.35
 1.04 Intr +  21014  21131  118  0  1    2   60   105 0.051  -1.38
 1.05 Intr +  46278  46400  123  0  0   95  110    44 0.669   7.00
 1.06 Intr +  49823  49920   98  2  2   71   92    61 0.684   4.45
 1.07 Intr +  63491  63631  141  0  0   59   75    42 0.049   0.32
 1.08 Intr +  69770  69860   91  0  1   84   53    21 0.022  -2.75
 1.09 Intr +  73891  74044  154  1  1   93   62    88 0.288   6.77
 1.10 Intr +  74064  74126   63  2  0   86   62    42 0.262   0.31
 1.11 Intr +  81464  81581  118  1  1   93  116    35 0.572   6.84
 1.12 Intr +  82489  82646  158  2  2   82   81    81 0.935   6.53
 1.13 Term +  82693  82857  165  0  0   60   43    65 0.650  -2.88
 1.14 PlyA +  82932  82937    6                                1.05

 2.06 PlyA -  85164  85159    6                                1.05
 2.05 Term - 102162  99998 2165  1  2   39   38   751 0.482  50.57
 2.04 Intr - 102874 102812   63  2  0  109   87    11 0.769   1.79
 2.03 Intr - 103046 102936  111  0  0  105  109    47 0.995   8.85
 2.02 Intr - 103996 103870  127  1  1   15  106    88 0.160   3.55
 2.01 Init - 108972 108544  429  0  0   72   96   425 0.963  35.96
 2.00 Prom - 111900 111861   40                               -4.76

 3.00 Prom + 117638 117677   40                               -3.86
 3.01 Init + 123636 123670   35  2  2   94   80    29 0.598   2.06
 3.02 Intr + 132005 132092   88  2  1   73  115    43 0.699   5.47
 3.03 Term + 142163 142204   42  1  0   74   45    61 0.039  -2.44
 3.04 PlyA + 145177 145182    6                                1.05

 4.02 PlyA - 145212 145207    6                                1.05
 4.01 Sngl - 154049 152568 1482  2  0   79   44  1047 0.516  94.91
 4.00 Prom - 167271 167232   40                               -6.06

 5.09 PlyA - 171966 171961    6                                1.05
 5.08 Term - 178356 178171  186  0  0   84   49   113 0.701   4.49
 5.07 Intr - 178800 178613  188  1  2   61   65    68 0.380   1.21
 5.06 Intr - 187910 187746  165  0  0   77  103    71 0.411   7.43
 5.05 Intr - 199455 199267  189  1  0   91   80    66 0.269   5.66
 5.04 Intr - 203983 203862  122  0  2   51   73    81 0.592   3.04
 5.03 Intr - 204500 204383  118  0  1   75   72    58 0.445   2.42
 5.02 Intr - 205612 205564   49  1  1   92   76     4 0.299  -2.15
 5.01 Init - 208632 208561   72  0  0   76   86    61 0.390   3.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 103990 103870  121  1  1   89  106    83 0.815  10.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:44346578_44555549|GENSCAN_predicted_peptide_1|486_aa
MGAVPEGLQQYSPVHPDFFGQHPVEREINENSLKRLSVYLENLQKPGFKSLKPTQLTFYV
RETDQSSSDGQEPFSTSEAKRMPDRPIKWDKSYYSFTGFKDPDEDLEQVSRVETTLTSWL
DNNGKSAVKKLKNSLPLRKELDRLKDELSHQLQLSDIRWQRSWGIAHRCSQLHSLSRLAQ
QNLETLKKAKASLENSAQASLEQTKQIKSVPWRLMGPKAAGSSVGDHGPGNLPPTGQGHR
WGHAGPGSMAQHQAITKANSSPQGPGTGVPDVLEVHPDDAFHLELLRYNLEPPSTCSGTL
VGLLQQLLLKPGHSVCPGQDALLYPRGFLQPSTSLIKEGQTDWASSSNPKAAPLTRPIFS
TSHQTANRLPRPRNLSNLEGTESRDTPCCCLRESGPRSTGPVPADARSPLLLPDTSILRF
PLPGPPRKRGPQPLPPAPELSYIYDLSKNVMPFSPQKVILQAVIVTQGVPHGVLTGLQPS
TNRGLV

>gi568815595r:44346578_44555549|GENSCAN_predicted_CDS_1|1461_bp
atgggggctgtccctgaaggcttgcagcagtacagcccagtacatccagatttctttgga
cagcaccccgtagaaagggaaatcaatgaaaattctcttaaaaggttaagtgtctaccta
gaaaacctccagaaaccaggcttcaagtctttgaaaccaactcagcttacattttatgta
agagaaacagaccagagttcctccgatggccaggaaccttttagtacttccgaagctaaa
aggatgcctgacaggcccatcaaatgggacaagtcttattactcctttactggattcaag
gaccctgatgaagaccttgaacaagtctcgagagtggaaacaactctcacatcctggtta
gataacaatgggaaaagtgctgttaaaaagctaaagaacagtttgccacttagaaaagaa
ctagatcgtttaaaagatgaactgtctcatcaattgcaactctcagatatcaggtggcag
aggagctggggcatcgcccaccgctgtagccagctgcatagtttaagccgcttagcacag
cagaatttggaaacacttaaaaaagcaaaagcttccctggagaattctgcccaggcttcc
ctggagcaaaccaagcagatcaaatcagttccatggagactgatgggccccaaggctgca
ggctcctctgtaggggaccatggtcctggcaacctcccacccacaggccaaggtcaccga
tggggacatgcaggtccagggagcatggctcagcatcaggccatcactaaagcaaactcc
agcccacagggacctggcacaggtgttcctgatgtacttgaagtccatcctgatgacgca
tttcacctggagctgctccggtacaacttggagccacctagcacctgttcagggacacta
gtggggcttctccagcagctcctcctcaaaccaggtcactctgtgtgcccaggccaggat
gcactcctgtacccccgaggcttcctccagccctcaaccagcctgatcaaggagggccag
acagactgggcatccagttccaatccaaaggctgcccctttgaccaggcccatcttctcc
acctcacatcagacagccaatcggctgcccagacctcgaaacctcagtaacctagaaggc
acggagtccagggacaccccctgctgctgcctaagagaaagtggcccaagaagcactggc
ccggtgcctgctgatgccaggagccctctgctactgcctgacacaagcatcctgcgcttc
cccctcccaggccctccccgaaagcgtgggccacagccactcccaccagccccagagctt
tcctacatttatgatttgtccaaaaatgtgatgcccttcagcccccagaaggtcatcctc
caggctgtgatcgtcacacagggggtcccacatggggtcctcacaggccttcaacccagc
acaaatcgtggcctggtttga

>gi568815595r:44346578_44555549|GENSCAN_predicted_peptide_2|964_aa
MPPGRWHAAYPAQAQSSRERGRLQTVKKEEEDESYTPVQAARPQTLNRPGQELFRQLFRQ
LRYHESSGPLETLSRLRELCRWWLRPDVLSKAQILELLVLEQFLSILPGELRVWVQLHNP
ESGEEAVALLEELQRDLDGTSWRETMTFKDVEVTFSQDEWGWLDSAQRNLYRDVMLENYR
NMASLVGPFTKPALISWLEAREPWGLNMQAAQPKGNPVAAPTGQLHHWEFTPLFAGRKIF
NSVGDDLQSKTNKFILNQEPLEEAETLAVSSGCPATSVSEGIGLRESFQQKSRQKDQCEN
PIQVRVKKEETNFSHRTGKDSEVSGSNSLDLKHVTYLRVSGRKESLKHGCGKHFRMSSHH
YDYKKYGKGLRHMIGGFSLHQRIHSGLKGNKKDVCGKDFSLSSHHQRGQSLHTVGVSFKC
SDCGRTFSHSSHLAYHQRLHTQEKAFKCRVCGKAFRWSSNCARHEKIHTGVKPYKCDLCE
KAFRRLSAYRLHRETHAKKKFLELNQYRAALTYSSGFDHHLGDQSGEKLFDCSQCRKSFH
CKSYVLEHQRIHTQEKPYKCTKCRKTFRWRSNFTRHMRLHEEEKFYKQDECREGFRQSPD
CSQPQGAPAVEKTFLCQQCGKTFTRKKTLVDHQRIHTGEKPYQCSDCGKDFAYRSAFIVH
KKKHAMKRKPEGGPSFSQDTVFQVPQSSHSKEEPYKCSQCGKAFRNHSFLLIHQRVHTGE
KPYKCRECGKAFRWSSNLYRHQRIHSLQKQYDCHESEKTPNVEPKILTGEKRFWCQECGK
TFTRKRTLLDHKGIHSGEKRYKCNLCGKSYDRNYRLVNHQRIHSTERPFKCQWCGKEFIG
RHTLSSHQRKHTRAAQAERSPPARSSSQDTKLRLQKLKPSEEMPLEDCKEACSQSSRLTG
LQDISIGKKCHKCSICGKTFNKSSQLISHKRFHTRERPFKCSKCGKTFRWSSNLARHMKN
HIRD

>gi568815595r:44346578_44555549|GENSCAN_predicted_CDS_2|2895_bp
atgcctccaggcaggtggcatgctgcctatccagctcaggcccagtcttcgagggagcga
gggcggcttcagacagtaaagaaggaagaagaggatgaaagctatactccagtgcaggct
gccaggccacagactctcaaccgccctggccaggagctgttccgccagctcttcagacag
cttcgctaccatgagtcttcagggcccctagaaactctgagccggctccgggaactctgt
cgctggtggctgaggcctgacgttctctccaaggcacagatcctagagctgctggtgctg
gaacagttcctgagcatcctgcctggggagctccgggtttgggtgcagcttcataaccct
gagagtggcgaggaggctgtggccttgctggaggagctgcagagggaccttgatgggaca
tcctggagggagaccatgactttcaaggatgtggaggtgaccttctcccaggacgagtgg
gggtggctggactctgctcagaggaacctgtacagggatgtgatgctggagaattatagg
aacatggcttccctggtgggaccattcaccaaacctgctctgatctcctggttggaagca
agggagccatggggcctgaacatgcaggcagctcagcctaaggggaatccagttgctgct
cctacaggtcagttacatcactgggaatttactccattgtttgctggaaggaagatcttt
aacagtgttggagatgacctccagagtaaaacaaacaaattcatcttaaatcaggaacct
ttggaagaagcagaaaccttagctgtgtcatcaggatgtcctgcgacaagtgtttctgag
ggaattgggctcagagaatcttttcaacagaagagcaggcagaaggatcaatgtgaaaat
cccatacaagtaagagttaagaaagaagagaccaatttcagtcacaggacaggaaaagac
tctgaagtatcaggaagtaatagtcttgacttaaaacatgttacatatttgagagtttct
ggaagaaaggaatcccttaaacatggctgtggcaaacacttcagaatgagttcacaccac
tatgactacaagaaatatgggaaggggctcagacacatgattgggggcttcagcctacat
cagagaattcatagtggactgaaagggaacaaaaaggacgtgtgtggaaaagacttcagc
cttagctctcatcaccaacgtgggcagagtcttcacacagtgggagtgtcatttaagtgc
agtgactgtggaaggactttcagtcatagctcccatcttgcgtatcatcagagacttcac
actcaagagaaagcatttaaatgtagggtgtgtgggaaagccttccggtggagttccaac
tgtgcgcggcatgagaaaattcacactggagtgaagccttataaatgcgatttatgtgag
aaagctttccgacgcctgtcagcctaccgtctgcaccgagaaacccatgctaagaagaaa
tttcttgaattgaatcagtatagggcagctctcacctacagctcagggtttgatcatcat
ttgggagaccaaagtggggagaaactctttgactgcagccagtgcaggaaatccttccac
tgtaagtcatatgttcttgaacatcaaaggattcacacccaggagaagccctataaatgt
accaaatgtaggaaaacctttagatggagatcaaactttactcgtcatatgaggttgcat
gaggaggaaaaattctacaaacaagatgaatgtcgtgaaggcttcaggcaatctcctgac
tgcagtcagccccagggtgctcccgctgtggagaaaacatttctgtgtcagcagtgtggg
aaaacttttactagaaagaaaactctcgttgaccaccagagaattcacacaggtgagaaa
ccttaccagtgtagcgattgtgggaaggactttgcctataggtcagcctttattgttcat
aagaagaagcatgccatgaaaagaaaacctgagggcgggccatcttttagtcaggacaca
gtgttccaggttcctcagagcagtcactccaaagaggagccctacaaatgcagccagtgt
ggcaaggccttccgcaatcactcattcctcctcatccatcagagagttcacactggagag
aagccatataagtgcagggagtgtgggaaagccttcagatggagttccaatctctaccga
catcagaggattcactctcttcaaaaacagtatgattgccatgaaagtgaaaagactcca
aatgtggagccaaaaatcctcactggtgagaaacgtttttggtgtcaagaatgtgggaaa
acctttacacgtaaaagaacccttttagatcataagggaatacacagtggagagaagcgc
tataaatgtaatctatgtgggaaatcttatgatagaaactatcgccttgttaaccatcag
aggatccactctacagagagacctttcaaatgtcagtggtgtgggaaagagttcattggg
agacataccctttccagtcaccagaggaaacacaccagagcagcacaggctgaacgtagc
ccgcctgcacggtcttcctctcaggacacaaagttgagattacagaagctaaaaccaagt
gaagagatgcccctcgaagactgcaaagaagcttgcagccagagctccaggctcactgga
ctccaggacataagcattgggaaaaagtgccacaaatgcagcatatgtgggaaaactttt
aacaagagttcacaactcattagccacaagagatttcatactcgagagaggcccttcaaa
tgcagcaagtgtggaaagaccttcaggtggtcttcgaacctggctcggcatatgaaaaac
catattagagattag

>gi568815595r:44346578_44555549|GENSCAN_predicted_peptide_3|54_aa
MPGVQGHSYTANHQRWKQRIDPEHQEDGDKVIEKQMAHSNKLQGEKNYMWYIHI

>gi568815595r:44346578_44555549|GENSCAN_predicted_CDS_3|165_bp
atgcctggcgtccaaggacactcctacacagccaatcaccaaagatggaaacagagaata
gatcctgaacaccaggaagatggtgacaaggtgatagagaaacagatggcacattcaaac
aagttgcagggtgagaaaaactacatgtggtacatccacatctga

>gi568815595r:44346578_44555549|GENSCAN_predicted_peptide_4|493_aa
MMESSELTPKQEIFKGSESSNSTSGGLFGVVPGGTETGDVCEDTFKELEGQPSNEEGSRL
ESDFLEIIDEDKKKSTKDRYEEYKEVEEHPPLSSSPVEHEGVLKGQKSYRCDECGKAFYW
SSHLIGHRRIHTGEKPYECNECGKTFRQTSQLIVHLRTHTGEKPYECSECGKAYRHSSHL
IQHQRLHNGEKPYKCNECAKAFNQSSKLFDHQRTHTGEKPYECKECGAAFSRSKNLVRHQ
FLHTGKKPYKCNECGRAFCSNRNLIDHQRTHTGEKPYKCNECGKAFSRSKCLIRHQSLHT
GEKPYKCSECGKAFNQISQLVEHERIHTGEKPFKCSECGKAFGLSKCLIRHQRLHTSEKP
YKCNECGKSFNQNSYLIIHQRIHTGEKPYECNECGKVFSYNSSLMVHQRTHTGEKPYKCN
SCGKAFSDSSQLTVHQRVHTGEKNLMNVLSVGKPLVSVPLLITTSELMLERSPQVWLGHL
LKAWFSETDSKDL

>gi568815595r:44346578_44555549|GENSCAN_predicted_CDS_4|1482_bp
atgatggagagttcagagctgactccgaagcaggaaatttttaaaggatcagagtcatct
aatagcacatcagggggactctttggggtggttcctgggggaacagagactggagatgtt
tgtgaagataccttcaaagagttagaaggacaaccctcaaatgaagaagggagcagacta
gaaagtgatttcttggaaataatagatgaggataagaaaaaatccacaaaagacagatat
gaggaatataaggaagttgaggaacatccacctctgtcttccagtcctgttgaacatgaa
ggagttttaaagggacagaaatcctatcgatgtgatgaatgtggcaaagctttttattgg
agttcgcacctcattggtcatcggagaatccacactggagagaaaccctatgagtgtaat
gagtgtgggaagaccttcaggcaaacctcccagctcattgttcatctcagaacccacaca
ggggaaaagccctatgaatgcagtgagtgtggaaaggcctataggcacagctcccatctc
attcaacaccagagactccataatggggagaaaccctataaatgtaatgaatgtgcaaaa
gcttttaatcagagctccaaactcttcgaccaccagagaacccatactggggagaaacct
tatgaatgtaaggagtgtggggcggcctttagtcggagtaaaaatcttgttcgacatcag
tttctgcacactggtaagaaaccttataagtgtaatgaatgtgggagagcattctgttcc
aatagaaatctcattgaccatcagagaacccacactggggagaagccttataaatgtaat
gaatgtggcaaagccttcagtcggagtaaatgtcttattcgacatcagagcctccacact
ggggaaaagccatacaaatgtagtgaatgtgggaaagccttcaatcagatctctcaactt
gttgaacatgagcgaattcatactggagaaaaaccatttaagtgtagtgagtgtggtaag
gcattcggtctgagtaaatgtcttattcggcaccagaggcttcacacaagtgaaaagccc
tataaatgcaatgagtgtggaaaatccttcaatcaaaactcatacctcattatacaccag
agaattcacactggtgagaaaccctatgaatgtaatgagtgtgggaaggtcttcagttat
aattctagtcttatggtacatcagagaacccatactggggaaaaaccctataaatgcaat
agttgtgggaaagcctttagtgacagctcacagcttactgtgcaccagagagtccacact
ggagagaaaaaccttatgaatgtattgagtgtgggaaagcctttagtcagcgttccactt
ttaatcaccaccagcgaactcatgctggagagaagccctcaggtctggctcggtcatctt
cttaaggcatggttttctgagacagacagcaaagacctttga

>gi568815595r:44346578_44555549|GENSCAN_predicted_peptide_5|362_aa
MAEPEAREASTSAPSGIPEVSFAKLSPLTLTLDVVVPLAGFQDLSQAIACGQAIWDPSMG
AVAPGCPHLPLVRILPHDDRAVTLMAKVHSFILEVSETKNPPEGTNSGHILSTMKGLSPS
KDGESIEHDCQQIIVQTYATQKDLLEVPLANPDPNLYTNGSSFVENGIQTAGYAIVSDVT
VLEMISIAAQKILWMEGPAATYPPDGSTEYIGLRLESSPIAHCPAPGLTNAQDRLEMDRL
WVLPLYMPSKVVNAMWFQDTADLAFEASGRKPHPFPSVHKITADMLSNLLLQALFPIQGM
SCNEAKIPGQAYLFELHLSSFVVFLSNRLFHKTLSPAVCLEQLSLCREVPFTTYVSTQIE
GY

>gi568815595r:44346578_44555549|GENSCAN_predicted_CDS_5|1089_bp
atggccgagccggaagcgagagaggcctcgacttccgccccctccgggatcccggaagtc
tcgtttgcgaagttgagtcccctgacactaactcttgatgtggttgttcccttagctgga
tttcaggacttgtctcaggcaattgcctgtgggcaagccatctgggaccccagcatggga
gctgtggccccaggctgcccccacttgccattggtacggattctccctcatgatgacaga
gctgtgacactcatggcgaaggtccacagcttcattcttgaagtcagtgagaccaagaac
ccaccagaaggaaccaactctggacacatcttgtcaaccatgaagggactatcgccaagc
aaggatggggaatcaatcgagcatgactgccaacaaattatagtccagacttatgccacc
caaaaggatctcttagaagtccccttagctaatcctgaccctaacctatataccaacgga
agttcatttgtggagaatgggatacaaacagcaggttatgccatagttagtgatgtaaca
gtacttgaaatgatcagtatagctgcccaaaagatactgtggatggaagggccagctgcc
acataccctcctgatggaagtaccgagtacattggcctgagactggagtcctctccaata
gcccactgtccagcccctggccttaccaacgctcaggacagattagaaatggaccgcctt
tgggtgctccccttgtacatgcctagcaaagttgtgaatgccatgtggttccaagataca
gctgacctggcatttgaagcatcagggaggaagcctcacccattccctagtgtccacaaa
ataactgctgacatgctctccaatcttttgctgcaggctcttttccccattcaaggaatg
tcttgcaatgaggcaaagatccccggccaagcatacctctttgagttgcacctctcctcc
ttcgtggtcttcttaagcaacagactcttccacaagacactctcacctgcagtctgcctt
gagcagctcagcctctgcagagaagttcccttcaccacatatgtctctacccagattgaa
ggctactga