GENSCAN 1.0    Date run: 5-Nov-116    Time: 21:03:46

Sequence gi568815594f:101690881_102173740 : 482860 bp : 35.87% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 PlyA -    516    511    6                               1.05
 1.05 Term -  13858  13692  167  2  2   23   47   181 0.281   4.60
 1.04 Intr -  19317  19248   70  0  1   22   83    53 0.017  -3.96
 1.03 Intr -  49763  49596  168  2  0   19   86   129 0.187   5.02
 1.02 Intr -  50841  50595  247  2  1   45   41   169 0.527   4.44
 1.01 Init -  52888  52749  140  1  2   71   44   151 0.604   8.66
 1.00 Prom -  60155  60116   40                              -3.85

 2.05 PlyA -  60206  60201    6                               1.05
 2.04 Term -  68901  68723  179  1  2    5   49   178 0.592   2.47
 2.03 Intr -  69294  69150  145  0  1    0   56   168 0.936   3.73
 2.02 Intr -  70803  70716   88  2  1   99   71    52 0.876   3.65
 2.01 Init -  88305  88166  140  0  2   74   94    92 0.208   8.06
 2.00 Prom -  97496  97457   40                              -6.85

 3.03 PlyA -  98284  98279    6                               1.05
 3.02 Term - 100380 100114  267  0  0  -52   41   242 0.041   0.01
 3.01 Init - 108158 107799  360  2  0   71   42   154 0.398   6.22
 3.00 Prom - 119568 119529   40                              -2.65

 4.00 Prom + 128245 128284   40                              -2.45
 4.01 Init + 138948 139326  379  2  1   36   67   148 0.231   4.61
 4.02 Intr + 164155 164309  155  2  2   37   78   139 0.371   6.67
 4.03 Intr + 171646 171784  139  0  1   76   82    95 0.927   6.82
 4.04 Intr + 179625 179764  140  1  2   59  111    78 0.601   6.46
 4.05 Intr + 190119 190244  126  2  0    8   67   104 0.129   0.26
 4.06 Intr + 204425 204530  106  1  1   25   82    72 0.035  -0.83
 4.07 Intr + 206598 206663   66  1  0   73   40    86 0.010   0.26
 4.08 Intr + 209126 209194   69  0  0   81  121    19 0.026   2.74
 4.09 Term + 216656 218097 1442  0  2   67   47   503 0.011  34.07
 4.10 PlyA + 220742 220747    6                               1.05

 5.00 Prom + 239396 239435   40                              -4.75
 5.01 Init + 249849 249953  105  2  0   89   80    58 0.921   5.37
 5.02 Term + 251427 251582  156  2  0   36   39   119 0.607  -1.25
 5.03 PlyA + 251639 251644    6                               1.05

 6.03 PlyA - 251958 251953    6                               1.05
 6.02 Term - 263290 263014  277  0  1   71   43   207 0.768   8.45
 6.01 Init - 267803 267742   62  2  2   55   99     6 0.393  -0.79
 6.00 Prom - 276401 276362   40                              -5.45

 7.00 Prom + 277387 277426   40                              -3.95
 7.01 Init + 297007 297169  163  0  1   71   35   129 0.168   5.84
 7.02 Intr + 334321 334629  309  2  0   43  111   206 0.426  13.76
 7.03 Intr + 339080 339385  306  0  0   45   86   322 0.521  23.20
 7.04 Intr + 352959 353027   69  1  0   83   74    56 0.164   1.94
 7.05 Intr + 354675 354991  317  1  2   25   86   145 0.067   2.86
 7.06 Intr + 359341 359428   88  0  1   95  110    42 0.459   5.72
 7.07 Intr + 369331 369509  179  2  2   98   90   189 0.997  18.82
 7.08 Intr + 372195 372258   64  2  1   47   92    77 0.477   1.37
 7.09 Intr + 380395 380424   30  2  0   91  103    15 0.240   0.48
 7.10 Intr + 383125 383170   46  2  1  111   34    52 0.028  -1.35
 7.11 Intr + 393411 393509   99  0  0   70   72    92 0.045   4.11
 7.12 Intr + 407140 407266  127  1  1   18   38   135 0.000   1.26
 7.13 Intr + 438357 438515  159  2  0   85   64    81 0.001   4.66
 7.14 Intr + 450350 450515  166  1  1   61  103    69 0.442   4.31
 7.15 Term + 478850 479025  176  0  2   99   47   112 0.872   5.14
 7.16 PlyA + 479287 479292    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 216427 218097 1671  0  0   51   47   542 0.947  41.31
S.002 Init + 477772 477835   64  0  1   79   29    85 0.935   3.06

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_1|263_aa
MEKSLNDLMELKNMAQGLRDKCTSFSSSFDKLEERVSAIEDQMNEMKNTNYIREYYKHLY
ANKVENLEEMDKFLDTYTLPRLNQEEVESLNTPITGSEIEAIINSLPTKKSPGPDEFTAE
FYHRDKEELRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNVPCSRIGRINIVKM
AILPKGDGGIFHNTNKGDNRSRICGKHEANYKTTSGRSFRRHPEEGIVTTGDDSSMQVTA
PEDLPVRQDVEVEDSDTDDPDTV

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_1|792_bp
atggagaagtccttaaatgacctgatggagctgaaaaacatggcacaaggactacgtgac
aaatgcacaagcttcagtagctcattcgataaactggaagaaagggtatcagcgattgaa
gatcaaatgaatgaaatgaaaaatacaaactacatcagagaatactataaacacctctat
gcaaataaagtagaaaacctagaagaaatggataaattcctggacacatacaccctccca
agactaaaccaggaagaagttgaatccctgaatacaccaataacaggctctgaaattgag
gcaataattaatagcctaccaaccaaaaaaagtccaggaccagatgaattcacagcggaa
ttctaccatagggacaaggaagagctgagaataaaatatctaggaatccaacttacaagg
gacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggac
acaaacaaatggaagaacgtcccatgctcacggataggaagaatcaatatcgtgaaaatg
gccatactgcccaagggagatggagggatctttcataataccaataaaggggacaatcga
tcaagaatttgtggaaaacatgaagctaactataaaacaacctcaggcaggtccttcagg
aggcatccagaagaaggcattgttaccacaggagatgacagctccatgcaagttactgcc
cctgaagaccttccagtgagacaagatgtggaggtggaagacagtgatactgatgatcct
gacactgtgtag

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_2|183_aa
MGHQATMQFELPIINWVLSDPSIHKVRHAQQHYIIKWKWYIYDWYQAEACSLEKILTLLT
GCLEKEVGAVAGGMVQDTGWSLNVDAKQEHHTSLKDVDGRGTPMMLMIGYSSTVGSLAAP
TGSKVYLALSGSDDFTGYPAFDHEYPYLTSMILEAHQVVNIEANHPNVTRNCTYSVIPFS
GPG

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_2|552_bp
atgggtcaccaagctaccatgcagtttgagctgcctatcataaactgggtgttatctgac
ccatcaatccataaagttcgccatgcacagcagcactacatcatcaagtggaagtggtat
atatacgattggtaccaagcggaggcttgtagcctggagaaaattctcaccctgctcacc
ggctgcctggaaaaggaagttggtgctgttgcaggaggcatggtccaagatactgggtgg
tcattaaatgtggatgcaaagcaggagcatcacacatcgctcaaagacgtggacggcagg
gggacaccaatgatgttgatgattggctattccagcacagtgggaagtttggcagcaccc
actggatccaaagtttatttggctctatctgggagtgacgatttcactgggtacccagca
tttgaccatgaatacccgtaccttacatccatgatcctggaggcccaccaggtggttaac
attgaggctaatcaccccaatgtaactcgtaactgcacctatagtgtcattcccttcagt
gggcctggttag

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_3|208_aa
MGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKSFTIYPSDKGL
ISRIYKELTQIYKKKSNNPIKKWAKDMNTHFSKEYIYAANRHMKKCSSSLAIREIQIKTT
KLQDPDPGAGESSSSSEVADTEGSPLARTPSALPPRGVQMPLREVRDRRTLWAAWRQDKK
EKPCLRDALASVPECSCLTMARGLRRAP

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_3|627_bp
atgggcaaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggagaaaagttttacaatctacccatctgacaaagggcta
atatccagaatctacaaagaacttacacaaatttacaagaaaaaatcaaacaaccccatc
aaaaagtgggcaaaggatatgaacacacacttctcaaaagaatacatttatgcagccaac
agacacatgaaaaaatgctcatcatcattggccatcagagaaatacaaatcaaaaccaca
aaactacaggaccctgacccgggtgctggggaaagttccagctcctcagaagttgcagac
actgagggtagcccgctggccagaactcccagcgcgctccctcctcgtggcgtccagatg
cccctcagagaagtcagggacagaagaaccctctgggctgcctggcggcaggataaaaag
gagaagccctgtctcagggacgccctggcctcagtgcccgagtgcagttgtctaacgatg
gcccgtggtctccgcagagccccgtag

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_4|873_aa
MIYEEDAEEWALYLTEVFLHVVKREAILLYRLENFSFRHLELLNLTSYKCKLLILSNSLL
RDLTPKKCQFLEKILHSPKSVVTLLCGVKSSDQLYELLNISQSRWEISTEQEPEDYISVI
QSIIFKDSEDYFEVNIPTDLRAKHSGEISERKEIEELSEASRNTIPLAVVLPTEIPCENP
GEIFIILRDEVIGDTVEVEFTSSNKRIRTRPALWNKKVWCMKALEFPAGSVHVNVYCDGI
VKATTKIKYYPTAKAKECLFRMADSGESLCQDIGLGKNFLTNIRQAQATKAKTDKYDNIE
LKSFCTAKETINKNSIEELDGVLTSIFKHEIPYYEFQSLQTEICSQNKYGELWMGYIGVP
ENMNLVQQGWAQFHYHNNTYSEIIEDPVIIDEKVLEVLARAIRQEKEIKGIQLGKEEVKL
SLFADDMIVYLENPIISAPNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESKIMSEL
PFTVTSKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKGKNIPCSWVGRISIVKIAI
LPKVIYRFNAIPIKLPMTFFTELEKTTLKVIWNQKRARIAKSILNQMNKAGGITLPDFKL
SYKATVTKTAWYWYQNRDIDQWNGTEPSEIRPHIYSHLIFDKPDKNKKWGKDSLFNKWCW
ENWPAICRKLKLDPFLIPYTKIKSRWIKDLNVRRKTIKTLEENLSNTIQDIGMGKDFMTK
TPKAMATKAKIDKWYLIKPKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKRLISRIYNEL
KQIYKKKTNNPIKKWAKDMNRYFSKEEIYAANRHMRKCSSSLAIREIQIKTTIRYHLTPV
RMVIIKKSGNNRCWRGCGETGTLLHCWWDCKLV

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_4|2622_bp
atgatatatgaagaagatgctgaggaatgggctctgtacttgacagaagtatttttacat
gttgtgaaaagggaagccatcctgttatatcgcttggagaatttctcttttcggcatttg
gagttgctgaacttaacgtcttacaaatgtaaacttttgatattatcaaatagcctgctt
agagacctaactccaaagaaatgtcagtttctggaaaagatacttcattcaccaaaaagt
gtagttactttgctttgtggagtgaagagttcagatcagctctatgaattactaaatatc
tctcaaagcagatgggagatctcaactgaacaggaacctgaagactacatctctgtaatc
cagagtatcatattcaaagattctgaagactactttgaggtcaacattccaacagaccta
cgagcaaaacattctggggaaataagtgagagaaaggaaattgaagaactatcagaagct
tcaagaaacaccataccactagcagtggtgcttcccactgaaattccatgtgagaatcct
ggtgaaatattcataattttgagagatgaagtaattggtgatactgtagaggttgaattt
acatcaagtaataagcgcattagaacacggccagccctttggaataagaaagtctggtgc
atgaaagctttagagtttcctgctggttcagtccatgtcaatgtctactgtgatggaatc
gttaaagctacaaccaaaattaagtactacccaacagcaaaggcaaaggaatgcctattc
agaatggcagattcaggagagagtttgtgccaggacattggtttgggcaaaaatttcttg
acaaatatccgacaagcacaggcaaccaaagcaaaaacggacaaatatgataatattgag
ttaaaaagtttctgcacagcaaaggaaacaatcaacaagaatagcattgaagaacttgat
ggtgtccttacatccatattcaaacatgagataccatattatgagttccagtctcttcaa
actgaaatttgttctcaaaacaaatatggagaactctggatgggttatatcggagtccct
gaaaatatgaatctggttcagcagggctgggcccaattccattaccataacaacacatat
tctgaaataatagaggaccctgtaattatagatgaaaaagtgttggaagttctggccagg
gcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtcaaattg
tccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagccccaaat
ctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaa
tcacaagcattcttatacaccaataacagacaaacagagagcaaaatcatgagtgaactc
ccattcacagttacttcaaagagaataaaatacctaggaatccaacttacaagggatgtg
aaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacacaaac
aaagggaagaacattccatgctcatgggtaggaagaatcagtatcgtgaaaatagccata
ctgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttc
acagaattggaaaaaactactttaaaggtcatatggaaccaaaaaagagcccgcattgcc
aagtcaatcctaaaccaaatgaacaaagccggaggcatcacactacctgacttcaaactg
tcctacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagac
caatggaacggaacagagccctcagaaataagaccacacatctacagccatctgatcttt
gacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaatggtgctgg
gaaaactggccagccatatgtagaaagctgaaactggatcccttccttataccttataca
aaaattaaatcaagatggattaaagacttaaatgtgagacgtaaaaccataaaaacccta
gaagaaaacctaagcaataccattcaggacataggcatgggcaaggacttcatgactaaa
acaccaaaagcaatggcaacaaaagccaaaattgacaaatggtatctaattaaaccaaag
agcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgggag
aaaatttttgcaatctactcatctgacaaacggttaatatccagaatctacaatgaactc
aaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggatatgaac
agatacttctcaaaagaagaaatttatgcagccaacagacacatgagaaaatgctcatca
tcactggccatcagagaaattcaaatcaaaaccacaataagataccatctcacaccagtt
agaatggtgatcattaaaaagtcaggaaacaacaggtgctggagaggatgtggagaaaca
ggaacacttttacactgttggtgggactgtaaactggtttga

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_5|86_aa
MEVEDHSYHITSSHDISYQHNSSLLMDVSLDHLAEAAFFIDTVGNSSNKPQSTEPVVGIL
NLFFRESSRGPKSSVDDSMLRVELES

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_5|261_bp
atggaggtggaggaccattcttatcacatcacatcatctcatgatatttcctatcaacat
aactcatcattgctgatggatgtttccctcgatcacctggctgaggcagccttcttcatc
gacacagtgggaaattcctctaataaaccacagagcacagagcctgtagtgggcatcttg
aacctgttcttcagagagagctctagaggcccaaagtcctctgtagacgattctatgtta
agagttgaactagaaagttag

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_6|112_aa
MLSSSEFILRQGGHSRVITAREPCEDTDTEIHTGKTYCEDRGNAAGIQGMSRITGNHRKL
GERQGIDSSSGLQKEQTLQTHLDFRLLACSTMKKQISVAFSYPVCGNLLQQP

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_6|339_bp
atgcttagctcatcagaattcattttgcggcaaggtggccatagtagagtgatcactgcc
agagagccttgtgaagacacggacacagagatacacacagggaaaacatattgtgaagac
cgaggcaatgcagctggaattcaaggaatgtcaagaattactggcaaccaccggaagcta
ggagagaggcaaggaatagattcttcctcgggtctccagaaagaacaaactctgcagaca
caccttgatttcagacttctagcctgcagtactatgaaaaaacaaatttctgttgctttt
agctacccagtttgtggtaatttgctacaacagccctag

>gi568815594f:101690881_102173740|GENSCAN_predicted_peptide_7|765_aa
MCVACGRKHEKPVQDSPHPLPTAEVIIEALVELKPESQNNHEEQRPGAHHSDTLTTQNPA
FHHESRKTYGQSADGAEANEMEGEGKQNGSGMETKHSPLEVGSESSEDQYDDLYVFIPGA
DPENNSQEPLMSSRPPLPPPRPVANAFQLERPHFTLPGTMVEGQMERSQNWGHPGVRQET
GDEPKGEKEKKEEEKEQEEEEDPYTFAEIDDSEYDMILANLSIKKKTGSRSFIINRPPAP
TPRPTSIPPKEETTPYIAQVFQQKTARRQSDDDKFCGLPKKQAQNLLKLISSFSKVSGYK
INVQKSQAFLYTNIRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEI
KEDTNKWKNISCSWVGRINIVKMAVLPKLNSCLQRGKNQVAADLYGLAVANKTMSLKDRA
RIESPAFSTLRGCLTDGQEELILLQEKVKNGKMSMDEALEKFKHWQMGKSGLEMIQQEKL
RQLRDCIIGKRPEEENVYNKLTIVHHPGGYYNETHESTDILLSGSSEKPLTGFNQKKSNL
SQNGEWTVESRVEESDQLQVLQPLDSGTCTNDFPRLLGLQQQTKGYAVGFPDFEAFGLGE
TVPHILASPVTASDQRGPRTAHTTTPESTYHKPRQLPCGIKSAYAQNARFKELLPMSVPA
FGCKPPVTLVLILWPASPKLSKMKAIDRQSHPSQGTPRTLDDLCVLGDLRITTTTSFQSV
GLWGSTNPHSSPRNITGWAHSSRSSFSLEALASSSGEPSCKMSLF

>gi568815594f:101690881_102173740|GENSCAN_predicted_CDS_7|2298_bp
atgtgcgttgcctgtgggagaaagcatgaaaagccagtgcaagattctccacatcccctt
cccactgcagaagtgatcatagaagcactggttgagctaaagcctgagtcccagaataac
catgaagagcagaggcctggtgcacatcacagtgacacattgaccacacagaacccagca
tttcatcatgaaagcaggaagacatacgggcagagtgcagatggagctgaggcaaatgaa
atggaaggggaaggaaaacagaatggatcaggcatggagaccaaacacagcccactagag
gttggcagtgagagttctgaagaccagtatgatgacttgtatgtgttcattcctggtgct
gatccagaaaataattcacaagagccactcatgagcagcagacctcctctccccccgccg
cgacctgtagctaatgccttccaactggaaagacctcacttcaccttaccagggacaatg
gtggaaggccaaatggaaagaagtcaaaactggggtcatcctggtgttagacaagaaaca
ggagatgaacccaaaggagaaaaagagaagaaagaagaggaaaaagagcaggaggaggaa
gaagacccatatacttttgctgagattgatgacagtgaatatgacatgatattggccaat
ctgagtataaagaaaaaaactgggagtcggtctttcattataaatagacctcctgccccc
acaccccgacccacaagtatacctccaaaagaggaaactacaccttacatagctcaagtg
tttcaacaaaagacagccagaagacaatctgatgatgacaagttctgtggtcttcctaag
aaacaagcccaaaatctccttaagctgataagcagcttcagcaaagtctcaggatacaaa
atcaatgtacaaaaatcacaagcattcttatacaccaacatcagacaaacagagagccaa
atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa
cttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaata
aaagaggatacaaacaaatggaagaacatttcatgctcatgggtaggaagaatcaatatc
gtgaaaatggccgtactgcccaagctgaattcttgcctccaaagaggcaagaatcaagtt
gctgcagatctgtacggattggctgttgctaacaaaaccatgtctttgaaggacagagct
cggatagagagtccagccttttctactctcaggggctgtctaactgatggtcaggaagaa
ctcatcctcctgcaggagaaagtaaagaatgggaaaatgtctatggatgaagctctggag
aaatttaaacactggcagatgggaaaaagtggcctggaaatgattcagcaggagaaatta
cgacaactacgagactgcattattgggaaaaggccagaagaagaaaatgtctataataaa
ctcaccattgtgcaccatccaggtggttattataatgaaactcacgaatctacggacatt
ttgctttcagggtctagtgagaagccattgactggttttaaccagaaaaagtcaaactta
tcgcagaatggagaatggactgtagagtcaagagtagaggaaagtgaccaactccaggtt
cttcagcctttggactctgggacttgcaccaatgacttccccaggctcttgggccttcag
cagcagactaaaggttatgctgttggcttccctgattttgaggcatttggacttggagaa
actgttcctcacatccttgcttctccagttacagcctcagatcaaaggggcccaagaaca
gctcacaccaccactccagagagcacataccataagcctcggcagcttccatgtggcatt
aagtctgcatatgctcagaatgcaagatttaaggagcttctacctatgtcagttcctgca
tttggatgcaagcctcccgttacacttgtgcttatcctctggccagcatctcctaaattg
tctaaaatgaaagcaattgatagacaaagccatccaagtcaagggacacctagaactcta
gatgacctttgtgtgcttggtgatctccgaataacaactacaacctcctttcagtccgtg
ggcctgtgggggtccaccaaccctcactccagtcccaggaacatcaccgggtgggcccac
tcttccagatctagcttctcactggaggcccttgcaagcagctcaggagagccatcctgt
aagatgtccctcttctga