GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:13:06 Sequence gi568815597r:229225238_229442465 : 217228 bp : 44.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1636 1631 6 1.05 1.02 Term - 7286 7105 182 0 2 64 46 83 0.671 -0.53 1.01 Init - 9249 9189 61 0 1 76 63 79 0.840 5.61 1.00 Prom - 9972 9933 40 -5.26 2.04 PlyA - 10196 10191 6 1.05 2.03 Term - 19297 19145 153 1 0 52 47 94 0.007 -0.38 2.02 Intr - 43196 43006 191 0 2 53 91 64 0.406 2.50 2.01 Init - 46119 45723 397 0 1 93 75 188 0.444 12.87 2.00 Prom - 48631 48592 40 -2.36 3.00 Prom + 56589 56628 40 -6.06 3.01 Init + 70171 70280 110 0 2 69 81 100 0.741 7.09 3.02 Intr + 72245 72354 110 2 2 50 -21 151 0.010 0.33 3.03 Intr + 73740 73835 96 1 0 95 98 42 0.015 5.88 3.04 Intr + 86179 86294 116 2 2 25 84 113 0.003 4.77 3.05 Intr + 101209 101271 63 0 0 104 28 77 0.028 2.21 3.06 Intr + 101659 101696 38 0 2 106 68 11 0.052 -2.04 3.07 Term + 104967 105258 292 0 1 75 43 210 0.113 10.02 3.08 PlyA + 105658 105663 6 1.05 4.06 PlyA - 106555 106550 6 1.05 4.05 Term - 116254 116103 152 2 2 102 42 95 0.898 4.37 4.04 Intr - 117276 116862 415 0 1 70 80 691 0.869 60.18 4.03 Intr - 117561 117426 136 2 1 86 76 -17 0.511 -2.53 4.02 Intr - 117808 117675 134 1 2 75 94 65 0.912 5.24 4.01 Init - 120140 120042 99 2 0 63 64 61 0.559 1.46 4.00 Prom - 129209 129170 40 -5.56 5.00 Prom + 141128 141167 40 -2.86 5.01 Init + 142984 143041 58 0 1 74 88 27 0.598 2.85 5.02 Intr + 151448 151487 40 0 1 104 101 4 0.293 0.58 5.03 Intr + 152260 152471 212 1 2 101 42 59 0.151 1.16 5.04 Term + 153687 153805 119 1 2 73 44 124 0.214 5.20 5.05 PlyA + 157963 157968 6 1.05 6.00 Prom + 169313 169352 40 -2.46 6.01 Init + 178067 178120 54 1 0 88 95 -28 0.153 -0.82 6.02 Intr + 182507 182631 125 1 2 42 116 143 0.890 11.88 6.03 Intr + 186271 186355 85 1 1 133 73 15 0.617 4.32 6.04 Term + 190428 190622 195 2 0 55 33 109 0.251 -0.49 6.05 PlyA + 191403 191408 6 1.05 7.08 PlyA - 192111 192106 6 1.05 7.07 Term - 206405 206262 144 2 0 88 42 316 0.999 24.91 7.06 Intr - 206665 206484 182 2 2 105 98 366 0.999 38.89 7.05 Intr - 206948 206757 192 0 0 128 97 522 0.999 56.56 7.04 Intr - 207194 207033 162 0 0 92 85 406 0.999 40.65 7.03 Intr - 207643 207319 325 1 1 85 117 668 0.999 64.75 7.02 Intr - 207890 207750 141 2 0 101 71 274 0.989 27.55 7.01 Intr - 209712 209530 183 0 0 79 40 77 0.246 1.98 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 63149 63246 98 0 2 75 39 116 0.864 3.53 S.002 Term + 72245 72407 163 2 1 50 43 193 0.984 8.41 S.003 Term - 100174 99998 177 1 0 71 43 126 0.809 4.09 S.004 Init - 103054 102982 73 1 1 67 49 37 0.868 -1.07 S.005 Term - 155799 155684 116 1 2 73 51 152 0.940 8.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_1|80_aa MGDENFRVLENALNNNCQRKEHCGDGEASLENANKGSTVEMAEQLEKRNQGLRSCAAAIP ALTYYLPGRLGDRAIDIYLV >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_1|243_bp atgggggatgaaaacttcagggtgctggaaaatgctcttaacaacaactgccagagaaaa gaacattgtggggacggggaagccagtttggagaatgccaacaagggcagcaccgtagag atggcagagcaactggaaaagaggaaccagggtcttagatcctgtgcagctgccattcca gccttgacttactatttgcctggacgactgggagacagagcaatagacatctatcttgtt taa >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_2|246_aa MAVCDILGAAPPLAGSPAALARGPPARLGGEGPGAGDRRREGPDRSPRQPPVSQRLRPSR TPAPRRRRALHPPSGRDREEEEEMGYARPGPPRVRACARGGRGGAREDFGARRKHVRGLG ALAVCAEVGRAAGNIQSEWARLTAVMQISQPIYDLRSKVPRSDYLSTPTSRDLLMGPPRP RLTECVLSKCISTITQCCEPLKGTEIVHSRSSDFKAVACQCSQLNKALPSTTWCLTGFVC GSSCYN >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_2|741_bp atggccgtctgcgacatcttgggagcggcgccgcctctcgccgggtcaccggctgcactc gcccgcggtccgccggcgcgtctcggtggggaaggacctggggcgggggaccggaggagg gaggggcccgaccggtccccacggcaaccgccggtctcccagcggctgcggcccagccgg actccagctccgcgcaggcgcagggccctccaccctccgtccggccgcgaccgcgaggag gaagaggagatgggctacgcgaggcccggcccaccccgcgttcgcgcgtgcgcacgggga ggccggggcggggcgcgtgaggacttcggcgcgcgccggaagcacgtgcgcgggctgggc gctctggcggtgtgcgctgaggtgggcagagcggcagggaatattcagtcagaatgggct agattaactgcagtaatgcagatctctcaacccatttatgacttacgcagcaaagttcct cgctcagactacctgtccactcccacgagcagagacttgctcatgggccctcctagaccc aggctgaccgagtgtgttttatctaaatgcatctccacaatcactcagtgttgtgagccc ttaaaagggacagaaattgtgcactcgagaagctcggattttaaggcagtagcttgccaa tgctcccagctgaataaagcccttccttctacaacttggtgtctgacaggttttgtctgc ggctcgtcctgctacaattga >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_3|274_aa MDSCGDYGFHRAERWRELEEMEKIVPGSGVDEELAENRETYNALTNWLTDARMLASQNIV IILCGNKKDLDADQLMFLETSALTGENVEEAFVQCARKILNKIESGLFHGWVTLHGKGIL ELGFKETFSYVVIVESTDDKRGLSELVNPNSIYIFAWLGVTKLFWQIRHLVAVEAQYWCS EWMVMLSSDIGPLKDEERVLVTLDARHSWSRQPDLQLELRREDGMEMKATAPGVDQVTQK KHRAGRQEVQTGPGTPSFNAQAKKGEPEMGAEEG >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_3|825_bp atggattcctgtggtgactatggattccaccgcgccgagaggtggcgagaactggaagag atggagaaaattgtgccagggagtggggtagatgaggagctcgcagagaaccgagaaacc tacaatgcgcttactaattggttaacagatgcccgaatgctagcgagccagaacattgtg atcatcctttgtggaaacaagaaggacctggatgcagatcagctgatgtttttggaaaca agtgcgctcacaggggagaatgtagaagaggcttttgtacagtgtgcaagaaaaatactt aacaaaatcgaatcaggtttgttccatggctgggtgacactacacggcaaaggaatactt gagctgggcttcaaggaaacatttagctatgtggtcattgtagaaagcacagatgacaaa cgggggctgagtgaacttgtgaaccccaactccatctacatctttgcctggcttggtgtc actaaactcttctggcaaataaggcacttggttgctgtcgaggctcagtactggtgttct gagtggatggtcatgctgtcctctgacataggccccctaaaggatgaggaaagggtgctt gtcacgttggatgcaagacactcctggtcccggcagccagaccttcaactggagctcagg agagaagatggcatggagatgaaagccacggctcctggtgtggaccaagtcacccagaag aaacacagagccggaagacaggaggtgcaaacaggcccggggacaccgtcatttaatgca caggcaaagaaaggagaacccgagatgggagcagaggaagggtag >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_4|311_aa MQQEMYMTLRCQSRLLEDVTKTAQPTAHPTASYLERGPPRRHLAQEESARESELRFSCCA TRASGLGAEDRPGAAGRSPSLLRSPRPSPQPPCLPPTEQPSSAARPSTPAAPVPASALGP GRRAAGSEERLALEAADGTMSPGSGVKSEYMKRYQEPRWEEYGPCYRELLHYRLGRRLLE QAHAPWLWDDWGPAGSSEDSASSESSGAGGPAPRCAPPSPPPPVEPATQEEAERRARGAP EEQDAEAGDAEAEDAEDAALPDPKAPVALRVFSRHGPFTFFPAQRLAGLGTQLNASVQPG VTNGSFQITGF >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_4|936_bp atgcagcaggagatgtatatgaccttgagatgccagtccaggcttctggaggatgtgacc aagacggcccagcccacagcccaccccactgccagttatctggagcgcgggccgccgcgg cgacatcttgctcaggaggaaagcgcgcgggagagcgagctgcgctttagctgctgcgcc acgcgcgcctcgggcctgggcgcagaggatcggccgggcgcggcgggaaggagccccagt ctcctgcggtcccctcgccccagcccgcagcctccctgccttccgcccactgagcagccc tcctcggcggcgcgcccctcgaccccagcagccccggtccccgcctctgctcttggtccc ggccgccgggctgcgggcagcgaggagcggctggcgctcgaggcggcggacggcaccatg tccccggggagcggggtgaagagcgagtacatgaagcgctaccaggagccgcgctgggag gagtacgggccgtgctaccgcgagctgctgcactaccgcctaggccgccggctgctggag caggcgcacgcgccctggctctgggacgactggggcccggccggctcctcggaggactcg gcgtcgtcagagtcgtcgggcgccgggggccccgcaccccggtgcgccccgccctcgccc ccgccgcccgtagagccggcgacccaggaggaggcggaacggcgggcgcgcggggccccg gaggagcaggacgcggaggccggggacgcggaggccgaggacgcggaggacgcggctctg ccagatccgaaagcgccggtggcgctgcgcgtgttttcaaggcatggccccttcaccttc tttcctgcccagaggcttgcagggctggggacccagttgaacgcctcagtacagcctgga gtaaccaatggttccttccagatcactggattttaa >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_5|142_aa MVPGSASGEGFRLLLFVVRGQKDKEEPPSRPLREKGSLSLTHGGALSGHLCTGPLQGQMR QPEQNPLPAWPTKEPTTLHLAQPQPKAAPTRCAFRDLPGPLECESGITVIFPCHFKTYEY MPPVATQQYIIEHKQYLTDPSC >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_5|429_bp atggtgccaggatctgcttctggggagggcttcaggctgctgctattcgtggtgagaggg cagaaagataaagaggagcctccctccaggcctctgcgggagaagggaagtctctccctc acccatgggggagcactgagtggccacctctgcacaggccctctgcaggggcagatgagg cagcccgaacagaaccctctgcccgcctggccaaccaaagagcccacgacactccatctg gcccaaccgcagccgaaggcggcaccaactcgctgcgccttcagggacttgccagggcca cttgagtgtgaatctggcatcacagtcattttcccatgtcacttcaaaacctatgagtat atgcctccagtggccacacagcagtacatcatagagcacaagcagtacctcacagaccca tcatgctag >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_6|152_aa MEHILSTSPHVPSAWPHLERHPSRIRPRTPQLGTLTRLSRGFSGIRLQGNQLFAQTFKPS PRAQAFKQTLGPARLFEQVSHIYHSLEAPLFTFTWTNPDTHQAQQITWAVLLQGFTDIPH YFSQAQISTSSVTYLGIILIKTLVLSLLIVSG >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_6|459_bp atggagcacatattaagcacctcaccccacgtgccctcagcatggccccacctggaaagg caccctagccggattcgccctcggacgccccagctgggcacactcacgcggctctcccgc ggcttctcgggaattcgcctccagggcaatcagctcttcgcacaaacgttcaaaccaagc cccagagcacaggccttcaagcagaccctgggtccagccaggctttttgagcaggtttct catatttatcattccctcgaggcacctctcttcactttcacttggactaatcctgacacc catcaggctcagcaaattacctgggctgtactgctgcaaggcttcacagacatcccccat tacttcagtcaagcccaaatttcaacctcatctgttacctatctcggcataattctcata aaaacactcgtgctctccctgctgatcgtgtccggctaa >gi568815597r:229225238_229442465|GENSCAN_predicted_peptide_7|442_aa ETQGRCASLRRTAPCAQALRLTGLSPGAPAPPPRRSALASPRQLSILRQLRALRPPVALC AKLDTMCDEDETTALVCDNGSGLVKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSY VGDEAQSKRGILTLKYPIEHGIITNWDDMEKIWHHTFYNELRVAPEEHPTLLTEAPLNPK ANREKMTQIMFETFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPH AIMRLDLAGRDLTDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASS SSLEKSYELPDGQVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDL YANNVMSGGTTMYPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQ QMWITKQEYDEAGPSIVHRKCF >gi568815597r:229225238_229442465|GENSCAN_predicted_CDS_7|1329_bp gagacccaagggcgctgcgcgtccctgaggcggacagctccgtgtgctcaggctttgcgc ctgacaggcctatccccgggagcccccgcgcctcctccccggcgctccgccctcgcctcc ccccgccagttgtctatcctgcgacagctgcgcgccctccggccgccggtggccctctgt gcgaaactagacacaatgtgcgacgaagacgagaccaccgccctcgtgtgcgacaatggc tccggcctggtgaaagccggcttcgccggggatgacgcccctagggccgtgttcccgtcc atcgtgggccgcccccgacaccagggcgtcatggtcggtatgggtcagaaagattcctac gtgggcgacgaggctcagagcaagagaggtatcctgaccctgaagtaccctatcgagcac ggcatcatcaccaactgggatgacatggagaagatctggcaccacaccttctacaacgag cttcgcgtggctcccgaggagcaccccaccctgctcaccgaggcccccctcaatcccaag gccaaccgcgagaagatgacccagatcatgtttgagaccttcaacgtgcccgccatgtac gtggccatccaggccgtgctgtccctctacgcctccggcaggaccaccggcatcgtgctg gactccggcgacggcgtcacccacaacgtgcccatttatgagggctacgcgctgccgcac gccatcatgcgcctggacctggcgggccgcgatctcaccgactacctgatgaagatcctc actgagcgtggctactccttcgtgaccacagctgagcgcgagatcgtgcgcgacatcaag gagaagctgtgctacgtggccctggacttcgagaacgagatggcgacggccgcctcctcc tcctccctggaaaagagctacgagctgccagacgggcaggtcatcaccatcggcaacgag cgcttccgctgcccggagacgctcttccagccctccttcatcggtatggagtcggcgggc attcacgagaccacctacaacagcatcatgaagtgtgacatcgacatcaggaaggacctg tatgccaacaacgtcatgtcggggggcaccacgatgtaccctgggatcgctgaccgcatg cagaaagagatcaccgcgctggcacccagcaccatgaagatcaagatcatcgccccgccg gagcgcaaatactcggtgtggatcggcggctccatcctggcctcgctgtccaccttccag cagatgtggatcaccaagcaggagtacgacgaggccggcccttccatcgtccaccgcaaa tgcttctag