GENSCAN 1.0    Date run: 5-Nov-116    Time: 05:06:12

Sequence gi568815575r:40834985_41035832 : 200848 bp : 42.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +     35    935  901  1  1   52   72   926 0.884  81.86
 1.02 Term +   7867   8003  137  2  2   76   40   117 0.841   2.90
 1.03 PlyA +   8074   8079    6                                1.05

 2.05 PlyA -   8582   8577    6                                1.05
 2.04 Term -  26688  26371  318  0  0   90   38   203 0.812   9.40
 2.03 Intr -  29364  29266   99  0  0   79   56    56 0.545   0.89
 2.02 Intr -  38380  38207  174  1  0   17   89   131 0.732   5.31
 2.01 Init -  39898  39812   87  1  0   69   91    41 0.845   3.19
 2.00 Prom -  39984  39945   40                               -5.75

 3.02 PlyA -  40164  40159    6                                1.05
 3.01 Sngl -  56553  56047  507  0  0   70   43   257 0.776  15.19
 3.00 Prom -  57843  57804   40                               -6.45

 4.00 Prom +  60315  60354   40                               -4.95
 4.01 Init +  62233  62299   67  0  1   21  111    40 0.116   0.89
 4.02 Term +  73004  73407  404  0  2   64   48   404 0.293  28.33
 4.03 PlyA +  76479  76484    6                                1.05

 5.00 Prom +  83896  83935   40                               -7.15
 5.01 Init +  85048  85152  105  0  0   54   82    42 0.053   0.48
 5.02 Term +  91237  91383  147  0  0   23   47   220 0.530   8.32
 5.03 PlyA +  98655  98660    6                                1.05

 6.02 PlyA -  99980  99975    6                                1.05
 6.01 Sngl - 100873  99998  876  1  0   37   46   876 0.516  74.02
 6.00 Prom - 113512 113473   40                               -4.25

 7.02 PlyA - 114355 114350    6                                1.05
 7.01 Sngl - 115508 115104  405  2  0   32   37   320 0.931  17.34
 7.00 Prom - 141756 141717   40                               -3.25

 8.00 Prom + 149223 149262   40                               -5.85
 8.01 Init + 149520 149605   86  2  2   51   39   107 0.107   2.44
 8.02 Intr + 166430 166595  166  2  1   90   47   149 0.028  10.04
 8.03 Term + 174193 174261   69  0  0   43   36   156 0.216   2.86
 8.04 PlyA + 174268 174273    6                                1.05

 9.04 PlyA - 174436 174431    6                                1.05
 9.03 Term - 175874 175588  287  0  2   22   36   225 0.228   5.28
 9.02 Intr - 183794 183733   62  1  2  102   62    21 0.017  -1.64
 9.01 Init - 199484 199468   17  2  2  102   83    39 0.345   4.53

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 167210 167447  238  1  1   74   81   165 0.828  12.62

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_1|345_aa
MNTNEAESRNSNFATVVAGSEDWANAIEFVPGQPYCGRTVPSCTEAPLQGSVTKEESEEE
QTAVETKKQLCPYAAVGQCRYGENCVYLHGDLCDMCGLQVLHPMDAAQRSQHIQACIEAH
EKDMEFSFAVQRSKDKVCGICMEVVYEKANPNEHRFGILSNCNHTFCLKCIRKWRSAKEF
ESRIVKSCPQCRITSNFVIPSEYWVEEKEEKQKLIQKYKEAMSNKACKYFDEGRGSCPFG
ENCFYKHMYPDGRREEPQRQQVGTSSRNPGQQRNHFWEFFEEGANSNPFDDEEEAVTFEL
DDSTQTRDSRLNWSCGPHSEADSVHKDHFPYLYDCTSNQSAVPIP

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_1|1038_bp
atgaatacaaacgaagctgagtcaagaaattcaaattttgcaactgtagtagcaggttca
gaggactgggcgaatgccattgagtttgttcctgggcaaccctactgtggccgtactgtg
ccttcctgcactgaagcacccctgcagggctcagtgaccaaggaagaatcagaggaagag
caaaccgccgtggaaacaaagaagcagctgtgcccctatgctgcagtgggacagtgccga
tatggggagaactgtgtgtatctccacggagatttatgtgacatgtgtgggctgcaggtc
ctgcatccgatggatgctgcccagagatcacagcatatacaagcgtgcattgaagcccat
gagaaagacatggagttctcatttgctgtgcagcgcagcaaggacaaggtgtgtgggatc
tgcatggaggtggtctatgagaaagccaaccccaacgagcaccgcttcgggatcctctcc
aactgcaaccacaccttctgtctcaagtgcattcgcaagtggaggagtgctaaggaattt
gagagcaggatcgtcaagtcctgcccacaatgccgaatcacatctaactttgtcattcca
agtgagtactgggtggaggagaaagaagagaagcagaaactcattcagaaatacaaggag
gcaatgagcaacaaggcatgcaagtattttgatgaaggacgtgggagctgcccatttgga
gagaactgtttttacaagcatatgtaccctgatggccgtagagaggagccacagagacag
caagtgggaacatcaagcagaaacccaggccaacaaaggaaccacttctgggaattcttt
gaggaaggagcgaacagcaacccctttgacgatgaagaagaggctgtcacctttgagctg
gatgactccactcagacccgagactcaagactcaactggtcctgtggcccccactcagaa
gcagactcagtgcacaaggaccattttccatacctctatgattgcacctccaaccaatca
gcagtacccattccctaa

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_2|225_aa
MDLPNAVLHQYKRHPKGVCSDLQRLTLQKEWDFRVIGVVSVKATTVLSVSKEDFLKKVWI
RASTPTLSHQPKNQKPKGVWILEKHCMDSAESPHQQEGPHQIQPLTRYSPLTLNFSTSIT
AQKPAKEEWLQGMGLRHPLRACRPGLPQDSSAPHTLAQCFLAPPAMAQAAPGADRPTTQK
DTSFKPWQHPHGANSSGVQKARAAEPWQPPCRFERMSLKAWGPRQ

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_2|678_bp
atggaccttcccaatgctgttttgcaccaatacaagagacatccaaaaggagtgtgctca
gatctacagaggctgactttacaaaaggagtgggacttcagagttataggggtggtgtct
gtaaaagctacaacagttttaagtgtttccaaagaggatttcttaaagaaagtgtggatt
cgagcctctactcccactctctcacatcagcccaagaaccagaagccaaaaggggtgtgg
attttagagaaacactgtatggactctgcagagagtccccaccagcaagaaggccctcac
cagatacagccccttaccagatacagccccttgaccctgaacttctccacctccataact
gcccagaagcctgcgaaggaagaatggcttcaggggatgggcctgaggcaccctctacgg
gcttgccgcccagggctgcctcaagactcttctgctccccacactctggcacagtgcttc
ttggcccccccagccatggctcaagcagccccaggtgcagatcgacccactacccaaaaa
gatacaagttttaagccttggcagcatccacatggtgctaattcttcaggcgttcagaaa
gcaagagctgcggagccttggcagcctccatgcagatttgaaagaatgtcactgaaagcc
tgggggcccaggcaatga

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_3|168_aa
MSLYPSLENLKVDKVIGAQTTFSTNSANPAILSEASAPISQNGNLYPKLYPELSQYMGLS
LNEEEICANVAMVSGAPIQGQLVARPSSMNYMVAPVTGNDVGIHRAEIKQGICEVILCKD
HNGKIGLRLKSIDNGIFVQLVQANSPASLVGLRIEDQVLQINSENCAG

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_3|507_bp
atgtctctctatccatctcttgaaaacttgaaggtagacaaagtaattggggctcaaact
actttttctacaaactctgccaatccagcaattttgtcagaagcttctgctcccatctct
caaaatggaaatctctatcccaaattatatccagagctctctcaatacatgggcctgagt
ttaaatgaagaagaaatatgtgcaaatgtggccatggtttctggtgcaccaattcagggg
cagttggtagcaagaccttccagtatgaactatatggtggctcctgtaactggtaatgat
gttggaattcacagagcagaaattaagcaagggatttgtgaagtcattttgtgtaaggat
cacaatggaaaaattggactcaggcttaaatcaatagataatggtatatttgttcagcta
gtccaggctaattctccagcctcattggttggtctgagaattgaggaccaagtactccag
attaatagtgaaaactgtgcagggtag

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_4|156_aa
MTLPFVGTADEQRAYNPGRTGPAALRPLVKLKIVKKRTKKFILYQSDWYVKIKPNWQKPR
GIDNRVCRRFKGQILKPIVGYGSNKKTKHMLPNGFRKFLVHDINELEVLMMCNKSYCAKI
AHNVSSKNCKAIVERAAQLAIRVNNLNAILRSKENE

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_4|471_bp
atgacactgccttttgtgggcactgcagatgaacagagagcctacaacccaggcagaaca
ggaccagctgccctcagaccccttgtgaaactcaagattgtcaaaaagagaaccaagaag
ttcatcctgtaccagtcagactggtatgtcaaaatcaagcctaactggcagaaacccaga
ggtattgacaatagggtttgtagaaggttcaagggccagatcttgaagcccatcgttggt
tatgggagcaacaaaaaaacaaagcacatgctgcccaatggtttccggaaattcctggtc
catgacatcaatgagctggaagtgctgatgatgtgcaacaaatcttactgtgctaagatc
gctcacaatgtttcctccaagaactgcaaggccatcgtggaaagagcagcccagctggcc
atcagagtcaacaacctcaatgccatactgcgcagcaaagaaaatgaatag

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_5|83_aa
MIDFARPALWKAEGTVLSSEQQNILLVQNSLEPLKAGPSGGIPEEGIVIIGDDSSMNVID
LEDLPVGQDVEVEDCDIDDSDPV

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_5|252_bp
atgattgattttgctagaccagctctgtggaaagcagaagggacagtgctgagttctgaa
caacaaaacattttgttagttcagaattccttggagcctctcaaggcaggtccttcagga
ggtatcccagaagaaggcattgttatcataggagatgacagctccatgaatgttattgac
cttgaagatcttccagtgggacaagatgtggaagtggaagactgtgatattgatgattct
gaccctgtgtag

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_6|291_aa
MTPVQRGGSGGPGGPGMGNRGGFRGGFGSGIRGRGRGRGRGRGRGRGARGGKAEDKEWMP
VTKLGRLVKDMKIKSLEEIYLFSLPIKESEIIDFCLGASLKDEVLKIMPVQKQIRAGQRT
RFKAFVAIGDYNGHVGLGVKCSKEVATAIRGAIILAKLSIVPVRRGYWRNKIGKPHTVPC
KVTGRCGSVLVRLIPAPRGTGIVSAPVPKKLLMMAGIDDCYTSARGCTATLGNFAKATFD
AISKTYSYLTPDLWKETVFTKSPYQEFTDHLVKTHTRVSVQRTQAPAVATT

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_6|876_bp
atgacgccggtgcagcgaggagggtccggaggccctggtggccctgggatggggaaccgc
ggtggcttccgcggaggtttcggcagtggcatccggggccggggtcgcggccgtggacgg
ggccggggccgaggccgcggagctcgcggaggcaaggccgaggataaggagtggatgccc
gtcaccaagttgggccgcttggtcaaggacatgaagatcaagtccctggaggagatatat
ctcttctccctgcccattaaggaatcagagatcattgacttttgcttgggggcctctctc
aaggatgaggttttgaagattatgccagtgcagaagcagatccgtgccggccagcgcacc
aggttcaaggcgtttgttgctatcggggactacaatggccacgtcggtctgggtgttaag
tgttccaaggaggtggccaccgccatccgtggggccatcatcctggccaagctctccatt
gtccccgtgcgcagaggctactggaggaacaagatcggcaagccccacaccgtcccttgc
aaggtgacaggccgctgcggctctgtgctggtgcgcctcatccctgcacccaggggcact
ggcatcgtctccgcacctgtgcctaagaagctgcttatgatggctggtatcgatgactgc
tacacctcagcccggggctgcactgccaccctgggcaacttcgccaaggccacctttgat
gccatttctaagacctacagctacctgacccccgacctctggaaggagactgtatttacc
aagtctccctatcaggaattcactgaccacctcgtcaagacccacaccagagtctccgtg
cagcggactcaggctccagctgtggctacaacatag

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_7|134_aa
MSREKPAAGAEPSWRTPTRAVLRGNTGLEPPNNVPTGALRRRAVRGGPPSSRPQDGRSTG
SSHHAPGKVVGTQCQPMRAAVWAEPWKATGAELPKVLGAHPLHQCALDVRHGVKGDYFEA
LTAALLGFRLVWGL

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_7|405_bp
atgtccagagagaagcctgctgcaggggcagagccctcatggagaacccccactagggca
gtgctgaggggaaatacggggttggagcctccaaacaacgtccccactggggcacttcgt
agaagagctgtaagaggagggccaccatcctccagaccccaggatggtagatccactggc
agctcgcaccatgcacctggaaaagtcgtaggtactcaatgccagcccatgagagcagct
gtgtgggctgagccctggaaagccacaggggcagagctgcccaaggttttgggagcccac
cccttgcatcagtgtgccctggatgtgaggcatggagtcaaaggagattattttgaagct
ttaacagctgccctgctgggtttcagacttgtgtggggcctgtag

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_8|106_aa
MEAEYGIIDIGDLEEGGEGGVDDEKLLKGQGTVMHYPDSPSVKDFLPNSKSVVSRQPSAV
IPSRSASAAEGSPCPLTDQGKGLKANNNDDDDDVDDDDDSVLFTLC

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_8|321_bp
atggaagcagagtatggaataatagacattggagacttggaagagggaggggaaggagga
gtggatgatgagaaattacttaaggggcagggcactgtgatgcactacccagattcccct
tcagtgaaggacttcttgcccaactccaagagtgtggtcagcagacagccctcagctgtc
atcccctccaggtctgcctcagctgcagagggcagcccatgcccactgactgatcaaggc
aagggtctaaaggcaaataataatgatgatgatgatgatgttgatgatgatgatgatagt
gtcctcttcactctttgttaa

>gi568815575r:40834985_41035832|GENSCAN_predicted_peptide_9|121_aa
MSGKFYCFLAKQVFSMSPFLWSLKTIAESVSIVTGFSQGNRATVSILGIKPFTAVGRAGE
VNVWKEESEDQRVTSRCRKHWCSWTNQSLQENSRSQSTSSHRSRAAEGEFVERFMGAVAS
T

>gi568815575r:40834985_41035832|GENSCAN_predicted_CDS_9|366_bp
atgtccggcaaattttattgcttcctcgcaaagcaagttttctcaatgtccccttttctg
tggtccctgaagacaattgcagaatctgtaagtatcgtaacaggattcagccagggaaac
agagccactgtgagtattctaggaataaaaccgttcacagctgtgggaagagctggggaa
gtgaatgtctggaaggaggagtcagaagatcagagagtcactagccgttgccggaagcac
tggtgcagttggacaaatcagagcttgcaggaaaattccagaagccaaagcacatctagc
caccgaagcagggccgcggagggggagttcgtggagaggtttatgggagctgtggcctct
acgtag