GENSCAN 1.0	Date run: 8-Nov-116	Time: 14:15:59

Sequence gi568815597r:212735873_212936149 : 200277 bp : 41.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   1907   1902    6                               1.05
 1.03 Term -  21288  21157  132  0  0  105   43    86 0.014   2.91
 1.02 Intr -  48621  48491  131  1  2   70   99   134 0.971  12.19
 1.01 Init -  55891  55555  337  1  1   86    1   366 0.693  25.19
 1.00 Prom -  56283  56244   40                             -10.65

 2.00 Prom +  58553  58592   40                             -10.55
 2.01 Init +  58874  58936   63  1  0   34   61    25 0.329  -4.40
 2.02 Intr +  59223  59255   33  2  0   84  115    72 0.965   7.00
 2.03 Intr +  60645  60718   74  2  2  129   43   117 0.854   8.59
 2.04 Intr +  61240  61324   85  1  1   84   87    25 0.698   0.90
 2.05 Intr +  66829  66891   63  0  0  122   82    80 0.991   8.80
 2.06 Term +  88318  88725  408  0  0  -71   38   381 0.176  11.43
 2.07 PlyA +  90888  90893    6                              -0.45

 3.03 PlyA -  91851  91846    6                               1.05
 3.02 Term -  93029  92898  132  2  0  107   54    91 0.074   4.71
 3.01 Init - 115129 113954 1176  1  0   70   86   464 0.021  37.87
 3.00 Prom - 120395 120356   40                              -3.35

 4.00 Prom + 120512 120551   40                              -7.45
 4.01 Sngl + 122581 123405  825  0  0   65   43   756 0.905  62.28
 4.02 PlyA + 123856 123861    6                               1.05

 5.00 Prom + 124893 124932   40                              -5.95
 5.01 Init + 128442 128647  206  2  2   45   52   236 0.344  14.07
 5.02 Intr + 152617 152722  106  1  1   55   97    87 0.484   5.60
 5.03 Intr + 153274 153385  112  0  1  103   99   -20 0.412  -0.37
 5.04 Intr + 154650 154764  115  1  1   11   59    71 0.190  -4.91
 5.05 Intr + 156728 157130  403  2  1   57   78   326 0.462  22.00
 5.06 Term + 159344 159418   75  1  0   91   49   103 0.990   3.56
 5.07 PlyA + 159518 159523    6                               1.05

 6.00 Prom + 160045 160084   40                              -4.15
 6.01 Init + 164185 164323  139  0  1   67   81    92 0.563   5.52
 6.02 Intr + 180128 180226   99  0  0   95   32   134 0.053   7.66
 6.03 Intr + 182464 182514   51  2  0   96  119    49 0.107   6.76
 6.04 Term + 184101 184255  155  1  2   76   55    49 0.053  -2.50
 6.05 PlyA + 184493 184498    6                               1.05

 7.03 PlyA - 184538 184533    6                               1.05
 7.02 Term - 185482 185229  254  2  2   71   39   199 0.727   8.12
 7.01 Intr - 188975 188784  192  0  0   60   65   137 0.694   7.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  68448  68561  114  2  0  112   39    67 0.932   1.79
S.002 Sngl - 115129 113393 1737  1  0   70   38   484 0.932  36.16

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_1|199_aa
MAGSPELVVLDPPWDKELAAGTESQALVSATPREDFRVRCTSKRAVTEMLQLCGRFVQKL
GDALPEEIREPALRDAQWVRDQLASSLLIILLPNTQDLRIQKDPTPGLGRINDSDIKVLE
DQFDEIIVDIATKRKQYPRKILECVIKTIKAKQEILDIPRTHWPYDFHTRNSLYLKLLSE
TTCLTLALHSALCSKCHFI
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_1|600_bp
atggcggggtctcctgagttggtggtccttgaccctccatgggacaaggagctcgcggct
ggcacagagagccaggccttggtctccgccactccccgagaagactttcgggtgcgctgc
acctcgaagcgggctgtgaccgaaatgctacaactgtgcggccgcttcgtgcaaaagctc
ggggacgctctgccggaggagattcgggagcccgctctgcgagatgcgcagtgggtacgg
gaccagttagccagttctttactcatcatcctcttacccaacacccaggatcttaggatt
cagaaagatcccacgccaggccttggcagaattaacgattctgacatcaaagtacttgaa
gatcagtttgatgaaatcatagtagatatagccacaaaacgtaagcagtatcccagaaag
atcctggaatgtgtcatcaaaaccataaaagcaaaacaagaaattctggacataccaaga
acacactggccctacgactttcatactcgcaactccctgtatttgaaactgctctccgaa
actacctgcctgactcttgcacttcattcagctctctgttcaaagtgtcacttcatctga
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_2|241_aa
MREWYCRSEGWKAFQEGRSKQDLDDVLEKAKKANVVALVAVAEHSGEFEKIMQLSERYNG
FVLPCLGVHPVQGLPPEDQRSVTLKDLDVALPIIENYKDRLLAIGEYAAKRFRKAQHPIV
ECRTHSMMPQQLQETRDLAHHQACLRDHSPAHRRGLSAGPGERHHQQWSPGGLHTYWASR
AWAVRPQAVEVSPLCRVNQAIWLLSTGAREALWGSPTKKREKRREEKRREERERERKIKN
C
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_2|726_bp
atgagagagtggtattgcagaagtgaagggtggaaagcatttcaagaaggaaggagtaag
caagatttggatgatgtgttggagaaagccaagaaggccaatgttgtggcccttgtggca
gttgccgaacattcaggagaatttgaaaagattatgcaactttcagaaaggtataatggg
tttgtcctgccatgcttgggtgttcatccagttcaaggacttccaccagaagaccaaaga
agtgtcacactaaaggatttggatgtagctttgcccattattgagaattataaggatcgg
ttgttggcaattggagagtatgccgccaaacgcttccgcaaagctcagcaccccattgtg
gagtgccgcactcactccatgatgccgcaacaactgcaagaaactcgtgaccttgcgcat
caccaagcatgccttcgagaccattcacctgctcacaggagaggactctctgcaggtcct
ggtgaacgccatcatcaacagtggtccccgggaggactccatacgtattgggcgagccga
gcctgggccgtgagaccacaggctgtggaagtgtcccccttgtgccgtgtgaatcaggcc
atctggctgttgtccacaggtgctcgtgaggccctttggggcagccccacgaaaaagaga
gagaagagaagagaagagaagagaagagaagaaagagaaagagaaagaaagatcaaaaac
tgttaa
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_3|435_aa
MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA
NRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKI
QQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF
NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV
SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL
LKEIKEDTNKWKNIPCSWVGRINIVKMAILPKEAQIHREPKHSDIRRDMDKPAGLKPYVS
QDYIVGHSVDKTHQP
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_3|1308_bp
atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagtttaccaaccaaa
aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg
gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca
ttttatgaggccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagag
aattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca
aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacaga
gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaatt
caacaacccttcatgctaaaaactctcaataaattaggtattgatgggacatatttcaaa
ataataagagctatctatgacaaacccacagccaatatcatactgaacgggcaaaaactg
gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc
aacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaaaggtatt
caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtttatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc
tcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaa
acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaaggaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga
agaatcaatatcgtgaaaatggccatactgcccaaggaagctcagattcatagagagcca
aagcattctgatattagaagggatatggataagccagctggcttaaaaccctatgtcagt
caggactatatagtagggcacagtgtggataagactcatcaaccatga
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_4|274_aa
MARPDDEEGAAVAPGHPLAKGYLPLPRGAPVGKESVELQNGPKAGTFPVNGAPRDSLAAA
SGVLGGPQTPLAPEEETQARLLPAGAGAETPGAESSPLPLTALSPRRFVVLLIFSLYSLV
NAFQWIQYSIISNVFEGFYGVTLLHIDWLSMVYMLAYVPLIFPATWLLDTRGLRLTALLG
SGLNCLGAWIKCGSVQQHLFWVTMLGQCLCSVAQVFILGLPSRIASVWFGPKEVSTACAT
AVLGNQVSTGVVGESQILKRPEKVIGRENYGLYG
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_4|825_bp
atggcgcggccagacgatgaggagggggcggcggtggcgcccggacacccgctcgcgaaa
ggatacctcccgttgccgaggggcgcgcccgttgggaaggagagcgtggagctgcagaac
gggcccaaagcgggcaccttcccggtgaatggggccccccgggacagcctcgctgccgcc
tcgggagttctgggcgggcctcagactccactggccccagaagaggagacccaggcccgg
ctgctgcctgcgggcgcgggagctgagaccccgggggccgagagcagcccgctgcccctt
acggcgctctccccgcggcgcttcgtggtgctcctgatcttcagcctgtactcgctggtc
aacgcctttcagtggatccagtacagcatcattagcaacgtcttcgagggcttctacggt
gtcaccttgctgcacatcgactggctgtccatggtgtacatgctggcctacgtgcccctc
atcttcccggccacctggctgctggacaccagaggcctgcggctcaccgccctgctgggc
tccggcctcaactgcctgggtgcctggatcaagtgcggcagtgtgcagcagcatctcttc
tgggtcaccatgttgggccagtgcttgtgctcggtggcccaggtgttcatcctgggcttg
ccctcccgcatcgcctcagtgtggtttgggcccaaagaggtgtccacagcttgtgccacc
gccgtgctgggcaatcaggtaagtactggagtggtaggtgaaagtcagatccttaaaaga
ccggaaaaagtcataggccgtgagaactatggcctgtatggatga
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_5|338_aa
MLTSEWSWAAASCSAEFQTVATIYLLGHSSKARLPPLVTESSRLSLSLEAVSQNQLQAAI
ISLEEALGSFFMTGYLPLGFEFAVEITYPESEGTSSGLLNASAQIFGILFTLAQGKLTSD
YGPKAGNIFLCVWMFIGIILTGHIQNCPGPHMGCRPQVEQAYTTPFHIKALSIRGFWYLR
ALMKMYEEINVAFMSGNTASLLQPVDQGVISTFKSYSLRNTLQKAVAARNNYSSLGSEQS
KLRTFWKGCTILDAIKNILDSWEEVKLSTLTGVWRKLIPILMDDFEVFKTSVEEIAADVV
GITRELELEVEYEEIPADSPTDQEPKTVMLSKQSESAI
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_5|1017_bp
atgttgactagtgaatggtcctgggcagcagcaagctgttcagcagaatttcaaactgtg
gcaactatctacttgttaggccactccagcaaagcacggctgccaccactcgtcactgag
tcctctcgtctgagcctgtctcttgaagcagtcagtcaaaatcaactgcaagcagccatc
atcagtttggaagaagctttgggaagcttcttcatgactggttacctccctttgggtttt
gaatttgctgttgaaatcacttaccctgaatctgaaggtacttcatctggtcttcttaat
gcttctgcacagatatttggaattttgttcacattggctcaaggaaagctcacatcagac
tatggtcctaaggcagggaacatttttctctgtgtctggatgtttataggcatcatatta
acaggccacattcaaaactgtcctgggccacatatgggctgcagaccacaggttgaacaa
gcttatactacaccatttcatataaaggccttgagcattcgtggattttggtatctgaga
gctctgatgaagatgtacgaggagatcaatgttgctttcatgtctggaaacacagcatcc
cttctgcagcccgtggatcaaggagtaatttcaactttcaagtcttattctttaagaaat
acacttcaaaaggctgtagctgccagaaataattattcctctcttggatctgagcaaagt
aaattgagaactttctggaaaggatgcaccatcctagatgccattaagaacattcttgat
tcatgggaggaggtcaaactatcaacattaacaggagtttggaggaagttaattcccatt
ctcatggatgattttgaggtgttcaagacttcagtggaggaaatagctgcagatgtggtg
ggaataacaagagaactggaattagaagtggaatatgaagagataccagctgacagtccc
acagaccaagaaccaaaaacggttatgttgtccaagcagtcagaatcagcaatttga
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_6|147_aa
MTHGHPKFSVHLTPISLRLASQACHIGWQLQASNIETDGLIVELKQVPGDEQARNGGPAL
KQQMHVGQQDLDTVLASEPGFLPALSELTVPKGHLAGLRHDCLDYSMEPGPVQTLKRGLL
QPTCWAAALVPSRCPPFCAHLIFGASQ
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_6|444_bp
atgacccatggccaccccaaattctctgtgcacctcacccccatctccctgaggctggct
tcccaagcatgccacattggatggcaactgcaagcatcaaatatagagacagatggcttg
attgtggaattgaagcaagtccctggggatgaacaagcaagaaatggtggccctgcactc
aaacagcagatgcacgtggggcaacaggatttggacacggtgttagcctcggagccaggt
tttcttcctgcactgtctgagctgactgttcccaaaggacacttggctggcctgagacac
gattgcctggattactccatggagccggggcctgtgcagaccctcaagagaggtcttttg
cagccaacctgttgggcagctgccctggtgccctcaaggtgcccccctttctgtgcccac
ctcatctttggggcctcacagtga
>gi568815597r:212735873_212936149|GENSCAN_predicted_peptide_7|148_aa
XLEGSHKDEGVKNEESKIHLFPGLAGEQLSGAVMIWGGLAQRRMNGLEARWRKEGQKQSG
KGQRAAISPRNLGMQTLSPKPAKSEIQLRKVSNLPQVTRQVRDRTKAFEQDFQMIGVHAG
LTVSAVGMDGAHGKGANGKVLLTVLQWP
>gi568815597r:212735873_212936149|GENSCAN_predicted_CDS_7|447_bp
nagctggagggaagtcacaaggatgaaggagtgaaaaatgaggagtcaaaaattcactta
tttccaggtcttgctggggagcagctcagtggggcagtgatgatctggggaggactagcc
caaagaagaatgaacgggctagaggccaggtggaggaaggaaggccagaaacaaagtggc
aagggacaaagagcagccatatcgccaaggaacttaggaatgcaaactctcagccccaaa
ccagctaaatctgaaattcagctccgaaaagtcagtaacctgccccaagtcacacggcaa
gtaagggacaggacaaaagcatttgaacaagattttcagatgattggggtgcacgcaggt
ctgacagtctctgctgtaggaatggatggggcccacgggaagggtgcaaatgggaaagtg
cttcttactgtcctccagtggccataa