GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:14:43 Sequence gi568815586r:42213070_42424102 : 211033 bp : 39.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 13960 14283 324 0 0 72 55 156 0.731 6.49 1.02 PlyA + 16337 16342 6 1.05 2.06 PlyA - 16402 16397 6 1.05 2.05 Term - 22928 22727 202 1 1 24 54 124 0.929 -1.52 2.04 Intr - 24655 24530 126 0 0 83 117 214 0.995 22.67 2.03 Intr - 25707 25236 472 1 1 74 50 282 0.763 14.21 2.02 Intr - 33613 33446 168 2 0 52 55 151 0.332 7.20 2.01 Init - 34318 34312 7 1 1 83 58 4 0.336 -2.15 2.00 Prom - 36324 36285 40 -2.75 3.02 PlyA - 37084 37079 6 1.05 3.01 Sngl - 38596 37442 1155 1 0 60 48 382 0.987 27.39 3.00 Prom - 39535 39496 40 -6.15 4.04 PlyA - 39692 39687 6 1.05 4.03 Term - 40871 40729 143 0 2 34 42 145 0.679 1.61 4.02 Intr - 41411 41282 130 2 1 59 39 103 0.606 1.85 4.01 Init - 47633 47538 96 2 0 79 115 27 0.803 4.86 4.00 Prom - 51181 51142 40 -4.45 5.04 PlyA - 51925 51920 6 1.05 5.03 Term - 58056 57950 107 1 2 107 50 97 0.485 5.39 5.02 Intr - 60080 59934 147 0 0 19 43 138 0.331 1.69 5.01 Init - 61192 60856 337 1 1 59 49 213 0.687 10.00 5.00 Prom - 61459 61420 40 -7.05 6.00 Prom + 65644 65683 40 -3.65 6.01 Init + 73845 74519 675 2 0 64 41 826 0.808 70.71 6.02 Intr + 74627 74873 247 1 1 11 38 218 0.581 5.21 6.03 Term + 74932 75491 560 2 2 63 55 611 0.976 48.82 6.04 PlyA + 76882 76887 6 1.05 7.09 PlyA - 77618 77613 6 1.05 7.08 Term - 90084 89926 159 0 0 35 41 146 0.015 1.56 7.07 Intr - 104378 104271 108 2 0 58 98 124 0.992 9.96 7.06 Intr - 104829 104718 112 2 1 92 115 24 0.811 4.96 7.05 Intr - 109377 109349 29 0 2 92 106 24 0.603 0.60 7.04 Intr - 111035 110950 86 0 2 87 114 5 0.297 1.62 7.03 Intr - 113286 113176 111 1 0 43 93 71 0.257 2.53 7.02 Intr - 113887 113773 115 1 1 67 70 30 0.316 -1.80 7.01 Init - 117336 117232 105 0 0 62 94 132 0.993 11.47 7.00 Prom - 121811 121772 40 -8.75 8.00 Prom + 121958 121997 40 -4.55 8.01 Init + 122834 122905 72 1 0 61 84 129 0.972 10.92 8.02 Intr + 138816 138980 165 2 0 100 30 150 0.657 9.64 8.03 Intr + 171871 171927 57 0 0 86 105 37 0.689 3.36 8.04 Intr + 180501 180620 120 2 0 54 94 127 0.998 9.67 8.05 Intr + 185785 185925 141 0 0 58 94 127 0.977 9.93 8.06 Intr + 188975 189049 75 1 0 87 62 43 0.401 0.39 8.07 Intr + 194855 194902 48 1 0 53 97 51 0.238 0.46 8.08 Term + 198201 198356 156 2 0 29 53 95 0.303 -2.95 8.09 PlyA + 198537 198542 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 65819 65894 76 1 1 83 99 45 0.904 6.40 S.002 Init + 161805 162005 201 2 0 75 47 140 0.902 7.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_1|107_aa MKRSRCRDRPQPPPPDRREDGVQRAAELSQSLPPRRRAPPGRQRLEERTGPTGPEGKEQP PALASQSAEIAASARPPPRLGSEECLCLAAHRLRREEPLCLAAQSGK >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_1|324_bp atgaagcggagccgctgccgcgaccgaccgcagccgccgccgcccgaccgccgggaggat ggagttcagcgggcagcggagctgtctcagtctttgccgccgcgccggcgagcgccgccc gggaggcagcggctggaggagcggacgggccccacggggcccgagggcaaggagcagccg cctgccttggcctcccaaagtgccgagattgcagcctctgcccggccgccaccccgtctg ggaagtgaggagtgtctctgcctggccgcccatcgtctgcgacgtgaggagcccctctgc ctggctgcccagtctggaaagtga >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_2|324_aa MDVLSLLLQASLQQQQFLSLAEAESTLHFSRYLQNQLNKTLSETPASVGHSAALSEVWGG GWVAKDSGGQESRSTWRSRAGTSVQTWTGLRTPAGPTLDQASHGMTEHPLPVGGTDVFLL RGALEREAVPGSRGYQGGNGGTARRGGAGARGSRGHRGHPRTARRLTRRGCWAALAAFEV SGAAAGWPARVTCAQAGVSPTAGGCVGSGGGSRARRPKRQPKPSSDEGYWDCSVCTFRNS AEAFKCMMCDVRKGTSTRDSKEGGKLVSYSTASLGVRGTLRNRVGGGSSEEKKQAEYLAP GRRRNIVHRGVGPGQRSGPSLKEA >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_2|975_bp atggacgtcctctccctgctcctccaagcatcgctccagcagcagcagttcctttctttg gcagaggcagaatccactttacacttttcccggtacttgcagaaccagctgaataagacc ttgtcagagacaccagcatcagtgggccacagtgctgccctttcagaggtgtggggcggg ggctgggtagctaaggacagcggaggccaggaaagtcgttctacctggagatcccgggct ggcacttctgtccagacctggacaggcctccgaacgccggccggccctaccttggaccag gcttcgcacgggatgacggagcatcccctgccagtcggtgggacagacgttttcctcctc agaggggctctggagcgcgaggcagtccccggcagccgggggtaccaagggggcaacggg ggaacagcgcggagaggcggcgccggcgcccgaggcagccgcggccaccgcggccacccc cgcaccgcccgccggctaactcgccgcgggtgctgggccgcgctggccgcgtttgaagtc tccggcgcggctgctggttggccggcgagggtcacgtgcgcccaggcaggagtttccccg acagctggaggctgcgtgggatccggcggcggctcccgagcgcggcggccgaagcggcag ccgaagccgtcctcggatgagggttactgggactgtagcgtctgcaccttccggaacagc gccgaggccttcaagtgcatgatgtgcgatgtgcggaagggcacctccacccgggacagc aaggaaggggggaagctggtgtcctactccacagccagtcttggggttagaggaaccctg agaaatagagtaggtggtggcagctcagaagagaagaaacaggctgaatacctggcacct ggaagaagaaggaatatagtacacaggggagttggcccaggacagagaagtgggcctagt ttaaaagaggcttga >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_3|384_aa MDKFLDTYTLPRLNQEEVESLSRPITGSEIEAIFNSLPTKKSPGQDRCTAEFYQRYKEEL VPFLLKQLQSTEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNINVKILNKILP NQIQQHIKKLIHHDQVGFIPGMQGWFNICKSINKIHHLNRTNDENQMIISIDAEKAFDKI QQPFMLKTLNKPRIDGTYLKIIRTIYDRPTANIILNGQKLKAFPLKPSTRQGCPLSPLLF NIVLEDLVREIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLLSNFSKV SGYKLNVQKSQAFLYTNNRQTESQIMSELPFTITAKRIKYLGIQLTRDVKGLFKVNYKPL LNEIKEDTNKWKNIPCSWLEESIS >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_3|1155_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgagtagaccaataacaggttctgaaattgaggcaatatttaatagcctaccaaccaaa aaaagtccaggacaagacagatgcacagctgaattctaccagaggtacaaagaggagctg gtaccattccttctgaaacaattacaatcaacagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttaggccaatatccctgatgaacatcaatgtgaaaatcctcaataaaatactgcca aaccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaacatatgcaaatcaattaacaaaatccatcacttaaacaga accaatgacgaaaaccaaatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaaccacgtattgatggaacgtatctcaaa ataataagaactatttatgacagacccacagccaatatcatactgaatgggcaaaaactg aaagcattccctttgaaacccagcacaagacaaggatgccctctctcaccactcctattc aatatagtgttggaagatctggtcagggaaatcaggcaagagaaagaaataaagggtatt caattaggaaaagaggaagtcaaattgtctctgtttgcagatgacatgattgtatattta gaaaaccccatcgtctcagcccaaaatctccttaagctgctaagcaacttcagcaaagtc tcaggatacaaactcaacgtgcaaaaatcacaagcattcctatacaccaataacagacaa acagagagccaaatcatgagtgaactcccattcacaattactgcaaagagaataaaatac ctaggaattcaacttacaagggatgtgaagggcctctttaaggtgaactacaaaccactg ctcaatgaaataaaagaggacacaaacaaatggaagaacattccatgctcttggttagaa gaatcaatatcatga >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_4|122_aa MGYLTIPLSCLGFHESEERKEQCSFSSPSGTQGISEQKAADSFSRPKRPCLTALKRAAVL PARRSSSENGQTASSRSQLLASKEQNWMENKFDELTEVGIRRSVITNISELKEHVLTHHK EA >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_4|369_bp atgggctacctgacaattccactaagctgtttgggcttccatgagtcagaggaaagaaaa gagcaatgctccttttcaagccccagtggcactcagggcatatcagaacaaaaggcagca gacagtttcagcagacctaaacgtccctgtctgacagctctgaagagagcagcggttctc ccagcacggcgttcaagctctgagaacggacagactgcctcctcaagatcgcagctcctc gccagtaaggaacaaaactggatggagaataagtttgatgaattgacagaagtaggcatc agaaggtcggtaataacaaacatctctgagctaaaggagcatgttctaacccatcacaag gaagcttaa >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_5|196_aa MTACWQPSQPSMALGASLASANILAALEEPFSLLLHGGSPSLGWPRSELAPSACGEVWRE RHERELGLHPVLAGQLEFRVGVGLVGPTLGGAGRSCWSWAARGLTPAPAAAEANLVGKWR TFVSSSGIVNAPISTLSKRTNQLSVKQTNRLSAKWTNQRDPGCERMNNVFSMGCRQSVIR GQPVKTRAGETHSGEN >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_5|591_bp atgacagcgtgctggcagccctcgcagccctcgatggctctgggtgcctccttggcctcc gcgaacattctggctgcgcttgaggagcccttcagcctgctgctgcacggtggaagccct tctctgggctggccgaggtcggagctggctccctcggcttgcggggaggtgtggagggag aggcatgagcgggaactggggctgcacccagtgcttgcgggccagctggagttccgggtg ggtgtgggcttggtgggccccacactcggaggggccggccggtcctgctggtcctgggca gcaaggggcttaacacccgcgccagcagctgcggaggctaatctagtggggaagtggaga acttttgtgtctagctcagggattgtaaatgcaccaatcagcaccctgtcaaaacggacc aatcagctctctgtaaaacagaccaatcggctctctgcaaaatggaccaatcagcgggat ccggggtgtgagaggatgaacaacgtcttctcaatgggttgtagacaaagtgtgataaga gggcagccagtgaagacaagagcaggggagacacattcaggggaaaactga >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_6|493_aa MNKLFIGNLSPAVTAEDLRQLFGDRKLPLAGQVLLKSRYAFVDYPDQNWAIRTIETLSGQ VELHGKIMEVDYSVSIKLRSRNIPIRNIPPHLQWEVLDGLLAQYGTVENVEQVNTDTETA VVNVTYATKEEVKIAMKKLSGHQFENHYFKISYIPDDEVSCPSPPQRAQRGDHSSWEQGQ APGGSSQARQIDFPLRVLFPTQFVGAIIGKEGLTIKNITKQSRSRKEADEAKLAEEIPLK ILAHNGLVGRLIGKEGRNLKKNEHETGTKITISSSQDLSIYNPERTITVKGTVEVCASAE IEIMKKLPCTPIASLARSRIITLIQSRRLSISSSQPRVWAPSSGRKGHTSNSWRDSWEPP SRSPLRQRKVIITWPPESQFKAQGRIFGKLKEENFFNPKEDVKLETHIRVPSSTAGRVIG KGGKTVNELQNLISAEVIVPRDQTPDENEEMIVRIIGHFFASQTAQRKIREIVQQVKQQE QKYPQGVASQRSK >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_6|1482_bp atgaacaagcttttcatcgggaacctgagccccgccgtcaccgccgaagacctccggcag ctctttggggacaggaagctgcccctggcgggacaggtcctgctcaagtcccgctacgcc ttcgtggactaccccgaccagaactgggccatccgcaccatcgagaccctctcgggtcaa gtggaattgcatgggaaaatcatggaagttgattattcagtctctataaagctaaggagc aggaacattccgattcgaaatatccctcctcacctgcagtgggaggtgttggatggactt ttggctcaatatgggacagtggagaatgtggaacaagtcaacacagacacagagaccgct gttgtcaacgtcacatatgcaacaaaagaagaagtaaaaatagccatgaagaagctaagc gggcatcagtttgagaaccactacttcaagatttcctacatcccggatgacgaggtgagc tgcccttcgccccctcagcgagcccagcgtggggaccactcttcctgggagcaaggccaa gcccctgggggctcttctcaggccagacagattgatttcccactgcgtgtcctgttcccc acccagtttgttggtgccatcatcggaaaggagggcttgaccataaagaacatcactaag cagagccggtcccggaaagaggcagatgaggccaaactagccgaagagattcctctgaaa atcttggcccacaatggcttggttggaagactgattggaaaagaaggcagaaatttgaag aaaaatgaacatgaaacagggaccaagataacaatctcatcttcgcaggatttgagcata tacaacccggaaagaaccatcactgtgaagggcacagtcgaggtctgtgccagtgctgag atagagattatgaagaagctgccctgtacccccatcgccagtttggcccgttcccgcatc atcactcttatccagagcaggagattgtcaatctcttcatcccaacccagggtgtgggcg ccatcatcgggaagaaaggggcacacatcaaacagctggcgagattcgtgggagcctcca tcaagatcgcccctgcgtcagcggaaggtcatcatcacctggccaccggaatcccagttc aaggcccagggacggatctttgggaaactgaaagaagaaaacttttttaaccccaaagaa gacgtgaagctggaaacccatatcagagtgccctcttccaccgctggccgggtgattggc aaagggggcaagaccgtgaatgaactgcagaatttaatcagtgcagaagtcatcgtgcct cgtgaccaaacgccagatgaaaatgaggaaatgatcgtcagaattatcgggcacttcttt gctagccagactgcacagcgcaagatcagggaaattgtacaacaggtgaagcagcaggag cagaaataccctcagggagtcgcctcacagcgcagcaagtga >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_7|274_aa MASWDEKDLTVPQPDTHEGSVLRRISKRGRPLAVEKTSASLWIDCAGSRRGCGFQAAGTQ APREWLEAWSVCAELSHNSKAGESRTRHRKLPPTPPRLAARTRSASISGKEMSGGLAPSK STVYVSNLPFSLTNNDLYRIFSKYGKVVKVTIMKDKDTRKSKGVAFILFLDKDSAQNCTR AINNKQLFGRVIKASIAIDNGRAAEFIRRRNYFDKSKCYECGFVCDLILLGRWTRAWHIE SCHTGPVPLQKGRGSTELVNTEVVHGRQGQKGTL >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_7|825_bp atggcctcgtgggacgagaaagacctgaccgtcccccagcccgacacccatgaagggtct gtgctgaggaggatcagtaaaagaggaaggcctcttgcggttgagaagacatcggcttca ctctggattgattgtgcaggctccaggagaggctgtgggttccaggctgctggaacgcag gctcccagggagtggttagaggcctggtctgtatgtgctgaattgtcacacaattcaaag gcaggagaatccaggacccgacacagaaaactcccgcccactcccccccgtttagcagcc cgcacacgctctgcatccatctctgggaaagaaatgagtggtggattggctccaagtaag agcacagtgtatgtatccaacttgcctttttccctgacaaacaatgacttgtaccggata ttttccaagtatggcaaagttgtaaaggttaccatcatgaaagataaagataccaggaag agtaaaggggttgcatttattttatttttggataaagactctgcacaaaactgtaccagg gcaataaacaacaaacagttatttggtagagtgataaaagcaagcattgctattgacaat ggaagagcagctgagttcatccgaaggcgaaactactttgataaatctaagtgttatgaa tgtgggttcgtgtgtgacctgattcttctgggacgctggacaagagcctggcatatagag agctgtcacactggccctgtgcccttgcagaaaggcagagggtccactgagctggttaac actgaagttgtccacggacggcaaggccaaaagggcacactgtaa >gi568815586r:42213070_42424102|GENSCAN_predicted_peptide_8|277_aa MWSEGRYEYERIPRERAPPRSHPSDGYNRLVNIVPKKPPLLDRPGEGSYNRYYSHVDYRD YDEGRSFSHDRRSGPPHRGSPCVLVPPTNGRMKEILKEVLDKPSRLTEKELAEAASKWAA EKLEKSDESNLPEISEYEAGSTAPLFTDQPEEPESNTTHGIELFEDSQLTTRSKAIASKT KEIEQRWDFTLLPMLVANFWAREIYPPITGSMDVKPIDTQSQLYQKLLEYKWLLVIWMNC IATLVADCRSYYLRWSLQQLLLLLLETVITAVTTVTA >gi568815586r:42213070_42424102|GENSCAN_predicted_CDS_8|834_bp atgtggtctgagggacgatatgaatatgaaagaattccgagagaacgagcacctcctcga agtcatcccagtgatggctacaatagactagttaatattgtgccaaagaaaccaccactg ctagacagacctggtgaaggaagctacaatagatattacagtcatgttgattaccgagac tatgacgagggccgcagtttttctcatgatcgaagaagtggtccacctcacagaggaagt ccgtgcgtcctggtgcctcctacaaacggcagaatgaaggaaatcctgaaagaggtgtta gacaaacccagtaggctaactgaaaaggaacttgctgaggctgcaagcaagtgggctgct gaaaagctagagaaatcagatgaaagtaacttgcctgaaatttctgagtatgaggcggga tccacagcaccattgtttactgaccagccagaggaacctgagtcaaacacaacacatggg atagaattatttgaagatagtcagctaaccactcgctctaaagcaatagcatcaaaaacc aaagagattgaacagagatgggatttcaccttgttacccatgttggtcgccaacttctgg gctcgagagatctacccacctattacaggctccatggatgtgaaacccatagatactcag agccaactgtaccagaagcttttagagtacaagtggcttttagttatttggatgaattgt atagccacacttgtggctgactgccgcagttactacttgagatggtcactacagcagtta ctactgttactacttgagactgtcattacagcagttactactgttactgcttga