GENSCAN 1.0    Date run: 8-Nov-116    Time: 14:32:14

Sequence gi568815597r:211379039_211581415 : 202377 bp : 44.14% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  12602  12659   58  1  1   79   97    73 0.972   8.77
 1.02 Intr +  12789  12938  150  1  0   62   95    55 0.763   3.73
 1.03 Term +  13814  13824   11  0  2  129   33     7 0.719  -2.34
 1.04 PlyA +  15520  15525    6                               1.05

 2.05 PlyA -  15789  15784    6                               1.05
 2.04 Term -  16047  16003   45  0  0   63   48    94 0.053   0.11
 2.03 Intr -  25057  24842  216  1  0   12   86   104 0.032   1.30
 2.02 Intr -  26079  25832  248  1  2   39   41   185 0.026   6.08
 2.01 Init -  28367  28067  301  2  1   88  -19   239 0.065  10.61
 2.00 Prom -  30310  30271   40                              -6.66

 3.00 Prom +  36050  36089   40                              -3.36
 3.01 Sngl +  37498  38646 1149  0  0   78   38   756 0.855  63.94
 3.02 PlyA +  39099  39104    6                               1.05

 4.00 Prom +  45716  45755   40                              -5.16
 4.01 Sngl +  75639  76385  747  2  0   58   43   270 0.770  15.69
 4.02 PlyA +  76611  76616    6                               1.05

 5.00 Prom +  76774  76813   40                             -10.74
 5.01 Sngl +  76873  77910 1038  0  0   45   47   319 0.646  20.63
 5.02 PlyA +  78060  78065    6                               1.05

 6.10 PlyA -  80024  80019    6                               1.05
 6.09 Term - 100289  99998  292  1  1   80   46   579 0.999  47.72
 6.08 Intr - 102321 102082  240  2  0   33   99   308 0.081  23.06
 6.07 Intr - 106406 106288  119  2  2  111   53    74 0.068   5.46
 6.06 Intr - 109538 109517   22  1  1   76   94     3 0.014  -2.75
 6.05 Intr - 120402 120348   55  1  1   96   89     4 0.050  -0.66
 6.04 Intr - 123134 122989  146  1  2  102   34   114 0.147   7.33
 6.03 Intr - 127129 127095   35  1  2   46  110    21 0.040  -2.88
 6.02 Intr - 135302 135170  133  1  1   88   56    94 0.534   6.85
 6.01 Init - 137686 137562  125  1  2  104   69    65 0.752   4.79
 6.00 Prom - 143891 143852   40                              -3.76

 7.00 Prom + 146678 146717   40                              -6.86
 7.01 Init + 147575 147643   69  1  0  108   77    57 0.700   5.70
 7.02 Intr + 151715 151834  120  1  0   61   99    23 0.380   1.39
 7.03 Term + 152986 153198  213  0  0  124   37    51 0.819   0.83
 7.04 PlyA + 157237 157242    6                               1.05

 8.04 PlyA - 157400 157395    6                              -0.45
 8.03 Term - 159466 159354  113  2  2  134   49    84 0.864   7.82
 8.02 Intr - 164114 164042   73  2  1   97   47    16 0.337  -2.62
 8.01 Init - 168088 167894  195  1  0  110   75   127 0.610  12.53
 8.00 Prom - 192920 192881   40                              -3.96

 9.04 PlyA - 196019 196014    6                               1.05
 9.03 Term - 196862 196350  513  2  0   28   47   181 0.540   2.34
 9.02 Intr - 198785 198685  101  0  2   96   43    35 0.326  -0.37
 9.01 Init - 199574 198953  622  2  1  110   99  1171 0.849 113.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  28367  28038  330  2  0   88   44   249 0.869  16.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_1|72_aa
MDKKSTHRNPEDARAGKYEGKHKRKKRRKQNQNQHRSRHRSVTSFSSDDPMFPSSSSSSS
GSQTDSSIEGFQ

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_1|219_bp
atggataagaagtccactcacagaaatcctgaagatgccagggctggcaaatatgaaggt
aaacacaaacgaaagaaaagaagaaagcaaaaccaaaaccagcaccgatcccgacataga
tcagtgacgtctttttcttcagatgatcctatgtttccttcttcctcatcatcgtcttca
ggaagccagacagattcaagtattgaaggttttcagtag

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_2|269_aa
MGIKQSRKTGNSKNQSLSPSPMERSSSPATEQSWMENDFDKLREEVFRRSNYSELKEEVQ
TNGKEVKNFEKNLDKWITRIINGEKSLKYLMELKTTARELQIQTTIREYYKHLYANKLEN
LEEMDKFLDTYTFPRLNQEEVESLNRPITDSEIEAIINSPPTKKNPGPDGLTAKSYQRYK
EELTESQIMSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCS
WIGRINIMKMATLPKANRMTDDDRNPTGD

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_2|810_bp
atggggataaaacagagcagaaaaaccggaaactctaaaaatcagagcctctctccttct
ccaatggaacgcagctcctcaccagcaacggaacaaagctggatggagaatgactttgac
aagttgagagaagaagttttcagaagatcaaactactccgagctaaaggaggaagttcaa
accaatggcaaagaagttaaaaactttgaaaaaaacttagacaaatggataactagaata
atcaatggagagaagtccttaaagtacctgatggagctgaaaaccacggcacgagaacta
caaatacaaactaccatcagagagtactataaacatctctacgcaaataaactagaaaat
ctagaagaaatggataaattccttgacacatacaccttcccaagactaaaccaggaagaa
gttgaatccctgaatagaccaataacagactctgaaattgaggcaataattaatagccca
ccaaccaaaaaaaatccaggaccagatggattgacagccaaatcctaccagaggtacaag
gaggagctgacagagagccaaatcatgagtgaactcccattcacaattgctacaaagaga
ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactac
aaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacattccatgctca
tggataggaagaatcaatatcatgaaaatggccacactgcccaaggcaaacagaatgacg
gatgatgatcgaaacccaaccggggactag

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_3|382_aa
MHPDATDSGGAGLSPARAAGAGGRPVSGFRGERRPESPGDAEAAAAAPGAPGGRSWWKPV
AVAALATVALSFLGPGSGEAAGAAGLSSVLLRLSLYLSCAAATFLLGTLFALVCRSPRAP
PPDFAAAWSRLAATSAARRPPGSPVYGNSHESAQFRRVVISHNMDKVLKEVFDYSYRDCI
LSWYGNLSRDEGQLYHVLLEDFWEIARQLHHRLSHVDVVKVVCNDVVRTLLTHFCDLKGA
NARHEQPRPFVLHVCLRNSDDEVRFLQTCSRVLVFCLLPSKDVQSLSLPIMLAEILTTKV
LKPVVELLSNPDYINQMLLAQLEYREQMNEHHKRAYTYSPSYEDFIKLINSNSDVEFLKQ
LRSVEGTVEKSGRRCVLVVFNN

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_3|1149_bp
atgcaccccgatgcgaccgacagtggcggcgccggcctcagccccgcgcgggccgcaggc
gccggcggccgtcctgtctcgggcttcaggggcgagcggcggccagagtccccgggggac
gccgaggcagcagcggcggcgccgggggccccgggcggccggagctggtggaagcccgtg
gcggtggccgcacttgccaccgtggccctctccttcctggggcccggcagcggggaggcg
gcgggggccgcggggctgagctccgtcctgctcaggctcagcctgtacctgagctgcgct
gcggccaccttcctgctggggaccctgttcgccctcgtctgccggagcccgcgcgccccg
ccgcccgactttgccgccgcctggagccggctggccgcgacctcagccgcccgccgccct
cctgggagtcctgtgtatggaaactcacatgagtcagctcagtttagaagggtagtaatt
tctcataatatggataaagttctgaaagaagtgttcgactacagttacagagattgcatt
ctgtcctggtatggaaacctcagcagagatgagggacaactttaccatgtgctcttggaa
gacttttgggaaattgccagacagctgcaccacagactgagtcacgtggatgtggttaaa
gttgtctgcaatgatgttgtgaggactttactcactcatttctgtgacctgaaaggtgct
aatgccagacatgaacagccaagaccttttgtgttgcacgtatgcttgaggaactcagat
gatgaagtaagatttctacaaacgtgttctcgggttctggtgttttgtctcctcccctca
aaggatgtgcagtctctcagtttacctataatgcttgcagaaattctcacaacaaaagtc
ctgaagccggtagtggagttactgagtaatccagattacattaaccaaatgctgcttgcc
cagctggagtacagagagcagatgaatgaacatcacaagagagcctacacctatagcccc
tcttatgaggacttcatcaagctcattaacagcaactctgatgtggagttcttgaagcaa
ctaaggtctgttgaaggaacggtagagaagagtggtagaagatgtgtgctggtcgttttc
aacaactga

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_4|248_aa
MELKNMARELRDECTSFSSRFDQLEERVSVIEDQMSETKQEEKFREKRIQRNEQSLQEIW
DYVKRPNLHLIGVPESDGMNGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSR
KATPRHIIVRFTKVEMKEKMLRAARGKGWVTHKGKPIRLTADLLAKTLQARREWGPIFNI
LKGKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALQELLKEALNMERNNQYQ
PLQKHAKL

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_4|747_bp
atggagctgaaaaatatggcacgagaactacgtgacgaatgcacaagcttcagtagccga
tttgatcaactggaagaaagggtatcagtgattgaagatcaaatgagtgaaacgaagcaa
gaagagaagtttagagaaaaaaggatacaaagaaacgaacaaagcctccaagaaatatgg
gactatgtgaaaagaccaaatctacatctgattggtgtacctgaaagtgatgggatgaat
ggaaccaagttggaaaacactctgcaggatattatccaggagaacttccccaatctagca
aggcaggccaacattcaaattcaggaaatacagagaacaccacaaagatactcttcgaga
aaagcaactccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatg
ttaagggcagccagagggaaaggttgggttacccacaaagggaagcccatcagactaaca
gcagatctcttggcaaaaactctacaagcaagaagagagtgggggccaatattcaacatt
cttaaaggaaagaattttcaacccagaatttcgtatccagccaaactaagcttcataagt
gaaggagaaataaaatcatttacagacaaacaaatgctgagagattttgtcaccaccagg
cctgccctacaagagctcctgaaggaagcactaaacatggaaaggaacaaccagtaccag
ccactgcaaaaacatgccaaattgtaa

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_5|345_aa
MVDFNTPQSTLDRSVRQKVNKDTQELNSTLHQADLIDIYRTLHPKSTEYAFFSAPHHTYS
KTDHIVGSKVLLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNKLLLNDYWV
HNEMKAEIKMFFESNETKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
GREKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFEKINKTDRPLARLIK
KKREKNQIDAIKNDKWDITTNPTEIQTTTREYYKHLYANKLENLEEMDKFLDTYTHPRLN
QEEVESLNTPITGSEIEALINSLPTKMESQLNSYRGTRRSWYYSF

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_5|1038_bp
atggtagactttaacaccccacagtcaacattagacagatcagtgagacagaaagttaac
aaggatacccaggaattgaactcaactctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatgcattcttctcagcaccacatcacacttattcc
aaaactgaccacatagttggaagtaaagtgctcctcagcaaatgtaaaagaacagaaatt
ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaactgctcaactacatggaaactgaacaagctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaatccaatgagaccaaagacaca
acatatcagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatctaaaattgataccctaacatcacaattaaaa
ggacgagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaagatcaacaaaactgatagaccgctagcaagactaataaag
aagaaaagagagaagaatcaaatagatgcaataaaaaatgataaatgggatatcaccacc
aatcccacagaaatacaaactaccaccagagaatactataaacacctctatgcaaataaa
ctagaaaatctagaagaaatggataaattcctggacacatacactcacccaagactaaac
caggaagaagttgaatccctcaatacaccaataacaggctctgaaattgaggcactaatt
aatagcctaccaaccaagatggagtcacagttgaattcttacagaggtacaaggaggagc
tggtactattccttctga

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_6|388_aa
MGADLTTLALGCQDSQSGLLSLAAVMASFPGPLHLIQGQYFRVNEACRRLHPPGKDKACE
SLWVCPSDAVGPLGGAAWSGIILDARVCSVTPDFHINRLFGKQKEVKMPSIYLPGKELRT
LVKGTLENLPLPSGNFQQPAWGLTQGESGIWPGPRDRNTTSMASRAFKTSFQNLIGPTEE
PSGGAAGTGDRNSTSRNLVSAKASSPGQSCGKSPAEMVLETLMMELTGQMREAERQQRER
SNAVRKVCTGVDYSWLASTPRSTYDLSPIERLQLEDVCVKIHPSYCGPAILRFRQLLAEQ
EPEVQEVSQLFRSVLQEVLERMKQEEEAHKLTRQWSLRPRGSLATFKTRARISPFASDIR
TISEDVERDTPPPLRSWSMPEFRAPKAD

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_6|1167_bp
atgggggcagatttgaccacactggctctgggctgtcaagacagccaatctgggctgctg
tctctggctgctgttatggcttcattcccggggccactccacttaatacagggacagtac
ttcagggtaaatgaggcatgcaggcgcctgcaccctccgggcaaggacaaggcgtgtgag
agtctctgggtctgccccagcgacgccgtcgggcctctgggcggtgcagcctggtccggc
atcatcctggacgcgcgggtgtgctccgtgacacctgacttccacataaacaggttgttt
ggcaagcagaaagaagttaagatgccttcaatttatcttccaggaaaagagctaaggaca
ctggtgaaaggaaccctggagaacctgccgctgccttctggcaatttccagcagccagca
tggggtcttacacagggagagtctggaatttggccagggcccagggatcggaacaccacc
tcgatggcctccagagccttcaagacctctttccagaacttaatcggacccactgaagag
ccttctgggggagcagctggcactggggacagaaatagcaccagcagaaatcttgtttct
gccaaagcctccagtcctggccagagttgcgggaagagccctgctgagatggtgctggag
acgcttatgatggagctgacggggcagatgcgagaggctgagaggcagcagcgggagcgc
agcaatgcggtcagaaaggtctgcaccggtgtggactacagctggctggccagcacaccc
cggtccacctatgacctcagccccattgagcggttgcagctggaagatgtctgcgttaag
atccacccatcctattgtgggcctgctatcctcaggttccggcagctgctggcggagcag
gagcccgaggtgcaggaggtgtcccagctcttccgctcggtgctgcaggaggtcctggag
aggatgaagcaggaagaggaggcccacaagctgacgcgccagtggagcctgcggccccgc
ggcagcctggccaccttcaagacccgcgcgcgcatctcgcccttcgccagcgacatcagg
accatctccgaggacgtggagcgggacacaccgccgccactgcggtcctggagcatgccc
gaattccgggcgcccaaagccgactga

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_7|133_aa
MMQDRGTQALALMDSPLLPRTPKCPSMLIFGLCPVLTGGGLTKALGSSQNPWGSVVWTVG
FGQGSLQLLDVWVPQARLCFLSGAGRLLLVSQWPLEHCMGWDLEEGQQVAEGPWDDPGIS
DQSMEGRHWPVAF

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_7|402_bp
atgatgcaagacagagggacacaggccttggccctgatggactccccgcttctgcccaga
acccccaagtgcccctcaatgcttatatttggcctttgcccagttctaacagggggtggt
ctcaccaaagccttgggcagcagccagaacccatggggctctgtagtgtggactgtgggc
tttggccagggctccttgcagttattagatgtttgggttccccaagccagattgtgcttc
ctctctggtgcagggcgtcttctgctggtctcccagtggccccttgagcactgcatgggc
tgggatctggaggagggacagcaggttgctgaaggcccttgggatgacccagggatctca
gaccagagcatggagggcagacactggcccgttgccttctag

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_8|126_aa
MPGLEEVFKAINLVEIGLILNSQPPTWALEKIGNPKKEPRNTPMFGGQENEEEPAKDTTK
GAASEVPEGQGAQRWQPIVTPIANRMLMKGLSVAPSQKRIPERPCPSGDMDGATSCWAKR
GQPGSK

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_8|381_bp
atgcctggcctagaggaggtatttaaagccataaacctggttgaaattggccttattttg
aattcacagccaccaacatgggctcttgagaagataggaaatccaaagaaagaaccccgg
aacactccaatgtttgggggtcaggaaaatgaagaggagccagcaaaggacaccacgaag
ggggcagccagtgaggtgccagaggggcagggagcccagaggtggcagcctattgtgaca
cccattgccaacaggatgttgatgaaaggattgagcgtggctccttctcagaagcggatt
cctgaaaggccctgcccttcaggggacatggatggggccaccagctgctgggcaaagaga
ggacagccaggaagcaagtga

>gi568815597r:211379039_211581415|GENSCAN_predicted_peptide_9|411_aa
MGCWGRNRGRLLCMLALTFMFMVLEVVVSRVTSSLAMLSDSFHMLSDVLALVVALVAERF
ARRTHATQKNTFGWIRAEVMGALVNAIFLTGLCFAILLEAIERFIEPHEMQQPLVVLGVG
VAGLLVNVLGLCLFHHHSGFSQDSGHGHSHGGHGHGHGLPKGPRVKSTRPGSSDINVAPG
EQGPDQEETNTLVANTSNSNGLKLDPAGNQNPLSRVQGAGDASGIFVGGGCGSDPKKKGQ
EESALILLQTVPKQIDIRNLIKELRNVEGVEEVHELHVWQLAGSRIIATAHIKCEDPTSY
MEVAKTIKDVFHNHGIHATTIQPEFASVGSKSSVVPCELACRTQCALKQCCGTLPQAPSG
KDAEKTPAVSISCLELSNNLEKKPRRTKAENIPAVVIEIKNMPNKQPESSL

>gi568815597r:211379039_211581415|GENSCAN_predicted_CDS_9|1236_bp
atggggtgttggggtcggaaccggggccggctgctgtgcatgctggcgctgaccttcatg
ttcatggtgctggaggtggtggtgagccgggtgacctcgtcgctggcgatgctctccgac
tccttccacatgctgtcggacgtgctggcgctggtggtggcgctggtggccgagcgcttc
gcccggcggacccacgccacccagaagaacacgttcggctggatccgagccgaggtaatg
ggggctctggtgaacgccatcttcctgactggcctctgtttcgccatcctgctggaggcc
atcgagcgcttcatcgagccgcacgagatgcagcagccgctggtggtccttggggtcggc
gtggccgggctgctggtcaacgtgctggggctctgcctcttccaccatcacagcggcttc
agccaggactccggccacggccactcgcacgggggtcacggccacggccacggcctcccc
aaggggcctcgcgttaagagcacccgccccgggagcagcgacatcaacgtggccccgggc
gagcagggtcccgaccaggaggagaccaacaccctggtggccaataccagcaactccaac
gggctgaaattggaccccgcaggaaatcagaaccccctcagtcgggtgcaaggagccggc
gacgcgtctggcatatttgtaggtgggggctgtgggtcagaccctaagaaaaagggtcag
gaggaatctgctcttattcttctacaaactgttcctaaacaaattgatatcagaaatttg
ataaaagaacttcgaaatgttgaaggagttgaggaagttcatgaattacatgtttggcaa
cttgctggaagcagaatcattgccactgctcacataaaatgtgaagatccaacatcatac
atggaggtggctaaaaccattaaagacgtttttcataatcacggaattcacgctactacc
attcagcctgaatttgctagtgtaggctctaaatcaagtgtagttccgtgtgaacttgcc
tgcagaacccagtgtgctttgaagcaatgttgtgggacactaccacaagccccttctgga
aaggatgcagaaaagaccccagcagttagcatttcttgtttagaacttagtaacaatcta
gagaagaagcccaggaggactaaagctgaaaacatccctgctgttgtgatagagattaaa
aacatgccaaacaaacaacctgaatcatctttgtga