GENSCAN 1.0 Date run: 2-Nov-116 Time: 22:58:12 Sequence gi568815596f:207455636_207697055 : 241420 bp : 41.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11266 11342 77 0 2 94 106 83 0.940 11.31 1.02 Intr + 29200 29251 52 1 1 59 107 26 0.008 -0.41 1.03 Intr + 56543 56637 95 1 2 -13 81 157 0.065 2.74 1.04 Term + 60264 60351 88 0 1 88 45 102 0.064 2.05 1.05 PlyA + 62693 62698 6 1.05 2.00 Prom + 71159 71198 40 -6.85 2.01 Init + 73547 74383 837 1 0 71 97 466 0.777 40.53 2.02 Term + 75013 75246 234 0 0 -1 55 171 0.888 0.24 2.03 PlyA + 76282 76287 6 1.05 3.00 Prom + 88694 88733 40 -4.85 3.01 Init + 100007 100114 108 1 0 78 75 133 0.867 11.27 3.02 Intr + 104591 104737 147 1 0 87 91 140 0.998 13.71 3.03 Intr + 105484 105525 42 0 0 89 109 26 0.826 2.32 3.04 Intr + 111828 111928 101 2 2 76 80 61 0.993 2.09 3.05 Intr + 114544 114686 143 1 2 85 73 145 0.915 11.78 3.06 Intr + 119637 119819 183 1 0 25 116 161 0.775 11.44 3.07 Intr + 121870 122020 151 2 1 79 87 192 0.936 16.50 3.08 Term + 141279 141423 145 0 1 115 28 145 0.858 7.60 3.09 PlyA + 141491 141496 6 1.05 4.02 PlyA - 141505 141500 6 1.05 4.01 Sngl - 156205 155831 375 1 0 68 50 151 0.724 5.19 4.00 Prom - 156831 156792 40 -4.35 5.11 PlyA - 157131 157126 6 1.05 5.10 Term - 157854 157411 444 0 0 109 32 379 0.688 28.65 5.09 Intr - 158049 157970 80 1 2 46 68 43 0.426 -3.55 5.08 Intr - 166282 166171 112 1 1 93 92 82 0.790 8.13 5.07 Intr - 168769 168594 176 2 2 77 115 145 0.982 14.84 5.06 Intr - 169933 169630 304 1 1 120 97 68 0.624 6.14 5.05 Intr - 170124 170042 83 1 2 47 94 96 0.526 4.54 5.04 Intr - 170535 170472 64 0 1 59 96 62 0.541 1.47 5.03 Intr - 177435 177315 121 2 1 97 74 33 0.375 2.38 5.02 Intr - 181359 181212 148 1 1 75 84 66 0.606 3.27 5.01 Init - 184011 184002 10 0 1 82 82 9 0.172 -0.01 5.00 Prom - 184926 184887 40 -7.45 6.00 Prom + 188805 188844 40 -5.25 6.01 Init + 195225 195282 58 2 1 62 99 41 0.671 4.12 6.02 Intr + 204249 204290 42 1 0 105 27 99 0.002 2.79 6.03 Intr + 224820 224867 48 1 0 70 79 51 0.489 0.13 6.04 Term + 226012 226460 449 2 2 32 43 494 0.557 33.69 6.05 PlyA + 228456 228461 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 204249 204304 56 1 2 105 48 108 0.963 5.24 S.002 Term - 215426 215344 83 0 2 129 43 73 0.972 3.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_1|103_aa MAARERLELESCGPRKELHPPAKQAGLQEAPWHSGFFNSGTEEILEEEDSRQREELKQRS LQQEHDDEIEKLRDHITKTTLMTEPQLTQRTEALYQDARELQL >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_1|312_bp atggctgccagagagaggctagaacttgagagctgtggccccaggaaagaactccaccct ccggcaaagcaagctgggctccaggaggctccgtggcacagtggctttttcaacagtgga acagaagagatcctggaggaggaggattccaggcaaagggaagagctaaagcaaagatcc ctccagcaggagcatgatgatgagattgagaaactgcgagaccatattaccaagacaaca ctcatgactgaaccacaactaacacagagaacagaagcgctttatcaagatgcacgtgaa cttcagctttag >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_2|356_aa MGHISRGPGRLCGLEASEGRRRQPWSGQSPARCHSIGASAPRKSMLRWRPPGFSAESRKG KPVPTGVPRRPRVPGPGTLAEAQQTFFPSPHGSSGPAASQPGLPVPRLPAASESSRGWRR SRAEPGQPQPPSSPPLTAAGAGLSGGFHQVADTPREFPHWATHKPRFLLQISAAGFTDAK AAVEFRPLLPAPGSARLRLPSPRGTPSRRPKKPEGLRQVPSPARGRGRSSCSGPSSAVSV RGRCALGTGRRWLAPWLRLLSRRRLLLPVARAAGRSGVLVPVGDGGGRRRQLRAEEVKVP AAFPVASVPAMPASQGEELQRLTYSSLHSLCNKVRNPRFPLVLYFIRALSPPLLHR >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_2|1071_bp atggggcatatttccaggggtccgggcaggttgtgtggcttggaggcctccgagggccgt cgacgtcagccatggtccgggcagagccccgcaaggtgccacagtatcggggccagcgcc ccccggaaaagcatgctccgctggcggccgccaggtttctccgcggagtctcggaaagga aaacccgtccccacgggggtcccacgacgcccccgggtcccaggccccgggacgttggca gaggcgcagcagaccttcttccccagcccgcacggctccagcggtccggccgcgtcccag cccgggctgcccgtcccccgtctcccggcggcgtcggaaagttcccggggctggcggcgg tcaagagcagagccagggcagccgcagccgccttctagcccgccgctcaccgccgcaggg gcaggcctgagcgggggtttccaccaagtcgccgacacccctcgggaattcccccactgg gccacacacaaaccccgcttcctccttcaaatttccgctgccggctttaccgatgcgaaa gccgcggtggagtttagaccactcctcccggcgccgggctcagcccgccttcgccttccc tctccccgcggaaccccttctcgtcgcccgaagaaacccgaaggtcttcggcaagttccg tccccagcgcgggggcggggccgctcgagctgctccgggcctagctcggctgtttccgtg cgcggccgctgcgcactcggcactgggcggcgctggctggctccctggctgcggctcctc agtcggcggcggctgctgctgcctgtggcccgggcggctgggagaagcggagtgttggtg ccggtgggggatggaggaggaagaaggcgacagttgagggcagaggaggtgaaggtgcct gctgccttcccggtggcttcggtgcctgccatgcccgcttcgcaaggggaagagctgcaa cgtctgacctactcttctctccactctttgtgcaataaagttcgaaacccaaggtttcct ctcgtgctgtattttattcgtgccctttcccccccgctcctacaccgttga >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_3|339_aa MESGAENQQSGDAAVTEAENQQMTVQAQPQIATLAQVSMPAAHATSSAPTVTLVQLPNGQ TVQVHGVIQAAQPSVIQSPQVQTVQSSCKDLKRLFSGTQISTIAESEDSQESVDSVTDSQ KRREILSRRPSYRKILNDLSSDAPGVPRIEEEKSEEETSAPAITTVTVPTPIYQTSSGQY IAITQGGAIQLANNGTDGVQGLQTLTMTNAAATQPGTTILQYAQTTDGQQILVPSNQVVV QAASGDVQTYQIRTAPTSTIAPGVVMASSPALPTQPAEEAARKREVRLMKNREAARECRR KKKEYVKCLENRVAVLENQNKTLIEELKALKDLYCHKSD >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_3|1020_bp atggaatctggagccgagaaccagcagagtggagatgcagctgtaacagaagctgaaaac caacaaatgacagttcaagcccagccacagattgccacattagcccaggtatctatgcca gcagctcatgcaacatcatctgctcccaccgtaactctagtacagctgcccaatgggcag acagttcaagtccatggagtcattcaggcggcccagccatcagttattcagtctccacaa gtccaaacagttcagtcttcctgtaaggacttaaaaagacttttctccggaacacagatt tcaactattgcagaaagtgaagattcacaggagtcagtggatagtgtaactgattcccaa aagcgaagggaaattctttcaaggaggccttcctacaggaaaattttgaatgacttatct tctgatgcaccaggagtgccaaggattgaagaagagaagtctgaagaggagacttcagca cctgccatcaccactgtaacggtgccaactccaatttaccaaactagcagtggacagtat attgccattacccagggaggagcaatacagctggctaacaatggtaccgatggggtacag ggcctgcaaacattaaccatgaccaatgcagcagccactcagccgggtactaccattcta cagtatgcacagaccactgatggacagcagatcttagtgcccagcaaccaagttgttgtt caagctgcctctggagacgtacaaacataccagattcgcacagcacccactagcactatt gcccctggagttgttatggcatcctccccagcacttcctacacagcctgctgaagaagca gcacgaaagagagaggtccgtctaatgaagaacagggaagcagctcgagagtgtcgtaga aagaagaaagaatatgtgaaatgtttagaaaacagagtggcagtgcttgaaaatcaaaac aagacattgattgaggagctaaaagcacttaaggacctttactgccacaaatcagattaa >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_4|124_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGDKVTDQQDPKAEEFFLVQNK MKSLPCLLLSTQTRQPSDFSIFSPPFPPFYSTKPPLSSWPIPNEPLGTPPRRGRGRAEGL LTSQ >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_4|375_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaaccctgagtggacacagcacatgtttcagagagcacagggttggg gataaggtcacagatcaacaggatcccaaggcagaagaatttttcttagtacagaacaaa atgaaaagtctcccatgtctacttctatccacacagacccggcaaccatccgatttctca attttttccccacccttcccgcctttctattccacaaaaccgccattgtcatcgtggccc atccccaatgagccgctgggcacacctcccagacggggtcgtggccgggcagaggggctc ctcacttcccagtag >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_5|513_aa MVAGVALWCGVGTGQPEEGLVRSQHHREEDEQEDENECLLAGRCANAWTLWVGSNGLPLC INGDRLYATIVNKFQMSVVNMTKIDFLLMSQSTGLRPGYQALGTFHVQRGSSRAGTTRKR QSQRQRAAEHMIEKVERHIQRQRLPRRRGKRIKSSVYPGDARGKAPVPPPATSLLHRATA RPRPRRELRGHLPEAGCRETGNAWLRAGAARPRVAGSLRPAVFAASPGRGKRSARSTRDR LPGRLSRGRGAGGMALVPYEETTEFGLQKFHKPLATFSFANHTIQIRQDWRHLGVAAVVW DAAIVLSTYLEMGAVELRGRSAVELGAGTGLVGIVAALLGTILANSRYVIEANQESLLPV VNAKGANCTDVHITFSLFHIVGAHVTITDRKVALEFLKSNVQANLPPHIQTKTVVKELTW GQNLGSFSPGEFDLILGADIIYLEETFTDLLQTLEHLCSNHSVILLACRIRYERDNNFLA MLERQFTVRKVHYDPEKDVHIYEAQKRNQKEDL >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_5|1542_bp atggtggcaggtgttgctctgtggtgtggagtggggactggacagccagaggaggggttg gtcaggtcacagcaccacagggaagaggatgaacaggaagacgaaaatgagtgcttgtta gcaggaagatgcgccaatgcttggactctctgggtaggctcaaatggcctgcccttgtgt attaatggagataggctatatgctacaatagtgaataaattccaaatgtcagtggttaac atgacaaagattgatttcttgctcatgtcacaatccactggcctgcggccagggtaccag gccctcggcaccttccacgtccagcgtgggtcctcacgggcaggcacaacgcgcaagcga caaagtcagaggcagagagcggcggagcacatgatagaaaaggtcgagaggcacattcag cgccagcgcctccctcggcggcgggggaaacggattaagtcgtccgtgtaccctggggac gcacgagggaaagccccggtcccccctcccgcaacgtcactcctccaccgcgccacggcc cggccccgccctcgccgtgagctccgggggcacctgccggaggctgggtgccgggagacc ggaaatgcgtggctccgggccggggccgcccgcccccgcgtcgcgggctctttaaggccg gcggttttcgcagccagcccggggcggggaaagcggagcgcgcgctccacgcgggaccgc ctcccgggccgtctgagcagagggcggggtgcaggcggaatggccctcgtgccctatgag gagaccacggaatttgggttgcagaaattccacaagcctcttgcaactttttcctttgca aaccacacgatccagatccggcaggactggagacacctgggagtcgcagcggtggtttgg gatgcggccatcgttctttccacatacctggagatgggagctgtggagctcaggggccgc tctgccgtggagctgggtgctggcacggggctggtgggcatagtggctgccctgctgggc acaattttagcaaatagcaggtatgtcatagaggcaaatcaagagtcacttctaccagta gttaatgcaaaaggagctaattgtactgatgtacacatcactttttctttgttccacatt gtaggtgctcatgtgactatcacggatcgaaaagtagcattagaatttcttaaatcaaac gttcaagccaacttacctcctcatatccaaactaaaactgttgttaaggagctgacttgg ggacaaaatttggggagtttttctcctggagaatttgacctgatacttggtgctgatatc atatatttagaagaaacattcacagatcttcttcagacactggaacatctctgtagcaat cactctgtgattcttttagcatgccgaattcgctatgaacgggataacaacttcttagca atgctggagaggcaatttactgtgagaaaggttcactacgatcctgaaaaagatgtacat atttacgaagcacagaagagaaaccagaaggaggacttataa >gi568815596f:207455636_207697055|GENSCAN_predicted_peptide_6|198_aa MNVHQHFYGKQSDGTGGQPEDINDADYLAVPVLGKLMSREFGVRIFHYPGQWRPRTTRLL SEPPWAAGEGPGGADDEGPVRRQVKVTVKYDRKELRKRLNLEEWILEQLTRLYDCQEEEI PELEIDVDELLDMESDDARTARVKELLVDLLQTHRDLHLWPAGQDPGHAEAEHTPEEVRV PDPGERWLPQDNCCPSTS >gi568815596f:207455636_207697055|GENSCAN_predicted_CDS_6|597_bp atgaatgttcatcagcatttttatgggaaacaatctgatggaacaggaggccagccagaa gacatcaacgatgcagattacctggcagtccccgtgctgggtaaactcatgtcacgagaa tttggtgtacgaatatttcattacccagggcagtggcggcccaggacaacgcgtctactt tcagagcccccctgggccgcaggagagggcccgggcggcgcggacgatgagggcccagtg aggcgccaagtgaaggtcaccgtcaagtatgaccgcaaggagctacggaagcgccttaac ctagaggagtggatcctggagcagctcacgcgcctctacgactgccaggaagaggagatc ccagaactggaaattgacgtggatgagctcctggacatggagagtgacgatgcccgtact gccagggtcaaggagctgctggttgacttgttacaaacccacagagaccttcatctctgg cctgctggacaagatccggggcatgcagaagctgagcacaccccagaagaagtgagggtc cccgacccaggcgaacggtggctcccacaggacaattgctgcccctcgacctcgtag