GENSCAN 1.0	Date run: 6-Nov-116	Time: 21:35:53

Sequence gi568815585f:27324739_27535594 : 210856 bp : 44.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8214   8335  122  1  2    1  -73   321 0.388   7.51
 1.02 Intr +  37670  37769  100  1  1   83   58    23 0.028  -1.52
 1.03 Intr +  40004  40037   34  0  1   97  105     6 0.162   0.48
 1.04 Intr +  47417  47523  107  2  2   46  109    78 0.195   5.56
 1.05 Intr +  48634  48821  188  2  2   71   46    65 0.157   0.01
 1.06 Term +  58869  59060  192  2  0   31   46   217 0.710   9.22
 1.07 PlyA +  59187  59192    6                               1.05

 2.09 PlyA -  61695  61690    6                               1.05
 2.08 Term -  62786  62623  164  0  2   87   50    76 0.135   1.80
 2.07 Intr -  75805  75630  176  0  2   45  106    39 0.182   0.98
 2.06 Intr -  80275  80068  208  2  1   37  103    86 0.399   3.14
 2.05 Intr -  83659  83506  154  1  1   81   26    88 0.732   1.55
 2.04 Intr -  83997  83895  103  2  1   55   62    98 0.573   4.08
 2.03 Intr -  86860  86808   53  1  2   85   94    40 0.905   2.01
 2.02 Intr -  88757  88623  135  2  0   75   43    51 0.490   0.06
 2.01 Init -  96046  95993   54  1  0   86   89    51 0.726   6.29
 2.00 Prom -  96957  96918   40                              -9.26

 3.00 Prom +  98263  98302   40                              -5.96
 3.01 Init +  99825  99946  122  2  2   68   -9   159 0.389   1.87
 3.02 Intr +  99966 100200  235  0  1   18   78   242 0.493  13.89
 3.03 Intr + 102354 102454  101  2  2   69   75    68 0.863   2.51
 3.04 Intr + 105132 105228   97  0  1   58   92    59 0.870   3.31
 3.05 Intr + 105795 105883   89  2  2   51   58    86 0.554   0.67
 3.06 Term + 107993 108080   88  2  1   75   38    88 0.580  -0.37
 3.07 PlyA + 108472 108477    6                              -0.45

 4.14 PlyA - 108753 108748    6                               1.05
 4.13 Term - 111155 110937  219  2  0  106   28    54 0.676  -1.76
 4.12 Intr - 112535 112378  158  0  2   51  115    69 0.824   5.63
 4.11 Intr - 115550 115251  300  0  0   10   82   247 0.047  13.01
 4.10 Intr - 125403 125326   78  1  0  103   75    45 0.580   4.22
 4.09 Intr - 125874 125771  104  2  2   66   95    75 0.681   5.82
 4.08 Intr - 126532 126369  164  1  2   78   33   145 0.651   6.77
 4.07 Intr - 144120 144021  100  2  1  113   89    71 0.570   9.81
 4.06 Intr - 147909 147834   76  1  1   69   96     8 0.454  -1.73
 4.05 Intr - 149665 149549  117  2  0   44   61    87 0.489   2.04
 4.04 Intr - 150292 150147  146  0  2   76   97    45 0.537   4.13
 4.03 Intr - 152417 152357   61  0  1  100   75    33 0.394   0.99
 4.02 Intr - 169184 169063  122  1  2   89   92    81 0.633   8.74
 4.01 Init - 169675 169596   80  1  2   73   31    42 0.221  -2.37
 4.00 Prom - 170904 170865   40                              -5.06

 5.00 Prom + 171694 171733   40                              -3.26
 5.01 Init + 174459 174576  118  2  1   21  116    83 0.107   4.76
 5.02 Term + 185924 186006   83  0  2   77   49   114 0.806   4.26
 5.03 PlyA + 186121 186126    6                               1.05

 6.00 Prom + 196309 196348   40                              -4.96
 6.01 Init + 203522 203580   59  1  2   79   97    27 0.295   3.50
 6.02 Intr + 208411 208698  288  1  0   50   73   196 0.146  10.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 115413 115251  163  0  1   91   82   172 0.841  16.79
S.002 Sngl + 118178 118372  195  1  0   92   37   134 0.846   3.86

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_1|247_aa
XEDEDEDEDEDEEEEEEEEEEEEEEEEEEEEEEGWMQWLMPSPELPGVRPDPWLRLASLP
RPERRSQVGRCLHRGPADFPFRKGLRGQLSGKSAAVSAAGTHTPGGGGGGEVGTKIIYYS
TGRSLAEDYGDGLSNKTAWISILVPAPGGLTSLQFRFFTCKIGIRILLEELNKLRSMEYL
EQGLTTLADLPHPYPGPIKTRDPSRQTHKPLYVEKNKSMKEDKRLDIERHQGEHAGRRAL
RQMLAGH

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_1|744_bp
naagaagatgaagatgaagacgaagacgaagatgaagaagaagaggaagaggaggaggaa
gaggaagaagaagaagaagaggaagaagaagaagaagaaggctggatgcagtggctcatg
cctagtccagagctccccggggtccgccctgacccgtggctccgcctcgcctcgctcccg
cgcccagagcggcgctcacaggtcggtcgctgcctgcacagaggtcctgctgacttcccc
ttcagaaaaggactgagagggcagctctctgggaagtcggcagctgtgtctgcagccggc
actcacacacctggcggggggggggggggggaggtgggaaccaagatcatctactacagc
acaggaagaagtttagcagaggattatggagatggtctcagcaacaagactgcctggatt
tccatcctagttcctgccccaggtggcttaacctctctgcagttccgtttcttcacctgt
aaaatagggataagaattctgttggaagaattaaataagctaagaagtatggagtacttg
gaacaaggtctgaccaccctggccgaccttccacacccttatcctggacctataaaaacc
cgagaccctagcaggcagacacacaagccgctgtacgtggagaaaaacaaatcgatgaaa
gaagacaagcggctggacatcgagagacatcaaggggagcatgccggcagaagagcactc
agacagatgctggcaggccattga

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_2|348_aa
MKVNLEGELKQNYGVPAQMFFGHLRVACCASPKAMLRACATEPSSSSLSIKGIETTGASA
ILVIRKLGHRVVKRMAQSHSYFCTLNCTIPINPETRCFYVFIGYNYLQASSGNRKASPPC
QSVELFKQTNHILPWKPGSTSSSCYYKASSHSPYWFIQCRRCVALHTSHPGANLIFAQQT
RRRLPNLSPVPTRASEPADFLPGPAGLAHVLAPGSASLFLLSHGLTCNTHISPVRPALIG
AFYKPLGSYRAVISVFYNPAYADWCILQSSCKTEKFSKSPLNPGSPAIFTSQHLASYGHT
RLGQRGAASQRRQEMGISTWRSPQPSPPPQRLSPLHTGGATSYQSHKT

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_2|1047_bp
atgaaggtcaacctcgaaggtgaacttaaacagaactatggtgtgcctgctcaaatgttt
tttgggcatctccgtgtggcatgctgtgccagccccaaggcgatgctcagggcatgtgcc
acagaaccttccagctcctcgctgtctattaaaggcattgagacaactggggccagtgcc
atattagtgataaggaaactgggtcaccgagtggttaagcgaatggcccagtcacacagc
tatttctgtaccctcaactgtaccattcccatcaatccagagacccggtgcttttacgtc
ttcattgggtataactatctccaggcttcttctggaaatagaaaggcttcacctccttgc
cagtctgtggaattattcaaacaaaccaatcacatcctcccctggaaaccagggagcacc
tcatcctcctgttactacaaagcctcctctcacagcccctactggttcatccagtgccgc
cgctgtgtggccctgcacacctcccaccccggcgccaacctgatctttgcccagcagacg
cggaggcgcctgcccaacctcagccctgtgcccacgagggcctcagagcccgcagacttc
ctgccgggacctgctggccttgcccatgttttagctcctggctctgctagtcttttcctg
ctgtctcatggtctcacgtgtaacactcacatctccccagtgagaccagcactgattggt
gcattttacaaacctctaggtagctacagagcagtgattagtgtgttttacaatcctgcc
tatgctgattggtgcattttacaatcctcttgtaagacggaaaagttctccaagtcccca
ctcaacccaggaagtccagctatcttcacctctcaacacctggcctcctatggccacaca
cggctgggccagaggggagctgccagccagaggcgacaggaaatgggcattagcacctgg
cgcagtcctcaaccctcgcctcctccccagcgcctcagccctttgcacactggtggtgca
accagttaccaatcgcataagacttag

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_3|243_aa
MRSSGADAGRCLVTARAPGSVPASREGSAGSRGPGAPVPGTAPGPGLGGAGALDPPAVVA
ESVSSLTIADAFIAAGESSAPTPPRPALPRRFICSFPDCSANYSKAWKLDAHLCKHTGER
PFVCDYEGCGKAFIRDYHLSRHILTHTGEKPFVCAANGCDQKFNTKSNLKKHFERKHENQ
QKQYICSFEDCKKTFKKHQQLKIHQCQHTNEPLFKCTQEGCGKHFASPSKLKRHAKAHEG
VYG

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_3|732_bp
atgcgcagcagcggcgccgacgcggggcggtgcctggtgaccgcgcgcgctcccggaagt
gtgccggcgtcgcgcgaaggttcagcagggagccgtgggccgggcgcgccggttcccggc
accgcgcctggccctgggcttggaggcgccggcgccctggatccgccggccgtggtcgcc
gagtcggtgtcgtccttgaccatcgccgacgcgttcattgcagccggcgagagctcagct
ccgaccccgccgcgccccgcgcttcccaggaggttcatctgctccttccctgactgcagc
gccaattacagcaaagcctggaagcttgacgcgcacctgtgcaagcacacgggggagaga
ccatttgtttgtgactatgaagggtgtggcaaggccttcatcagggactaccatctgagc
cgccacattctgactcacacaggagaaaagccgtttgtttgtgcagccaatggctgtgat
caaaaattcaacacaaaatcaaacttgaagaaacattttgaacgcaaacatgaaaatcaa
caaaaacaatatatatgcagttttgaagactgtaagaagacctttaagaaacatcagcag
ctgaaaatccatcagtgccagcataccaatgaacctctattcaagtgtacccaggaagga
tgtgggaaacactttgcatcacccagcaagctgaaacgacatgccaaggcccacgagggt
gtgtacggatag

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_4|574_aa
MGLSTKELGNSRAQGKPELGLGREKALAELSNQHLQMGYRCDREIDPFHLCGSQVGFGPA
IAPYPVGTNTRAYGSGQGQVVVMLMELRRKRRPFPGSTLFLLCVPDQGLRCAVAATLSLD
VFAGEYPRLVHDGEAVEVDEAAGKLRLLCFRSQHISVSNATVMKNSGLKPQIWKKSKWTQ
FGLMTCFAPSKQCTGLGQPLSTQLKPSSLAVIIVSVIGFLCSQQQALNQTPGISESKHSR
YEARLGGFTTVDSQSALEEFLIEEEKEGGHGAQTPSTRLSSRLHGSGSGTRGEAPPTTES
EGDSPQIRCTCVRYSMSIACPSTGGRRRRLTNNRVGGSRKCSCGLLPGTAFSTAEDTQNE
GKKTKKNKTAFSNVGRKISQRVIHLFDEKGNDLGNMHRANVIRLMDERDLRLVQRNTSTE
PAEYQLMTGLQILQERQRLREMEKANPKTGPTLRKELILSSNIGQHDLDTKTKQIQQWIK
KKHLVQITIKKGKNVDVSENEMEEIFHQILQTMPGIATFSSRPQAVQGGKALMCVLRAFS
KNEEKAYKETQETQERDTLNKDHGNDKESNVLHQ

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_4|1725_bp
atgggcttgagcacaaaagaactggggaactcgagggcccagggaaagccagagctgggc
ctgggcagagagaaagctttggcagaactttccaatcagcatctgcaaatgggttacagg
tgtgaccgagaaattgatcccttccacctttgcggcagccaagtaggatttgggccagcc
attgccccatatccagttggcactaacaccagggcttatggctcaggacaaggtcaggtg
gtggtaatgctgatggagctgaggaggaagaggaggcccttcccagggagcaccctcttc
cttctctgtgtccctgaccaaggcctgaggtgtgccgtggcggccacgctctccctggat
gttttcgctggggagtatccgaggctggtccatgacggtgaggcagtagaagtagatgaa
gcagcaggaaagctgaggctgctctgtttcagaagccagcacatctctgtatcaaatgcc
acggtcatgaaaaactcaggccttaaaccccagatttggaagaaaagcaaatggacacag
tttggtttgatgacatgctttgctcccagcaagcagtgcacaggcctgggccagcctctc
agcacccaattaaagccttcttctttggcagtaatcatcgtctcagtcattggctttctg
tgcagccagcagcaggccctaaaccaaaccccaggcatttcagaatccaaacattccagg
tatgaagccagattaggtggcttcacgactgtggactcgcagtctgcccttgaggaattt
cttattgaagaagaaaaagaggggggacacggggcccagacccccagcacccggctttcg
agcaggctccacgggtccgggtccgggacgcggggtgaagccccgcccactacggagagc
gaaggggactcgccgcagatccgctgtacttgcgtccgctacagtatgtcaatcgcttgc
cccagcacaggtgggcgtcgccgccgacttaccaacaaccgggtcgggggctcccggaag
tgctcttgcggcttactgcctggcacagcctttagtaccgctgaagacacccagaatgaa
ggaaaaaagacaaaaaagaataaaacagcttttagtaacgttggaagaaaaattagtcag
cgagttattcacttatttgatgagaagggcaatgatttgggaaacatgcaccgagcaaat
gtgattagacttatggatgagcgagacctgcgactggttcaaaggaacaccagcacagaa
cctgcagagtatcagctcatgacaggattgcagatcctccaggagcggcagaggctgagg
gagatggagaaggcgaaccccaaaactggaccaaccctgagaaaggaactgattttgtct
tcaaatattggacaacatgatttggacacaaagactaaacagattcagcagtggattaag
aaaaaacacctagtccagattaccataaagaaaggaaaaaatgtagacgtgtcagaaaat
gaaatggaggagatatttcatcaaatactccagactatgcctggaatagctacattctca
tctaggccacaagctgttcaaggaggaaaagctttaatgtgtgttcttcgtgctttcagc
aaaaatgaggagaaggcatataaagaaactcaagagacccaggaaagagacactttgaac
aaagaccatggaaatgataaggaatcaaatgttctgcatcagtaa

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_5|66_aa
MPPPEKEKTTSKKSHQKVDLIPASKGYSDAAVSLSKSYKVPTYIAGAISHIPPVSFTLTT
HQLASP

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_5|201_bp
atgcctcccccagagaaggaaaagacgaccagcaagaaaagtcatcagaaagttgattta
attcctgcatccaaaggttattctgacgctgctgtttctctttctaaatcctacaaagtg
cccacctacattgctggtgccatctcccacatccccccggtgtccttcacactcaccaca
caccagctcgccagtccttga

>gi568815585f:27324739_27535594|GENSCAN_predicted_peptide_6|116_aa
MGVAHEEALMNGLGHPLGDKYPTLPAAASLTSCTGSHRLQTGCKHRGAASEGGQSIEPAE
GAETRPSELGTSRASADHWWCPSSQGSDARLLQSAEKTNRTLHCYFLPDAHESHLX

>gi568815585f:27324739_27535594|GENSCAN_predicted_CDS_6|348_bp
atgggggtggctcatgaggaggccctcatgaatggcttgggccatccccttggtgataaa
tatccaacccttcctgctgcagcttccctgacctcctgcaccgggagccaccggctgcaa
acaggctgcaaacaccggggagctgcaagcgagggcggccagtccatcgagccagcagag
ggcgctgaaacgcggcccagtgaactagggacgtccagggccagcgctgaccactggtgg
tgcccgagctcccagggctctgatgcccggctgctccagtccgcagagaaaactaaccgg
acactccactgctactttcttcccgatgctcatgagtctcacctttgn