GENSCAN 1.0 Date run: 6-Nov-116 Time: 14:46:54 Sequence gi568815586r:68148802_68353448 : 204647 bp : 39.41% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 247 242 6 1.05 1.04 Term - 6686 6552 135 2 0 118 39 173 0.973 12.44 1.03 Intr - 9294 9112 183 0 0 77 85 148 0.999 12.46 1.02 Intr - 9458 9390 69 2 0 120 99 39 0.990 6.66 1.01 Init - 10814 10701 114 2 0 67 78 59 0.795 3.15 1.00 Prom - 10976 10937 40 -2.95 2.00 Prom + 25356 25395 40 -3.05 2.01 Init + 26125 26189 65 0 2 78 67 29 0.414 0.47 2.02 Intr + 28841 29080 240 2 0 72 115 112 0.755 8.04 2.03 Intr + 29163 29281 119 0 2 34 87 57 0.621 -0.61 2.04 Term + 36572 36708 137 0 2 29 48 157 0.616 3.00 2.05 PlyA + 36867 36872 6 1.05 3.04 PlyA - 38515 38510 6 1.05 3.03 Term - 50071 49874 198 1 0 42 45 156 0.302 3.12 3.02 Intr - 54435 54308 128 1 2 66 24 118 0.568 2.68 3.01 Init - 57214 57049 166 1 1 49 -18 232 0.716 8.54 3.00 Prom - 61174 61135 40 -2.25 4.22 PlyA - 61366 61361 6 1.05 4.21 Term - 75383 75264 120 2 0 20 41 91 0.300 -4.91 4.20 Intr - 76482 76348 135 0 0 70 75 83 0.586 5.04 4.19 Intr - 94115 93880 236 0 2 46 93 128 0.116 5.58 4.18 Intr - 98102 98024 79 2 1 80 77 38 0.055 0.11 4.17 Intr - 102777 102712 66 0 0 64 97 103 0.990 6.98 4.16 Intr - 103846 103703 144 1 0 133 75 49 0.951 7.66 4.15 Intr - 104028 103963 66 0 0 123 58 66 0.944 5.28 4.14 Intr - 104678 104462 217 1 1 75 59 202 0.560 13.58 4.13 Intr - 123731 123619 113 2 2 91 37 92 0.190 2.66 4.12 Intr - 141643 141597 47 2 2 79 91 38 0.010 0.41 4.11 Intr - 146565 146464 102 1 0 81 50 59 0.124 0.63 4.10 Intr - 154071 153819 253 0 1 76 46 266 0.996 17.28 4.09 Intr - 164751 164642 110 1 2 107 92 27 0.555 4.08 4.08 Intr - 164952 164843 110 2 2 46 17 82 0.456 -4.09 4.07 Intr - 166464 166147 318 2 0 46 76 526 0.729 41.35 4.06 Intr - 167452 167277 176 1 2 48 72 203 0.999 12.52 4.05 Intr - 174439 174272 168 1 0 63 61 70 0.540 1.02 4.04 Intr - 176774 176640 135 2 0 57 70 56 0.492 0.54 4.03 Intr - 177485 177354 132 2 0 44 50 194 0.743 11.12 4.02 Intr - 178220 177856 365 0 2 52 49 294 0.959 15.88 4.01 Init - 183917 183662 256 2 1 18 94 181 0.027 9.14 4.00 Prom - 187559 187520 40 -5.95 5.03 PlyA - 188590 188585 6 1.05 5.02 Term - 192355 192206 150 1 0 127 42 125 0.782 8.73 5.01 Init - 197119 197069 51 1 0 71 60 56 0.306 2.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 76955 76785 171 2 0 58 100 82 0.840 4.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:68148802_68353448|GENSCAN_predicted_peptide_1|166_aa MKYTSYILAFQLCIVLGSLGCYCQDPYVKEAENLKKYFNAGHSDVADNGTLFLGILKNWK EESDRKIMQSQIVSFYFKLFKNFKDDQSIQKSVETIKEDMNVKFFNSNKKKRDDFEKLTN YSVTDLNVQRKAIHELIQVMAELSPAAKTGKRKRSQMLFRGRRASQ >gi568815586r:68148802_68353448|GENSCAN_predicted_CDS_1|501_bp atgaaatatacaagttatatcttggcttttcagctctgcatcgttttgggttctcttggc tgttactgccaggacccatatgtaaaagaagcagaaaaccttaagaaatattttaatgca ggtcattcagatgtagcggataatggaactcttttcttaggcattttgaagaattggaaa gaggagagtgacagaaaaataatgcagagccaaattgtctccttttacttcaaacttttt aaaaactttaaagatgaccagagcatccaaaagagtgtggagaccatcaaggaagacatg aatgtcaagtttttcaatagcaacaaaaagaaacgagatgacttcgaaaagctgactaat tattcggtaactgacttgaatgtccaacgcaaagcaatacatgaactcatccaagtgatg gctgaactgtcgccagcagctaaaacagggaagcgaaaaaggagtcagatgctgtttcga ggtcgaagagcatcccagtaa >gi568815586r:68148802_68353448|GENSCAN_predicted_peptide_2|186_aa MEYYAAIKRNEIMSFARTWMELFYYILYFSALVSVYQQVAFARTFVVTSDENSSHTILIK ERISETGTQVLTGFVSLAVTSASLCETMSILPPTEQHSPAGREMQIKTTRYPHTHYEMAK IWNTDNTKCWLRRGASGTLTEGMDLKTKENGWVHEGPDAGTSSEEVFLGPRGSERWPGVK TEQAER >gi568815586r:68148802_68353448|GENSCAN_predicted_CDS_2|561_bp atggaatactatgcagccataaaaaggaatgagatcatgtcctttgcaagaacatggatg gaacttttttattacatcctttatttttcggcattagtgtcagtataccaacaagttgca tttgccaggacttttgtggtgacaagtgacgaaaattccagtcacactattttgatcaaa gaaaggatctcagagacaggtactcaagtgttgacaggatttgtctctctagctgtcact tctgcttctctttgtgagacaatgtcaatcctgcctcccacagagcagcattcaccagct ggaagggaaatgcaaattaaaacaacaagatacccacacacccattatgaaatggcaaaa atctggaacactgacaacaccaaatgctggctgagacgtggagcatcaggaactctgact gaagggatggatctcaagacaaaggaaaatggttgggtgcacgaggggccagatgctgga accagttctgaagaagtgttcctggggccaagaggatctgagaggtggccaggtgtgaag actgaacaagctgagcgttaa >gi568815586r:68148802_68353448|GENSCAN_predicted_peptide_3|163_aa MDLGEIQELVDTAPQELAEDDLMEMSVSEPVPDDEEEDIEEAIPENKLTLDNWHKGDSCA QIGFENSLAQSGLVVSPGQPAYKPAARDTSLLLLLISKKYMKLEPDHFNLPTDTSLEKVK PRCSGFKMVSELPSFFPEMEGRKHTHPVSHSMVLMTVALDRGC >gi568815586r:68148802_68353448|GENSCAN_predicted_CDS_3|492_bp atggatcttggagaaattcaagagctagtagacactgcaccacaggaattagcagaagac gacttgatggagatgagtgtttctgaaccagtgccagacgatgaggaagaagacatagaa gaagcaatcccagaaaacaaattgacattagacaattggcataagggtgattcctgtgct cagatagggtttgagaattcacttgcacaaagcggactggtagtttcccctggccagcca gcctacaagccagctgctcgagatactagcctgctgcttctcctaatttcgaaaaaatat atgaaattagaaccagatcattttaacctgcccacagataccagtttagaaaaagtcaag ccacgatgttcaggtttcaaaatggtttcagagctcccgtcattttttccggagatggaa ggacgaaagcacactcacccagtcagtcactccatggttctgatgactgtggctcttgac agagggtgctag >gi568815586r:68148802_68353448|GENSCAN_predicted_peptide_4|1115_aa MLSSTWASEAGGWRIPPASRRIAAQDPANQGPVHPTQEGTPGVPPSPVPGALLAGLPPAI SISPRDLAREVGGSLSATHWSNPNGGITKEPSFISKRRVPYHDPQISKSLEWNGAISESN VVASPEPEAPETPKSQEAEQKDVTQERVHSLEASRVPKRTRSHSADSRAEGASDVENNEG VTNHTPVNENVELEHSTKVLSENVDNGFLIVYKKTNRNSIDEMQKEGELWLSLRDLAFLL SYVTAGLQQVPLDRLLRKKAGLTVVPSYNALRNSEYQRQFVWKTSKETAPAFAANQVFHN KSQFVPPFKGNSVIHETEYKRNFKGLSPVKEPKLRNDLRENRNLETVSPERKVKELREKA EFYRKRVQGTHFSRDHLNQILSDSNCCWDVSSTTSSEGTVSSNIRALDLAGDPTSHKTLQ KCPSTEPEEKGNIVEEQPQKNTTEKLGVSAPTIPVRRRLAWDTENTSEDVQKQPGEKEEE DDNEEEGDRKTGKQAFMGEQEKLDVREKSKADKMKEGSDSSVSSEKGGRLPTPKLRELGG IQRTHHDLTTPAVGGAVLVSPSKMKPPAPEQRKRMTSQDCLETSKNDFTKKESRAVSLLT SPAAGIKTVDPLPLREDSEDNIHKFAEATLPVSKIPKYPTNPPGQLPSPPHVPSYWHPSR RIQGSLRDPEFQHNDEDRLSEISARSAASSLRAFQTLARAKKRKENFWDLSVVTYIQLTQ RQFQIFFFPPGKANHTVGGLECPVTFGDNLKPPRKENISSLCHQLLELELSAMAALQKSV SSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASLADN NTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNR LSTCHIEGDDLHIQRNVQKLKDTVKKSFSSQLLVSLWQLMVTGQGCSPVLPKEAGKQTNT LTILVQQSWKPKVTAIEAQKQALISLSGVVLQNQHALDVLTTKAGGTCVLLSETCCFYIN TSGQIEESLKKKNCQFQEQLLSFFMEDVFGQLQLQGCKKIRFVEDFHSLRQKLSHCPPAV VACRWYKGWVGSLIAILGKPVGKELFILGIMVTET >gi568815586r:68148802_68353448|GENSCAN_predicted_CDS_4|3348_bp atgctgtcaagcacctgggccagcgaggctggcggctggcgaattccaccagcaagcaga cgcattgctgcgcaggacccggcgaaccagggaccggtccacccaacccaggaaggcacc cccggggtcccgcccagcccggttccgggagccttattggccggccttccgcccgccatc tctatcagccccagggatttggcaagggaggtaggcggctccctttcggccactcattgg tcaaatcctaatggaggcatcacgaaagagccaagttttatttcaaaaagaagagtccct taccatgacccacagatttcaaaatctctggagtggaatggagctatctcagagagcaat gtggttgcatcaccagaaccagaagccccggaaacaccaaaatcacaagaagcagaacaa aaggatgttactcaagaaagagttcactcactagaagcttccagggttcccaaaagaacc agatctcactctgcagactccagagctgaaggggcttcagatgtggaaaataatgagggt gtaacaaaccatacaccagttaatgaaaatgtggaactggaacattctaccaaggttctt tcagaaaatgtagataatgggtttcttatcgtgtacaagaagacaaacagaaattcaatt gatgaaatgcaaaaggaaggagagctctggctgtctctcagagatcttgcatttctgctc tcttatgtcacagctggcctgcagcaggtccctttggatagacttctgcgtaagaaagct ggattgactgttgttccttcatataatgccttgagaaattctgaatatcaaaggcagttt gtttggaagacttctaaagaaactgctccagcttttgcagccaatcaggttttccacaat aaaagccagtttgttccaccattcaaaggtaactcagtcatccatgaaactgaatacaaa agaaatttcaagggtttatctccagtgaaagaaccaaaattaagaaatgatttgagagaa aacagaaatcttgaaacagtgtctcctgaaaggaaggttaaagaactccgagaaaaggct gagttttataggaagcgagttcaggggacgcatttttctcgggaccatctgaatcagatt ttatctgatagcaactgctgttgggatgtctcctcaaccacaagctcagaaggaaccgtt agtagcaacatcagagcattagatcttgctggagatcctacaagccataagactttgcag aaatgtccttctacagaaccagaagaaaaaggaaatatcgtggaagaacagccccagaaa aataccacggagaaattgggtgtgtcagctcccaccatacccgttagaaggcggctggct tgggatacagagaacacaagtgaagacgtacagaaacagcccggggagaaagaggaggag gacgacaatgaagaggaaggggacaggaaaacgggcaagcaggcttttatgggagagcaa gagaagttggatgtacgtgagaaatctaaggcagataagatgaaagaagggtcagattct tctgtatcctcagaaaaaggaggccggcttcctactcccaagctgagagaacttggtgga atccagaggactcatcatgatctcactactccagctgttggtggtgctgttttagtgtct ccatctaagatgaagcctccagccccagaacagaggaaaagaatgacctctcaggattgt ttagaaacttcaaagaatgattttactaagaaagaaagtcgtgctgtatccctactgact tctccagctgctggtataaaaacagttgatcctctgcctttgcgggaagattctgaagac aatatccacaaatttgctgaggcaactcttccagtttcaaaaattccaaaatacccaaca aatccccctggacagttgccttctccaccacatgttccatcctactggcatccctctcga cgaattcagggctctcttagagatccagagtttcagcacaatgatgaggacagattgtct gagatttctgctcgctctgcagcttctagtctccgggcttttcaaactctggcacgagct aagaaaaggaaggagaatttctgggacctatcagttgtgacctacattcagcttacccag aggcagtttcagatattcttctttcctcctggcaaagctaaccatacggtcggaggactt gaatgcccagtgacgtttggtgacaatctgaagccaccaaggaaagaaaatatttcttct ttgtgtcaccagttgctcgagttagaattgtctgcaatggccgccctgcagaaatctgtg agctctttccttatggggaccctggccaccagctgcctccttctcttggccctcttggta cagggaggagcagctgcgcccatcagctcccactgcaggcttgacaagtccaacttccag cagccctatatcaccaaccgcaccttcatgctggctaaggaggctagcttggctgataac aacacagacgttcgtctcattggggagaaactgttccacggagtcagtatgagtgagcgc tgctatctgatgaagcaggtgctgaacttcacccttgaagaagtgctgttccctcaatct gataggttccagccttatatgcaggaggtggtgcccttcctggccaggctcagcaacagg ctaagcacatgtcatattgaaggtgatgacctgcatatccagaggaatgtgcaaaagctg aaggacacagtgaaaaagtctttcagcagccagcttctagtatctctctggcaattgatg gtaactggccagggctgttctccggtgctgcctaaagaagcaggaaaacagactaataca ctcaccatcttagtccagcaatcttggaaacctaaagtcacagccattgaggcccagaaa caggccctaatttcattgtcaggggttgtcctgcagaatcaacatgctttagatgtgctt accaccaaggcaggaggcacttgtgtgctgttaagtgaaacctgctgcttttacatcaat acttcaggtcaaatagaagaaagtttaaaaaagaaaaactgtcaatttcaagaacagctt ctgtccttcttcatggaagacgtttttggtcaactgcaattgcaaggctgcaagaaaata cgctttgtggaggactttcatagccttaggcagaaattgagccactgtcctcctgcagtt gtagcttgcagatggtacaaaggttgggttgggagccttattgcaatccttgggaagcca gttggaaaggagctcttcatcctgggcatcatggtcactgagacttaa >gi568815586r:68148802_68353448|GENSCAN_predicted_peptide_5|66_aa MGMDELSEGGSTQIGQQGWGFSIPNSHPSFGYGRHKQMSCWFLDLAFLIVSMAPTLPECL EAQDCG >gi568815586r:68148802_68353448|GENSCAN_predicted_CDS_5|201_bp atggggatggatgagctttctgagggaggaagcacgcagataggacagcagggctggggt ttctctatacccaattcccatccttcctttggctatggacgtcataagcagatgtcctgt tggttcctggatctggctttcctcattgtttcaatggcaccaacgttaccagaatgtctg gaagcacaggactgtggatga