GENSCAN 1.0	Date run: 4-Nov-116	Time: 03:05:13

Sequence gi568815593r:103450864_103660248 : 209385 bp : 36.33% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1039   1034    6                               1.05
 1.02 Term -  13797  13513  285  0  0   95   39   121 0.490   2.32
 1.01 Init -  14089  14036   54  1  0   82   76    48 0.854   4.33
 1.00 Prom -  16116  16077   40                              -4.35

 2.08 PlyA -  17373  17368    6                               1.05
 2.07 Term -  20758  20597  162  1  0   14   43   159 0.695   0.95
 2.06 Intr -  23835  23800   36  0  0   79   82    40 0.454   0.04
 2.05 Intr -  35059  34881  179  2  2   73   92   149 0.470  12.52
 2.04 Intr -  48302  48192  111  0  0   89  100   233 0.966  24.03
 2.03 Intr -  65630  65534   97  2  1  111   64    23 0.638   0.86
 2.02 Intr -  68938  68802  137  2  2  -24   81   127 0.120   0.17
 2.01 Init -  75711  75252  460  0  1   53   86   310 0.444  23.26
 2.00 Prom -  76567  76528   40                              -9.65

 3.10 PlyA -  76862  76857    6                              -0.45
 3.09 Term -  77503  77057  447  1  0   23   49   270 0.192  10.83
 3.08 Intr - 101553 101354  200  1  2   43  103   170 0.995  12.15
 3.07 Intr - 103990 103877  114  2  0  100  107   -16 0.666   1.00
 3.06 Intr - 105235 105068  168  2  0   74   99   106 0.997   9.30
 3.05 Intr - 108605 108016  590  1  2   99   99   406 0.999  34.14
 3.04 Intr - 109391 109180  212  2  2   94   59   138 0.393   8.29
 3.03 Intr - 121845 121722  124  2  1   70   66    86 0.005   4.27
 3.02 Intr - 150822 150631  192  2  0   46   52   117 0.104   1.59
 3.01 Init - 154683 154565  119  0  2   78   87    94 0.330   8.02
 3.00 Prom - 176570 176531   40                              -5.75

 4.00 Prom + 176585 176624   40                              -4.45
 4.01 Init + 190985 191081   97  1  1   67   67   122 0.496   6.52
 4.02 Term + 192349 192500  152  2  2   72   53    64 0.394  -1.61
 4.03 PlyA + 194367 194372    6                               1.05

 5.02 PlyA - 194614 194609    6                               1.05
 5.01 Term - 202089 201941  149  1  2   51   42   168 0.772   5.58

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr +  89383  89441   59  1  2   68  111    31 0.892   1.01
S.002 Term +  91077  91180  104  1  2   83   41    94 0.891   1.66
S.003 Term - 100108  99998  111  1  0   82   35    62 0.867  -2.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:103450864_103660248|GENSCAN_predicted_peptide_1|112_aa
MAVSTKVDVFERKKCAWEPKAVQNISWKTKEKQGQEEEGKSIKAQQCCPYHIMGISFLER
RLHISHPGPFLQGCRKITCFTHRSLPVSHVSLSAPAAHLGFLWRYLTPADIF

>gi568815593r:103450864_103660248|GENSCAN_predicted_CDS_1|339_bp
atggcagtaagcacaaaagtggatgtctttgaaaggaaaaagtgtgcttgggagcccaaa
gctgtccaaaatatcagttggaaaactaaagagaaacaaggccaggaggaggagggaaag
tccatcaaagcccagcaatgctgcccctatcacataatgggtatttctttcctggagaga
agactacacatcagccatccaggacctttcctgcaaggatgccgtaaaatcacttgcttc
acccatcgttcactacctgtgtcacacgtttctctgtcagcacctgcggcccacctgggc
ttcctatggaggtatctgaccccagcagacattttttag

>gi568815593r:103450864_103660248|GENSCAN_predicted_peptide_2|393_aa
MIKKLVVSNCPMGMKNRKESPSSLENVEKDKCQIIPVVNNAYHRDFEDLSNENRDKVCFQ
MHFSANTAESNCAVCASDNLFLIEKTLTNDVDLHGNQDKENLGLTSFKQEPAQCQRIAST
GPNTEEDENKNDSKENYCYQIHTLMPPQKQALKEVEELVMVPREKQLTSGPEGGSNTKKA
KNPVQQQGAYLPAISQKNPESHLFGLHQGVNLPSDILLELTSGDDQEAPGRNSAAKKAGL
QIGDAVLAVNGTEVTSVELAEAVNLAMKGTDILTLVVGSDISRCPSTPWPTCHGYLYKRT
HSGIVKGWRKRWFVLKYDGCLYYYKHNKGEGAIECSSVALDFCTCNVGKRIPDGSKSLMK
GYTGGCWDPIREPKVGQFDCIMGYVEKTNGDLD

>gi568815593r:103450864_103660248|GENSCAN_predicted_CDS_2|1182_bp
atgataaagaaactagttgtttctaattgtcccatgggaatgaaaaaccgtaaagagtca
cccagctctctggaaaatgttgaaaaagacaaatgccagataatacccgtggtgaataat
gcataccacagagattttgaagacctgtcaaatgagaatcgtgacaaagtctgttttcaa
atgcatttttcagcaaacactgctgagtccaactgtgcagtgtgtgcttcagataatctg
tttcttattgaaaaaacactgaccaatgatgtggacctacatggtaaccaagataaagaa
aaccttggactaactagttttaaacaggagccagcacagtgccaaaggattgcatcaaca
gggccaaatactgaagaggatgagaacaaaaatgactcaaaggaaaattactgttaccaa
atacacacgttgatgccacctcagaagcaagctttgaaagaggtggaagagctagtgatg
gttcccagggagaagcaattgaccagtgggccagaaggaggcagcaatacaaagaaggca
aaaaacccggttcagcagcagggagcttatttgccagcaatatcacagaagaatccagag
tctcatttatttggactacatcaaggggttaatttaccttctgacatcctgttggagttg
accagtggggatgatcaggaggcgcctggaaggaacagtgccgccaagaaagccgggctc
cagattggagatgcggtactggctgtcaatggcactgaggtcaccagtgtcgagcttgca
gaagctgtgaaccttgcaatgaaaggtacagacatcttgaccctggtggtaggatctgat
atcagccgctgtcccagtaccccttggcccacctgccacggctacctatataagaggact
cactccgggatcgtgaagggttggagaaagaggtggtttgtgctgaagtatgatggatgc
ctctactattacaaacataacaagggtgaaggggcaattgagtgcagcagcgtggcactg
gatttctgcacatgtaatgtggggaagagaattccagatggaagcaaaagcctgatgaaa
ggctacacaggagggtgttgggatccaattagggagccaaaagtgggccagtttgattgt
atcatgggatatgtggagaagactaatggagacttagactag

>gi568815593r:103450864_103660248|GENSCAN_predicted_peptide_3|721_aa
MEKPDKRPVTPATVIFSKQGKGQHCPHATTRTQPDPTEQKSKMLLRKRLVVPRLPPLSRV
YVATLRGTGDKSIKQHRAEAVKNGYYYVQEKRLEVNEHSPFFPCTGKCICKNRVSTMGLK
SDKLWIADSAVSAWPTRALGYQDFMEEMSSVKRSLKQEIVTQFHCSAAEGDIAKLTGILS
HSPSLLNETSENGWTALMYAARNGHPEIVQFLLEKGCDRSIVNKSRQTALDIAVFWGYKH
IANLLATAKGGKKPWFLTNEVEECENYFSKTLLDRKSEKRNNSDWLLAKESHPATVFILF
SDLNPLVTLGGNKESFQQPEVRLCQLNYTDIKDYLAQPEKITLIFLGVELEIKDKLLNYA
GEVPREEEDGLVAWFALGIDPIAAEEFKQRHENCYFLHPPMPALLQLKEKEAGVVAQARS
VLAWHSRYKFCPTCGNATKIEEGGYKRLCLKEDCPSLNGVHNTSYPRVDPVVIMQVIHPD
GTKCLLGRQKRFPPGMFTCLAGFIEPGETIEDAVRREVEEESGVKVGHVQYVACQPWPMP
SSLMIGCLALAVSTEIKVDKNEIEDARWFTREQTGTLGKSKENGDEGMSASVNEKQNSSF
LPKDKKQHPCDSKIHRKKGNDDPPLAVSHQLEKVLPMITFNDYGESLSENITLQNLLGTV
EDFKPAKGKFITEVKRGDFNVQIEIKKNFVCKPGIQFPTKKETSKMLPALLYQTHSEMLP
F

>gi568815593r:103450864_103660248|GENSCAN_predicted_CDS_3|2166_bp
atggagaaacctgataaaaggccagtaacaccagccaccgttatcttcagcaagcaaggt
aagggtcaacactgcccccatgctaccaccagaacacaaccggatccaacagagcaaaaa
agcaagatgctcctcagaaaaaggctagtggttccacgccttccacccttaagtagagtt
tatgtggcaactttgcgaggaactggtgacaagtctatcaagcagcatagagctgaagct
gttaagaatggatattactatgtccaggagaaaaggctagaggtcaatgaacattcacct
ttttttccatgtacggggaaatgtatttgcaaaaacagagtgagcacaatgggccttaaa
tcagacaagctgtggattgctgacagcgccgtcagcgcttggcctaccagggcccttgga
tatcaggattttatggaagaaatgtcttctgtaaaaagaagtctgaagcaagaaatagtt
actcagtttcactgttcagctgctgaaggagatattgccaagttaacaggaatactcagt
cattctccatctcttctcaatgaaacttctgaaaatggctggactgctttaatgtatgcg
gcaaggaatgggcacccagagatagtccaatttctgcttgagaaagggtgtgacagatca
attgtcaataaatcaaggcagactgcactggacattgctgtattttggggttataagcat
atagctaatttactagctactgctaaaggtgggaagaagccttggttcctaacgaatgaa
gtggaagaatgtgaaaattattttagcaaaacactactggaccggaaaagtgaaaagaga
aataattctgactggctgctagctaaagaaagccatccagccacagtttttattcttttc
tcagatttaaatcccttggttactctaggtggcaataaagaaagtttccaacagccagaa
gttaggctttgtcagctgaactacacagatataaaggattatttggcccagcctgagaag
atcaccttgatttttcttggagtagaacttgaaataaaagacaaactacttaattatgct
ggtgaagtcccgagagaggaggaagatggattggttgcctggtttgctctaggtatagat
cctattgctgctgaagaattcaagcaaagacatgaaaattgttactttcttcatcctcct
atgccagcccttctgcaattgaaagaaaaagaagctggggttgtagctcaagcaagatct
gttcttgcctggcacagtcgatacaagttttgcccaacctgtggaaatgcaactaaaatt
gaagaaggtggctataagagattatgtttaaaagaagactgtcctagtctcaatggcgtc
cataatacctcatacccaagagttgatccagtagtaatcatgcaagttattcatccagat
gggaccaaatgccttttaggcaggcagaaaagatttcccccaggcatgtttacttgcctt
gctggatttattgagcctggagagacaatagaagatgctgttaggagagaagtagaagag
gaaagtggagtcaaagttggccatgttcagtatgttgcttgtcaaccatggccaatgcct
tcctccttaatgattggttgcttagctctagcagtgtctacagaaattaaagttgacaag
aatgaaatagaggatgcccgctggttcactagagaacagactgggacattagggaaatct
aaggaaaatggagatgagggtatgtctgcttctgtgaatgaaaagcagaattcttctttt
cttcctaaagataagaaacagcatccatgtgacagcaaaatacataggaagaagggaaat
gatgatcctcctttggctgtctctcatcaactggagaaagtcttacctatgatcacattt
aatgactatggagaatcattatcagaaaacattacattacagaatctcctaggcacagta
gaagactttaagccagcaaaaggcaaatttattacagaagtgaaacgtggggatttcaat
gtgcaaatagaaatcaagaaaaactttgtctgtaaaccaggaatacagttcccaacaaaa
aaagagacatccaaaatgttgccagcgttgctctatcagacacattcagagatgctaccc
ttctag

>gi568815593r:103450864_103660248|GENSCAN_predicted_peptide_4|82_aa
MQRGSLGIMLHMVALACGWFWVTSISTSIAAPGKVNKILLGEVFESIIINNLEIRKSSLI
SQVGPKSNDERHSRQTKEEEAT

>gi568815593r:103450864_103660248|GENSCAN_predicted_CDS_4|249_bp
atgcagagaggaagcctggggatcatgctgcacatggtggccctggcctgtgggtggttc
tgggttaccagcatctccacatctattgcagccccaggaaaagttaacaaaatcttactt
ggagaagtctttgaatctataattattaataatcttgagataagaaagtcatccttgatt
tcccaggtaggccctaaatccaatgatgagcgacatagcagacagacaaaagaggaagag
gcaacgtga

>gi568815593r:103450864_103660248|GENSCAN_predicted_peptide_5|49_aa
XVYGGVQKEGACMEGTESKTRQSAVTELKKSHLTEQVLSDTGAPSNSDF

>gi568815593r:103450864_103660248|GENSCAN_predicted_CDS_5|150_bp
nttgtgtatggaggagtgcagaaggaaggtgcatgtatggaaggaacagaatcaaaaact
agacagtcagccgtcactgaattgaagaaatctcatctaacagagcaggttttgtctgat
actggagcaccttccaattccgatttttaa