GENSCAN 1.0	Date run: 5-Nov-116	Time: 06:02:44

Sequence gi568815595r:122314539_122515528 : 200990 bp : 39.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    705    973  269  1  2   10   54   240 0.389   7.47
 1.02 PlyA +    986    991    6                               1.05

 2.00 Prom +   1078   1117   40                              -9.35
 2.01 Init +   3269   3456  188  1  2   55    8   211 0.476   8.38
 2.02 Intr +   3792   4000  209  0  2   26   69   231 0.103  12.80
 2.03 Intr +  10723  10820   98  2  2   66  113    75 0.020   6.61
 2.04 Intr +  23009  23110  102  1  0  109   72   116 0.997  11.55
 2.05 Term +  26901  27029  129  2  0  109   43   132 0.996   8.00
 2.06 PlyA +  27105  27110    6                               1.05

 3.06 PlyA -  27160  27155    6                               1.05
 3.05 Term -  45381  45331   51  0  0   93   43    89 0.847   1.35
 3.04 Intr -  48489  48430   60  0  0  116   70    37 0.639   2.71
 3.03 Intr -  53784  53638  147  0  0   84  115    57 0.982   7.51
 3.02 Intr -  57262  57137  126  1  0   17   84    97 0.657   2.16
 3.01 Init -  58498  58445   54  1  0   32  116    56 0.687   4.13
 3.00 Prom -  61768  61729   40                              -8.35

 4.00 Prom +  66187  66226   40                              -7.25
 4.01 Init +  69728  69761   34  1  1   99   94    50 0.492   7.02
 4.02 Intr +  78544  78648  105  2  0   76   91    51 0.811   3.47
 4.03 Intr +  88222  88344  123  2  0  117  105     3 0.956   4.54
 4.04 Intr +  89720  89825  106  0  1   89   80   127 0.999  10.35
 4.05 Intr +  92743  92851  109  1  1   88  100   130 0.995  13.57
 4.06 Term +  94767  94949  183  2  0    1   48   184 0.704   2.26
 4.07 PlyA +  95443  95448    6                               1.05

 5.02 PlyA -  95476  95471    6                               1.05
 5.01 Sngl - 100990  99998  993  1  0   87   36   486 0.534  39.93
 5.00 Prom - 111863 111824   40                              -8.15

 6.15 PlyA - 112122 112117    6                               1.05
 6.14 Term - 112634 112447  188  0  2   87   48   272 0.963  19.77
 6.13 Intr - 113178 113000  179  2  2   91   95   201 0.998  19.74
 6.12 Intr - 119250 119123  128  0  2   62  108   104 0.802   8.36
 6.11 Intr - 127578 127500   79  2  1  111   80    83 0.831   8.43
 6.10 Intr - 135199 135036  164  1  2  112   58   168 0.025  14.05
 6.09 Intr - 137095 136996  100  0  1   86  116    92 0.719  10.99
 6.08 Intr - 137526 137438   89  0  2   82   92     7 0.570  -1.65
 6.07 Intr - 139426 139332   95  2  2   33   98   119 0.187   6.16
 6.06 Intr - 143290 143188  103  1  1   29   55   136 0.191   3.23
 6.05 Intr - 146780 146686   95  0  2    2   80    88 0.812  -1.84
 6.04 Intr - 149503 149404  100  1  1   79  105    84 0.938   7.96
 6.03 Intr - 152891 152784  108  2  0   75   63   154 0.948  11.16
 6.02 Intr - 182032 181899  134  2  2   82   99   147 0.845  14.64
 6.01 Init - 183987 183861  127  0  1   77   85    41 0.813   3.07
 6.00 Prom - 189662 189623   40                              -6.65

 7.03 PlyA - 189672 189667    6                               1.05
 7.02 Term - 199815 199472  344  1  2  -28   35   339 0.584  10.69
 7.01 Intr - 200169 200015  155  2  2  116   35    77 0.456   3.99

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  20799  20825   27  2  0   77  115    -6 0.887   0.65
S.002 Sngl + 130401 130883  483  2  0   88   43   223 0.962  13.62
S.003 Term - 135199 135032  168  1  0  112   44   180 0.973  12.90
S.004 Term - 181286 181215   72  2  0   40   42   110 0.882  -1.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_1|89_aa
XPSGHTHKQLDIKRNTPMVEDTSGWKLRGTHPRKSTPTGTDRFRQAIDWWNNAEFGWGGR
RRAQLPGSPIAGENHLPTPAPFWPSHLPR

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_1|270_bp
natcctagcgggcacacacacaagcagctggacatcaagaggaacacaccaatggttgag
gacacaagcggctggaaattgagaggaacacacccgcgtaagagcacacccacaggcact
gacagatttcggcaggccatcgactggtggaacaatgcagagtttggctggggtggtcgg
agaagagcccagctgcccggaagcccgattgcaggggaaaaccaccttcccactccagcc
cccttctggccttcccatctacctcgctga

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_2|241_aa
MEEVDAAMNARPHKEDGRVVEPKRAVSREDSQRLGAHLTLKKIFVGGIKEDTEEHHLRDY
FEDCGDGGYGGNKDGYNGFGSDSGYEGGSPSYSGGSRGYGSGGQDSGNQGSGYDGGGSHD
SYNNGVGVSDFGASCPAKKQSAKMIPGGLSEAKPATPEIQEIVDKVKPQLEEKTNETYGK
LEAVQYKTQVVAGTNYYIKVRAGDNKYMHLKVFKSLPGQNEDLVLTGYQVDKNKDDELTG
F

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_2|726_bp
atggaggaggtagatgcagccatgaatgcaaggccacacaaggaggatggaagagttgtg
gaaccaaagagagctgtctcaagagaagattctcagagactaggtgcccacttaactctg
aaaaagatatttgttggtggcattaaagaagacactgaagaacatcacctaagagattat
tttgaagactgtggtgatggtggatatggtggcaataaggatggctataatggatttggt
agtgatagtggttatgaaggaggcagccctagttactctggaggaagcagaggctacgga
agtggtggacaggattctggaaaccagggcagtggctatgatgggggtggcagccatgac
agctataacaatggagtaggcgtaagtgactttggtgcatcctgtccagcaaagaagcaa
tcagccaaaatgatacctggaggcttatctgaggccaaacccgccactccagaaatccag
gagattgttgataaggttaaaccacagcttgaagaaaaaacaaatgagacttacggaaaa
ttggaagctgtgcagtataaaactcaagttgttgctggaacaaattactacattaaggta
cgagcaggtgataataaatatatgcacttgaaagtattcaaaagtcttcccggacaaaat
gaggacttggtacttactggataccaggttgacaaaaacaaggatgacgagctgacgggc
ttttag

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_3|145_aa
MDEERKELVSVMDLSLAKELLKVMRTIDDRIVHELNTTVPTASFAGKIDASQTCKQLYES
LMAAHASRDRVIKNCIAQTSAVVKNLREEREKNLDDLTLLKQLRKEQTKLKWMQSELNVE
EVVNDRSWKVFNERCRIHFKPPKNE

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_3|438_bp
atggacgaggaaagaaaggagttggtttcagtcatggatttaagtttggcaaaggaatta
ctcaaggtgatgaggacaattgatgacagaatagtacatgaattaaacactacggttcca
acagcttcctttgcagggaaaattgatgccagccaaacctgtaaacaactttatgagtct
ttgatggcagctcatgccagtagagacagagtcataaaaaactgtatagcccagacttca
gcagtagtaaaaaacctccgagaagagagagaaaagaatttggacgatttaacgttatta
aaacaacttagaaaagagcagacaaagttgaaatggatgcagtcagaactgaatgttgaa
gaagtggtaaatgacaggagctggaaggtgtttaatgaacgctgccgaattcacttcaag
cctccaaagaatgaataa

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_4|219_aa
MGSLSGLRLAAGGTDVVFARSRQSFLAFIALENKINEAFYEKDCVPGSCFRLCERDVSSS
LRLTRSSDLKRINGFCTKPQESPGAPSRTYNRVPLHKPTDWQKKILIWSGRFKKEDEIPE
TVSLEMLDAAKNKMRVKISYLMIALTVVGCIFMVIEGKKRSEELEGSEYADKLGKEKWQQ
QRKVKEEKRDRNGNEKVFSIKEKWGGMRRKYKRKGEDKD

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_4|660_bp
atggggagcctcagcggtctgcgcctggcagcaggtggcactgatgttgtgtttgccaga
tccaggcaaagctttctggcatttatagccctagaaaacaaaataaatgaggccttttat
gaaaaagactgtgtgccaggaagctgttttaggttatgtgaaagagatgtttcctcatct
ctaaggcttaccagaagctctgatttgaagagaataaatggattttgcacaaaaccacag
gaaagtcccggagctccatcccgcacttacaacagagtgcctttacacaaacctacggat
tggcagaaaaagatcctcatatggtcaggtcgcttcaaaaaggaagatgaaatcccagag
actgtctcgttggagatgcttgatgctgcaaagaacaagatgcgagtgaagatcagctat
ctaatgattgccctgacggtggtaggatgcatcttcatggttattgagggcaagaagaga
agtgaggaactagaaggatctgagtatgcagataagttggggaaggagaaatggcagcaa
cagagaaaagtgaaagaagaaaaacgggatagaaatggcaatgagaaggtatttagtata
aaagagaagtggggtggaatgagaagaaagtataagagaaaaggagaggataaagactga

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_5|330_aa
MATKESRDAKAQLALSSSANQSKEVPENPNYALKCTLVGHTEAVSSVKFSPNGEWLASSS
ADRLIIIWGAYDGKYEKTLYGHNLEISDVAWSSDSSRLVSASDDKTLKLWDVRSGKCLKT
LKGHSNYVFCCNFNPPSNLIISGSFDETVKIWEVKTGKCLKTLSAHSDPVSAVHFNCSGS
LIVSGSYDGLCRIWDAASGQCLKTLVDDDNPPVSFVKFSPNGKYILTATLDNTLKLWDYS
RGRCLKTYTGHKNEKYCIFANFSVTGGKWIVSGSEDNLVYIWNLQTKEIVQKLQGHTDVV
ISAACHPTENLIASAALENDKTIKLWMSNH

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_5|993_bp
atggcaaccaaggagtcaagagacgccaaagcacagttggccctctcctcatcggccaat
cagagcaaggaagtgcctgaaaacccaaactatgctctcaaatgtactcttgtgggacac
acggaagcagtgtcatcagttaagtttagtcctaatggagaatggctagcaagttcttct
gctgataggctaatcataatttggggagcatatgatggaaaatatgagaaaacactctat
ggtcataatttggaaatatcggatgttgcctggtcatcagattccagtcgtcttgtttct
gcctcagatgataaaactctaaaattatgggatgtgagatctggaaaatgtttgaaaaca
ctgaaggggcacagtaattatgtcttttgttgtaacttcaatccgccatccaaccttata
atctcgggatcttttgatgagactgtaaaaatatgggaggtgaaaacaggaaagtgtctc
aagactttgtctgctcattctgacccagtttctgctgttcattttaattgtagtgggtcc
ttgatagtgtcaggtagctatgatggcctctgtagaatctgggatgctgcatcaggtcag
tgtttaaaaacgctcgttgatgacgataaccctcctgtctcttttgtaaaattttctcca
aatggtaaatacattctcactgcaactttggacaacactcttaaactatgggattatagc
agaggcaggtgcctgaaaacatacactggtcataagaatgagaaatattgcatatttgcc
aatttttcagttactggtggaaagtggattgtgtctggttccgaggataacctggtttac
atttggaaccttcagactaaagagattgtgcagaaattacaaggccatacagatgttgtg
atctcagcagcttgtcatcctacagaaaacctcatcgcatcagcagcattagaaaatgac
aaaacaattaaactgtggatgagtaaccactaa

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_6|562_aa
MVRLGCCNEIPETRWLTDNRNLFLTVLDAGKSKIKALADSVSEIMTTPGKENFRLKSYKN
KSLNPDEMRRRREEEGLQLRKQKREEQLFKRRNVATAEEETEEEVMSDGGFHEAQISNME
MAPGGVITSDMIEMIFSKSPEQQLSATQKFRKLLSKEPNPPIDEVISTPGVVARFVEFLK
RKENCTLQEPEWNRQSGQQQSPDPNGEREPGVPQPGVGNLIGGNSLQTRIVIQAGAVPIF
IELLSSEFEDVQEQAVWALGNIAGDSTMCRDYVLDCNILPPLLQLFSKQNRLTMTRNAVW
ALSNLCRGKSPPPEFAKVSPCLNVLSWLLFVSDTDVLADACWALSYLSDGPNDKIQAVID
AGVCRRLVELLMHNDYKVVSPALRAVGNIVTGDDIQTQTVIDANIFPALISILQTAEFRT
RKEAAWAITNATSGGSAEQIKYLVELGCIKPLCDLLTVMDSKIVQVALNGLENILRLGEQ
EAKRNGTGINPYCALIEEAYGLDKIEFLQSHENQEIYQKAFDLIEHYFGTEDEDSSIAPQ
VDLNQQQYIFQQCEAPMEGFQL

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_6|1689_bp
atggtccgtttgggttgctgtaatgaaataccagagactaggtggctaacagacaataga
aatttgtttctcacagttctggatgctgggaagtccaagatcaaggcactggcagattca
gtgtctgaaatcatgaccaccccaggaaaagagaactttcgcctgaaaagttacaagaac
aaatctctgaatcccgatgagatgcgcaggaggagggaggaagaaggactgcagttacga
aagcagaaaagagaagagcagttattcaagcggagaaatgttgctacagcagaagaagaa
acagaagaagaagttatgtcagatggaggctttcatgaggctcagattagtaacatggag
atggcaccaggtggtgtcatcacttctgacatgattgaaatgatattttccaaaagccca
gagcaacagctttcagcaacacagaaattcaggaagctgctttcaaaagaacctaaccct
cctattgatgaagttatcagcacaccaggagtagtggccaggtttgtggagttcctcaaa
cgaaaagagaattgtacactgcaggaaccagagtggaaccggcaaagtggccagcaacag
agtcctgatccaaatggagaaagggaaccaggcgtacctcaacctggagtgggaaacttg
ataggaggaaattctcttcagacccgaattgtgattcaggcaggagctgtgcccatcttc
atagagttgctcagctcagagtttgaagatgtccaggaacaggcagtctgggctcttggc
aacattgctggagatagtaccatgtgcagggactatgtcttagactgcaatatccttccc
cctcttttgcagttattttcaaagcaaaaccgcctgaccatgacccggaatgcagtatgg
gctttgtctaatctctgtagagggaaaagtccacctccagaatttgcaaaggtttctcca
tgtctgaatgtgctttcctggttgctgtttgtcagtgacactgatgtactggctgatgcc
tgctgggccctctcatatctatcagatggacccaatgataaaattcaagcggtcatcgat
gcgggagtatgtaggagacttgtggaactgctgatgcataatgattataaagtggtttct
cctgctttgcgagctgtgggaaacattgtcacaggggatgatattcagacacagactgtg
atagatgccaacattttcccagccctcattagtattttacaaactgctgaatttcggaca
agaaaagaagcagcttgggccatcacaaatgcaacttctggaggatcagctgaacagatc
aagtacctagtagaactgggttgtatcaagccgctctgtgatctcctcacggtcatggac
tctaagattgtacaggttgccctaaatggcttggaaaatatcctgaggcttggagaacag
gaagccaaaaggaatggcactggcattaacccttactgtgctttgattgaagaagcttat
ggtctggataaaattgagttcttacagagtcatgaaaaccaggagatctaccaaaaggcc
tttgatcttattgagcattacttcgggaccgaagatgaagacagcagcattgcaccccag
gttgaccttaaccagcagcagtacatcttccaacagtgtgaggctcctatggaaggtttc
cagctttga

>gi568815595r:122314539_122515528|GENSCAN_predicted_peptide_7|166_aa
XSGQSRRDSGEALGRGRVPAGPRAGVPGPDARAASVRHGPGCPNGHKAGGGEERCDRGRK
EQGWRVKILTLAKRDRLGNSQVTDRVGAGGPEGGGGSGGWRRPAAVYRCHNNMGLERLIG
GVVIIAMTELGLHAACAMRGKQRGIGLSSKREEETRMTGVKEVVMK

>gi568815595r:122314539_122515528|GENSCAN_predicted_CDS_7|501_bp
nnatccgggcagtcccggagggactcaggagaggccctggggcgagggcgggtgccggca
gggcctagagccggggttcctgggccggacgctcgcgccgcgtccgtgcgtcacgggccc
ggctgtcctaacggccacaaggccggcggaggagaggagcgctgtgacagaggcaggaag
gagcaggggtggcgtgtaaaaattctaaccctggcaaaaagggaccgtcttgggaattca
caagttacagaccgggtcggagccggcggcccggagggaggtgggggaagcggggggtgg
agacggccggctgcagtttaccgatgtcataacaacatgggcctggagcggctgatcggg
ggtgtggtaattatagcaatgacagaacttggtctgcacgccgcgtgtgcgatgcgaggg
aagcagcgaggtatcggattaagctcaaaacgggaggaggaaacaaggatgactggagtc
aaggaagtcgtcatgaagtaa