GENSCAN 1.0    Date run: 4-Nov-116    Time: 23:54:25

Sequence gi568815586r:120222196_120426023 : 203828 bp : 49.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -    495    354  142  1  1   73   67   170 0.370  12.81
 1.04 Intr -    804    668  137  2  2   61   91   154 0.998  13.21
 1.03 Intr -   1638   1523  116  0  2   78   84   154 0.871  13.25
 1.02 Intr -   2182   1956  227  2  2  119   94   223 0.999  23.70
 1.01 Init -   8790   8640  151  0  1   66   47    91 0.261   3.01
 1.00 Prom -   9068   9029   40                              -4.56

 2.00 Prom +  11491  11530   40                              -3.16
 2.01 Init +  12556  12606   51  0  0   55   85    43 0.431   1.86
 2.02 Term +  26378  26548  171  1  0  105   39   122 0.207   6.83
 2.03 PlyA +  26944  26949    6                               1.05

 3.00 Prom +  33200  33239   40                              -4.16
 3.01 Init +  43433  43745  313  1  1   65   80   117 0.393   4.09
 3.02 Term +  60052  60521  470  2  2   36   48   609 0.945  46.84
 3.03 PlyA +  60640  60645    6                               1.05

 4.00 Prom +  71663  71702   40                              -1.06
 4.01 Init +  81373  81863  491  0  2   64   61   283 0.244  17.88
 4.02 Intr +  90225  90555  331  0  1   72   70   234 0.224  15.63
 4.03 Term +  90689  90841  153  1  0  112   47    54 0.971   1.62
 4.04 PlyA +  91026  91031    6                               1.05

 5.06 PlyA -  91330  91325    6                               1.05
 5.05 Term - 100122  99998  125  1  2  114   42   185 0.996  15.05
 5.04 Intr - 102866 102739  128  1  2  101   42   161 0.996  13.12
 5.03 Intr - 103825 103666  160  2  1   81  108   349 0.884  35.25
 5.02 Intr - 114072 113887  186  1  0   67   74    60 0.485   2.16
 5.01 Init - 114966 114912   55  0  1   92   87     0 0.368   1.89
 5.00 Prom - 115958 115919   40                              -6.06

 6.15 PlyA - 116353 116348    6                               1.05
 6.14 Term - 120910 120875   36  1  0   85   52    64 0.495  -0.16
 6.13 Intr - 124094 123940  155  0  2  106   84   252 0.953  26.39
 6.12 Intr - 125319 125251   69  1  0   94   99    92 0.994  10.05
 6.11 Intr - 125493 125467   27  1  0   99   63    34 0.510   0.09
 6.10 Intr - 131184 131104   81  1  0   97  131    91 0.991  13.91
 6.09 Intr - 134824 134707  118  1  1   35   59   121 0.687   3.94
 6.08 Intr - 135703 135621   83  2  2   92  101   107 0.981  11.76
 6.07 Intr - 136858 136810   49  1  1   66  105   123 0.916  10.05
 6.06 Intr - 140940 140848   93  0  0  112   98   150 0.994  18.56
 6.05 Intr - 142560 142519   42  0  0   97  100    14 0.737   1.94
 6.04 Intr - 145897 145813   85  0  1   53   79    84 0.608   3.82
 6.03 Intr - 146078 145997   82  0  1   35   99   134 0.986   7.90
 6.02 Intr - 146292 146222   71  2  2   66   40    64 0.626  -1.77
 6.01 Init - 146786 146638  149  2  2  121   84    97 0.680  10.17
 6.00 Prom - 161064 161025   40                              -4.76

 7.04 PlyA - 162175 162170    6                               1.05
 7.03 Term - 166829 166751   79  1  1   77   38   106 0.823   1.74
 7.02 Intr - 171163 171102   62  1  2   77  107    32 0.719   1.53
 7.01 Init - 173603 173550   54  2  0   92   90    59 0.815   5.78
 7.00 Prom - 189970 189931   40                              -1.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 105558 105525   34  0  1   64   96    58 0.861   2.51

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_1|258_aa
MASNKTDKAPALWSLILGEETVDKNPNWQGRPLKETEFLVSVQALGPWAGDALLADLEST
TSHISKRPVFLSEETPYSYPTGNHTYQEIAVPPPVPPPPSSEALNGTILDPLDQWQPSSS
RFIHQQPQSSSPVYGSSAKTSSVSNPQDSVGSPCSRVGEEEHVYSFPNKQKSAEPSPTVM
STSLGSNLSELDRLLLELNAVQHNPPGFPAETNSPLGGKAGPLTKEKPKRNGGRGLEDVR
PSVESLLDELESSVPSPV

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_1|774_bp
atggcatcgaacaagaccgacaaggcccctgctctctggagcttaattcttggagaggag
acagttgacaagaacccaaattggcagggaaggcccctcaaggagactgagttcctagtc
agtgtccaggccctaggtccctgggctggggacgccctgctggcggacttggagtctacc
acctcccacatctccaaacggcctgtgttcttgtcggaggagaccccctactcataccca
actggaaaccacacataccaggagattgccgtgccaccccccgtccccccacccccgtcc
agcgaggccctcaatggcacaatccttgaccccttagaccagtggcagcccagcagctcc
cgattcatccaccagcagcctcagtcctcatcacctgtgtacggctccagtgccaaaact
tccagtgtctccaaccctcaggacagtgttggctctccgtgctcccgagtgggtgaggag
gagcacgtctacagcttccccaacaagcagaaatcagctgagccttcacccaccgtaatg
agcacgtccctgggcagcaacctttctgaactcgaccgcctgctgctggaactgaacgct
gtacagcataacccgccaggcttccctgcagagactaacagccccttgggaggcaaagct
gggcccctgacgaaagagaagcctaagcggaatgggggccggggcctggaggacgtgcgg
cccagtgtggagagtctcttggatgaactggagagctccgtgcccagccccgtn

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_2|73_aa
MDEKECESHPGGWDKINWHRVASPSLASRNQNRVYNVSKGLYATVTESNPSSTTNQQLTS
GKLLTSLCFSVVS

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_2|222_bp
atggatgaaaaagaatgtgagagccatcctggtggctgggataaaataaactggcaccgc
gtggcatcaccctcattggcgagcaggaatcagaacagagtctacaatgtcagcaaaggg
ctgtatgcgacagtcactgaatccaatcccagctccaccactaatcagcaactaacctcg
ggcaagttgctcacctctctgtgcttcagtgtcgtcagctag

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_3|260_aa
MAGPRAPAQGRASCPSRGRSSMPRNFSAASLGPERNAPPPRAPAPAPAARPAPGRPLCTR
APPPDTPASLLAPPPAHAGPAPAKSPGSGRRWAGRWRPLAAKRAASRTTANLERTFITIR
PDSMQCGLVGKIIKRFEQKGFRLVAMKFLPASEEHLKQHYIDLKDRPFFPGLVKYMNSGP
VVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVKSAEKEISL
RFKPEELVDYKSCAHDWVYE

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_3|783_bp
atggccggaccacgggcgccggctcagggtcgcgctagctgcccgtcccggggccgctcg
tctatgccccgcaacttttccgccgcgagcctcggcccggaacggaacgcgccgccgccg
cgcgcgcccgcgcccgcgcccgccgcgcgccccgcccccggccgccccctgtgcacgcgc
gccccgcccccggacacccccgcgagcttgctggccccgccccctgcgcacgctggtccc
gcccccgccaagagcccgggcagtgggcgtcgctgggcggggcggtggcgccccctcgcg
gctaagcgggcagcttcccggaccacggccaacctcgagcgcaccttcatcaccatcagg
ccggacagcatgcagtgcggcctggtgggcaagatcatcaagcgcttcgagcagaagggg
ttccgcctcgtggccatgaagttcctcccggcctctgaagaacacctgaagcagcactac
attgacctgaaggaccgcccattcttccctgggctggtgaagtacatgaactcagggccg
gtcgtggccatggtctgggaggggctgaacgtcgtgaagacaggccgagtgatgcttggg
gagaccaatccagcagattctaagccaggcaccattcgtggggacttttgcattcaggtt
ggcaggaacatcattcatggcagtgattcagtaaaaagtgctgaaaaagaaatcagccta
cggtttaagcctgaagaactggttgactacaagtcttgtgctcatgactgggtctatgaa
taa

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_4|324_aa
MSFALTFRSAKGRWIANPSQPCSKASIGLFVPASPPLDPEKVKELQRFITLSKRLLVMTG
AGISTESGIPDYRSEKVGLYARTDRRPIQHGDFVRSAPIRQRYWARNFVGWPQFSSHQPN
PAHWALSTWEKLGKLYWLVTQNVDALHTKAGSRRLTELHGCMDRAYCSVSVFLGSRVLCL
DCGEQTPRGVLQERFQVLNPTWSAEAHGLAPDGDVFLSEEQVRSFQVPTCVQCGGHLKPD
VVFFGDTVNPDKVDFVHKRVKEADSLLVVGSSLQVYSGYRFILTAWEKKLPIAILNIGPT
RSDDLACLKLNSRCGELLPLIDPC

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_4|975_bp
atgagctttgcgttgactttcaggtcagcaaaaggccgttggatcgcaaaccccagccag
ccgtgctcgaaagcctccattgggttatttgtgccagcaagtcctcctctggaccctgag
aaggtcaaagagttacagcgcttcatcaccctttccaagagactccttgtgatgactggg
gcaggaatctccaccgaatcggggataccagactacaggtcagaaaaagtggggctttat
gcccgcactgaccgcaggcccatccagcatggtgattttgtccggagtgccccaatccgc
cagcggtactgggcgagaaacttcgtaggctggcctcaattctcctcccaccagcctaac
cctgcacactgggctttgagcacctgggagaaactcggaaagctgtactggttggtgacc
caaaatgtggatgctttgcacaccaaggcggggagtcggcgcctgacagagctccacgga
tgcatggacagggcatactgttcagtcagcgtcttccttggttccagggtcctgtgcttg
gattgtggggaacagactccccggggggtgctgcaagagcgtttccaagtcctgaacccc
acctggagtgctgaggcccatggcctggctcctgatggtgacgtctttctctcagaggag
caagtccggagctttcaggtcccaacctgcgttcaatgtggaggccatctgaaaccagat
gtcgttttcttcggggacacagtgaaccctgacaaggttgattttgtgcacaagcgtgta
aaagaagccgactccctcttggtggtgggatcatccttgcaggtatactctggttacagg
tttatcctcactgcctgggagaagaagctcccgattgcaatactgaacattgggcccaca
cggtcggatgacttggcgtgtctgaaactgaattctcgttgtggagagttgctgcctttg
atagacccatgctga

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_5|217_aa
MVPCIPATPAPAVAKRSQETRECRNKDTRQRDKRKDSWARGTTTTKSRRPVVALNVWLHC
YLLDTKQKGQGKECESSPMILAAADSGISPRAVWQFRKMIKCVIPGSDPFLEYNNYGCYC
GLGGSGTPVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNK
ECEAFICNCDRNAAICFSKAPYNKAHKNLDTKKYCQS

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_5|654_bp
atggtgccctgcatcccagccactccagctccagctgtggctaaaaggagccaagagacg
agagagtgtagaaataaagacacaagacaaagagataaaagaaaagacagctgggcccgg
ggaaccactaccaccaagtcacggagaccggtagtggccctgaatgtctggctgcactgt
tatttattggatacaaagcaaaaggggcagggtaaagagtgcgagtcatctccaatgata
ctggccgccgccgacagcggcatcagccctcgggccgtgtggcagttccgcaaaatgatc
aagtgcgtgatcccggggagtgaccccttcttggaatacaacaactacggctgctactgt
ggcttggggggctcaggcacccccgtggatgaactggacaagtgctgccagacacatgac
aactgctatgaccaggccaagaagctggacagctgtaaatttctgctggacaacccgtac
acccacacctattcatactcgtgctctggctcggcaatcacctgtagcagcaaaaacaaa
gagtgtgaggccttcatttgcaactgcgaccgcaacgctgccatctgcttttcaaaagct
ccatataacaaggcacacaagaacctggacaccaagaagtattgtcagagttga

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_6|379_aa
MADPGPAGPPRSPGPRPLRPGARRSRGPFVSLLLPQQDVHRGTQLADYAGPARPSASRGP
GGRQEAQRERGEGEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQ
SRHELDSKTIDPKVAFPRRAQPKMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGKVDDAML
MFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKMVECKKAQPKEVMSPTGSARGRS
RVMPYGMDAFMLGIGMLGYPGFQATTYASRSYTGLAPGYTYQFPGQDTDGVAQAIPLTAY
GPMAAAAAAAAVVRGTGSTPSRTGGFLGTTSPGPMAELYGAANQDSGVSSYISAASPAPS
TGFGHSLGRPSLQLTEDHE

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_6|1140_bp
atggccgatccgggtccggccgggcctccccggagcccgggcccgcgccccctgcgccct
ggcgcccggcgctcacgcgggccctttgtgtctctcctcctcccgcagcaagatgttcat
cgggggactcagttggcagactacgcaggcccagctcggccctcggcttcccggggcccc
ggtgggcgccaggaggctcagcgggaacggggcgagggcgaagggctgcgcgaatacttc
ggccagttcggggaggtgaaggagtgtctggtgatgcgggaccccctgaccaagagatcc
aggggtttcggcttcgtcactttcatggaccaggcgggggtggataaagtgctggcgcaa
tcgcggcacgagctcgactccaaaacaattgaccctaaggtggccttccctcggcgagca
cagcccaagatggtgactcgaacgaagaagatctttgtgggggggctgtcggtgaacacc
acggtggaggacgtgaagcaatattttgagcagtttgggaaggtggacgacgccatgctg
atgtttgacaaaaccaccaaccggcaccgagggttcgggtttgtcacgtttgagagtgag
gacatcgtggagaaagtgtgtgaaattcattttcatgaaatcaacaacaaaatggtggaa
tgtaagaaagctcagccaaaggaggtgatgtcgccaacgggctcagcccgggggaggtct
cgagtcatgccctacggaatggacgccttcatgctgggcatcggcatgctgggttaccca
ggtttccaagccacaacctacgccagccggagttatacaggcctcgcccctggctacacc
taccagttccccggtcaggacacagatggtgtggcccaagccattcctctcactgcctac
ggaccaatggcggcggcagcggcggcagcggctgtggttcgagggacaggttcgactccc
agccgcacagggggcttcctggggaccaccagccccggccccatggccgagctctacggg
gcggccaaccaggactcgggggtcagcagttacatcagcgccgccagccctgcccccagc
accggcttcggccacagtcttgggcgccccagcctgcagctgactgaggaccacgagtga

>gi568815586r:120222196_120426023|GENSCAN_predicted_peptide_7|64_aa
MGRARWLTPVIPALLEAEVPIMHPLSACSLCHPVNALVRSRDADWLRAGPWAQRCVSIIV
RDCE

>gi568815586r:120222196_120426023|GENSCAN_predicted_CDS_7|195_bp
atgggccgggcacggtggctcacacctgtaatcccagcacttttggaggctgaggtcccc
atcatgcacccgctcagtgcttgttctctctgccatcctgtcaatgcccttgtgagatca
cgtgatgccgactggctccgagctgggccctgggctcagcgctgtgtgagcatcattgta
cgggactgtgaatag
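
The exon table and the CDS records above can be cross-checked against each other. The following is a minimal sketch, not part of the GENSCAN output: it assumes rows are whitespace-separated as in the table, that .Len equals |.End - .Begin| + 1, and that only Init, Intr, Term and Sngl rows contribute to the coding sequence. The function names `parse_exon_row` and `coding_length_by_gene` are hypothetical helpers introduced for illustration.

```python
from collections import defaultdict

# Row types assumed to contribute coding bases (Prom and PlyA rows do not).
CODING_TYPES = {"Init", "Intr", "Term", "Sngl"}


def parse_exon_row(row):
    """Parse the leading fields of one whitespace-separated table row."""
    fields = row.split()
    gene, _, exon = fields[0].partition(".")
    return {
        "gene": int(gene),
        "exon": int(exon),
        "type": fields[1],
        "strand": fields[2],
        "begin": int(fields[3]),
        "end": int(fields[4]),
        "length": int(fields[5]),
    }


def coding_length_by_gene(rows):
    """Sum the lengths of coding exons per gene, checking .Len on every row."""
    totals = defaultdict(int)
    for row in rows:
        rec = parse_exon_row(row)
        # Coordinates are inclusive, so the span is |End - Begin| + 1 on either strand.
        span = abs(rec["end"] - rec["begin"]) + 1
        if span != rec["length"]:
            raise ValueError(f"length mismatch in row: {row!r}")
        if rec["type"] in CODING_TYPES:
            totals[rec["gene"]] += rec["length"]
    return dict(totals)


# Rows for gene 7, copied from the table above.
GENE_7_ROWS = [
    "7.04 PlyA - 162175 162170 6 1.05",
    "7.03 Term - 166829 166751 79 1 1 77 38 106 0.823 1.74",
    "7.02 Intr - 171163 171102 62 1 2 77 107 32 0.719 1.53",
    "7.01 Init - 173603 173550 54 2 0 92 90 59 0.815 5.78",
    "7.00 Prom - 189970 189931 40 -1.96",
]

totals = coding_length_by_gene(GENE_7_ROWS)
print(totals)
```

For gene 7 the three coding exons sum to 79 + 62 + 54 = 195 bases, which matches the 195_bp label on GENSCAN_predicted_CDS_7; 195 / 3 = 65 codons corresponds to the 64_aa peptide plus one stop codon.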