GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:52:46 Sequence gi568815586r:120245594_120469091 : 223498 bp : 48.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 2980 3150 171 0 0 105 39 122 0.263 6.83 1.02 PlyA + 3546 3551 6 1.05 2.00 Prom + 9802 9841 40 -4.16 2.01 Init + 20035 20347 313 0 1 65 80 117 0.393 4.09 2.02 Term + 36654 37123 470 1 2 36 48 609 0.945 46.84 2.03 PlyA + 37242 37247 6 1.05 3.00 Prom + 48265 48304 40 -1.06 3.01 Init + 57975 58465 491 2 2 64 61 283 0.244 17.88 3.02 Intr + 66827 67157 331 2 1 72 70 234 0.224 15.63 3.03 Term + 67291 67443 153 0 0 112 47 54 0.971 1.62 3.04 PlyA + 67628 67633 6 1.05 4.06 PlyA - 67932 67927 6 1.05 4.05 Term - 76724 76600 125 0 2 114 42 185 0.996 15.05 4.04 Intr - 79468 79341 128 0 2 101 42 161 0.996 13.12 4.03 Intr - 80427 80268 160 1 1 81 108 349 0.884 35.25 4.02 Intr - 90674 90489 186 0 0 67 74 60 0.485 2.16 4.01 Init - 91568 91514 55 2 1 92 87 0 0.368 1.89 4.00 Prom - 92560 92521 40 -6.06 5.15 PlyA - 92955 92950 6 1.05 5.14 Term - 97512 97477 36 0 0 85 52 64 0.495 -0.16 5.13 Intr - 100696 100542 155 2 2 106 84 252 0.953 26.39 5.12 Intr - 101921 101853 69 0 0 94 99 92 0.994 10.05 5.11 Intr - 102095 102069 27 0 0 99 63 34 0.510 0.09 5.10 Intr - 107786 107706 81 0 0 97 131 91 0.991 13.91 5.09 Intr - 111426 111309 118 0 1 35 59 121 0.687 3.94 5.08 Intr - 112305 112223 83 1 2 92 101 107 0.981 11.76 5.07 Intr - 113460 113412 49 0 1 66 105 123 0.916 10.05 5.06 Intr - 117542 117450 93 2 0 112 98 150 0.994 18.56 5.05 Intr - 119162 119121 42 2 0 97 100 14 0.737 1.94 5.04 Intr - 122499 122415 85 2 1 53 79 84 0.608 3.82 5.03 Intr - 122680 122599 82 2 1 35 99 134 0.986 7.90 5.02 Intr - 122894 122824 71 1 2 66 40 64 0.626 -1.77 5.01 Init - 123388 123240 149 1 2 121 84 97 0.680 10.17 5.00 Prom - 137666 137627 40 -4.76 6.04 PlyA - 138777 138772 6 1.05 6.03 Term - 143431 143353 79 0 1 77 38 106 0.823 1.74 6.02 Intr - 147765 147704 62 0 2 77 107 32 0.718 1.53 6.01 Init - 150205 150152 54 1 0 92 90 59 0.814 5.78 6.00 Prom - 166572 166533 40 -1.96 7.00 Prom + 166705 166744 40 -3.86 7.01 Init + 192534 192636 103 2 1 85 84 74 0.818 5.65 7.02 Intr + 192786 192928 143 1 2 85 107 240 0.941 25.67 7.03 Term + 195187 195270 84 0 0 92 44 51 0.665 -1.25 7.04 PlyA + 197807 197812 6 1.05 8.03 PlyA - 198398 198393 6 1.05 8.02 Term - 199362 199279 84 0 0 97 43 101 0.996 4.15 8.01 Init - 200779 200633 147 1 0 97 99 298 0.636 31.89 8.00 Prom - 200827 200788 40 -16.83 9.00 Prom + 200845 200884 40 -16.89 9.01 Init + 200888 200968 81 1 0 53 103 115 0.999 8.49 9.02 Intr + 201064 201236 173 0 2 121 97 359 0.997 38.84 9.03 Intr + 211483 211586 104 1 2 70 98 142 0.974 13.22 9.04 Term + 213034 213041 8 2 2 121 42 0 0.606 -3.27 9.05 PlyA + 214108 214113 6 -0.45 10.04 PlyA - 215804 215799 6 1.05 10.03 Term - 216569 216426 144 2 0 117 53 47 0.825 2.01 10.02 Intr - 218514 218357 158 1 2 51 90 191 0.676 15.33 10.01 Intr - 220194 220103 92 2 2 98 47 103 0.590 6.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 82160 82127 34 2 1 64 96 58 0.861 2.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_1|56_aa WHRVASPSLASRNQNRVYNVSKGLYATVTESNPSSTTNQQLTSGKLLTSLCFSVVS >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_1|171_bp tggcaccgcgtggcatcaccctcattggcgagcaggaatcagaacagagtctacaatgtc agcaaagggctgtatgcgacagtcactgaatccaatcccagctccaccactaatcagcaa ctaacctcgggcaagttgctcacctctctgtgcttcagtgtcgtcagctag >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_2|260_aa MAGPRAPAQGRASCPSRGRSSMPRNFSAASLGPERNAPPPRAPAPAPAARPAPGRPLCTR APPPDTPASLLAPPPAHAGPAPAKSPGSGRRWAGRWRPLAAKRAASRTTANLERTFITIR PDSMQCGLVGKIIKRFEQKGFRLVAMKFLPASEEHLKQHYIDLKDRPFFPGLVKYMNSGP VVAMVWEGLNVVKTGRVMLGETNPADSKPGTIRGDFCIQVGRNIIHGSDSVKSAEKEISL RFKPEELVDYKSCAHDWVYE >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_2|783_bp atggccggaccacgggcgccggctcagggtcgcgctagctgcccgtcccggggccgctcg tctatgccccgcaacttttccgccgcgagcctcggcccggaacggaacgcgccgccgccg cgcgcgcccgcgcccgcgcccgccgcgcgccccgcccccggccgccccctgtgcacgcgc gccccgcccccggacacccccgcgagcttgctggccccgccccctgcgcacgctggtccc gcccccgccaagagcccgggcagtgggcgtcgctgggcggggcggtggcgccccctcgcg gctaagcgggcagcttcccggaccacggccaacctcgagcgcaccttcatcaccatcagg ccggacagcatgcagtgcggcctggtgggcaagatcatcaagcgcttcgagcagaagggg ttccgcctcgtggccatgaagttcctcccggcctctgaagaacacctgaagcagcactac attgacctgaaggaccgcccattcttccctgggctggtgaagtacatgaactcagggccg gtcgtggccatggtctgggaggggctgaacgtcgtgaagacaggccgagtgatgcttggg gagaccaatccagcagattctaagccaggcaccattcgtggggacttttgcattcaggtt ggcaggaacatcattcatggcagtgattcagtaaaaagtgctgaaaaagaaatcagccta cggtttaagcctgaagaactggttgactacaagtcttgtgctcatgactgggtctatgaa taa >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_3|324_aa MSFALTFRSAKGRWIANPSQPCSKASIGLFVPASPPLDPEKVKELQRFITLSKRLLVMTG AGISTESGIPDYRSEKVGLYARTDRRPIQHGDFVRSAPIRQRYWARNFVGWPQFSSHQPN PAHWALSTWEKLGKLYWLVTQNVDALHTKAGSRRLTELHGCMDRAYCSVSVFLGSRVLCL DCGEQTPRGVLQERFQVLNPTWSAEAHGLAPDGDVFLSEEQVRSFQVPTCVQCGGHLKPD VVFFGDTVNPDKVDFVHKRVKEADSLLVVGSSLQVYSGYRFILTAWEKKLPIAILNIGPT RSDDLACLKLNSRCGELLPLIDPC >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_3|975_bp atgagctttgcgttgactttcaggtcagcaaaaggccgttggatcgcaaaccccagccag ccgtgctcgaaagcctccattgggttatttgtgccagcaagtcctcctctggaccctgag aaggtcaaagagttacagcgcttcatcaccctttccaagagactccttgtgatgactggg gcaggaatctccaccgaatcggggataccagactacaggtcagaaaaagtggggctttat gcccgcactgaccgcaggcccatccagcatggtgattttgtccggagtgccccaatccgc cagcggtactgggcgagaaacttcgtaggctggcctcaattctcctcccaccagcctaac cctgcacactgggctttgagcacctgggagaaactcggaaagctgtactggttggtgacc caaaatgtggatgctttgcacaccaaggcggggagtcggcgcctgacagagctccacgga tgcatggacagggcatactgttcagtcagcgtcttccttggttccagggtcctgtgcttg gattgtggggaacagactccccggggggtgctgcaagagcgtttccaagtcctgaacccc acctggagtgctgaggcccatggcctggctcctgatggtgacgtctttctctcagaggag caagtccggagctttcaggtcccaacctgcgttcaatgtggaggccatctgaaaccagat gtcgttttcttcggggacacagtgaaccctgacaaggttgattttgtgcacaagcgtgta aaagaagccgactccctcttggtggtgggatcatccttgcaggtatactctggttacagg tttatcctcactgcctgggagaagaagctcccgattgcaatactgaacattgggcccaca cggtcggatgacttggcgtgtctgaaactgaattctcgttgtggagagttgctgcctttg atagacccatgctga >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_4|217_aa MVPCIPATPAPAVAKRSQETRECRNKDTRQRDKRKDSWARGTTTTKSRRPVVALNVWLHC YLLDTKQKGQGKECESSPMILAAADSGISPRAVWQFRKMIKCVIPGSDPFLEYNNYGCYC GLGGSGTPVDELDKCCQTHDNCYDQAKKLDSCKFLLDNPYTHTYSYSCSGSAITCSSKNK ECEAFICNCDRNAAICFSKAPYNKAHKNLDTKKYCQS >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_4|654_bp atggtgccctgcatcccagccactccagctccagctgtggctaaaaggagccaagagacg agagagtgtagaaataaagacacaagacaaagagataaaagaaaagacagctgggcccgg ggaaccactaccaccaagtcacggagaccggtagtggccctgaatgtctggctgcactgt tatttattggatacaaagcaaaaggggcagggtaaagagtgcgagtcatctccaatgata ctggccgccgccgacagcggcatcagccctcgggccgtgtggcagttccgcaaaatgatc aagtgcgtgatcccggggagtgaccccttcttggaatacaacaactacggctgctactgt ggcttggggggctcaggcacccccgtggatgaactggacaagtgctgccagacacatgac aactgctatgaccaggccaagaagctggacagctgtaaatttctgctggacaacccgtac acccacacctattcatactcgtgctctggctcggcaatcacctgtagcagcaaaaacaaa gagtgtgaggccttcatttgcaactgcgaccgcaacgctgccatctgcttttcaaaagct ccatataacaaggcacacaagaacctggacaccaagaagtattgtcagagttga >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_5|379_aa MADPGPAGPPRSPGPRPLRPGARRSRGPFVSLLLPQQDVHRGTQLADYAGPARPSASRGP GGRQEAQRERGEGEGLREYFGQFGEVKECLVMRDPLTKRSRGFGFVTFMDQAGVDKVLAQ SRHELDSKTIDPKVAFPRRAQPKMVTRTKKIFVGGLSVNTTVEDVKQYFEQFGKVDDAML MFDKTTNRHRGFGFVTFESEDIVEKVCEIHFHEINNKMVECKKAQPKEVMSPTGSARGRS RVMPYGMDAFMLGIGMLGYPGFQATTYASRSYTGLAPGYTYQFPGQDTDGVAQAIPLTAY GPMAAAAAAAAVVRGTGSTPSRTGGFLGTTSPGPMAELYGAANQDSGVSSYISAASPAPS TGFGHSLGRPSLQLTEDHE >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_5|1140_bp atggccgatccgggtccggccgggcctccccggagcccgggcccgcgccccctgcgccct ggcgcccggcgctcacgcgggccctttgtgtctctcctcctcccgcagcaagatgttcat cgggggactcagttggcagactacgcaggcccagctcggccctcggcttcccggggcccc ggtgggcgccaggaggctcagcgggaacggggcgagggcgaagggctgcgcgaatacttc ggccagttcggggaggtgaaggagtgtctggtgatgcgggaccccctgaccaagagatcc aggggtttcggcttcgtcactttcatggaccaggcgggggtggataaagtgctggcgcaa tcgcggcacgagctcgactccaaaacaattgaccctaaggtggccttccctcggcgagca cagcccaagatggtgactcgaacgaagaagatctttgtgggggggctgtcggtgaacacc acggtggaggacgtgaagcaatattttgagcagtttgggaaggtggacgacgccatgctg atgtttgacaaaaccaccaaccggcaccgagggttcgggtttgtcacgtttgagagtgag gacatcgtggagaaagtgtgtgaaattcattttcatgaaatcaacaacaaaatggtggaa tgtaagaaagctcagccaaaggaggtgatgtcgccaacgggctcagcccgggggaggtct cgagtcatgccctacggaatggacgccttcatgctgggcatcggcatgctgggttaccca ggtttccaagccacaacctacgccagccggagttatacaggcctcgcccctggctacacc taccagttccccggtcaggacacagatggtgtggcccaagccattcctctcactgcctac ggaccaatggcggcggcagcggcggcagcggctgtggttcgagggacaggttcgactccc agccgcacagggggcttcctggggaccaccagccccggccccatggccgagctctacggg gcggccaaccaggactcgggggtcagcagttacatcagcgccgccagccctgcccccagc accggcttcggccacagtcttgggcgccccagcctgcagctgactgaggaccacgagtga >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_6|64_aa MGRARWLTPVIPALLEAEVPIMHPLSACSLCHPVNALVRSRDADWLRAGPWAQRCVSIIV RDCE >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_6|195_bp atgggccgggcacggtggctcacacctgtaatcccagcacttttggaggctgaggtcccc atcatgcacccgctcagtgcttgttctctctgccatcctgtcaatgcccttgtgagatca cgtgatgccgactggctccgagctgggccctgggctcagcgctgtgtgagcatcattgta cgggactgtgaatag >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_7|109_aa MAVVGVSSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLK SHHGEHERPEFIAYPHLRIRTKKLAFELQVSVQVEKKLALVISVSTVKV >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_7|330_bp atggcggtagttggtgtgtcctcggtttctcggctgctgggtcggtcccgcccacagctg gggcggcctatgtcgagtggcgcccatggcgaagagggctcagctcgcatgtggaagact ctcaccttcttcgtcgcgctccccggggtggcagtcagcatgctgaatgtgtacctgaag tcgcaccacggagagcacgagagacccgagttcatcgcctacccccatctccgcatcagg accaagaaattagcatttgagcttcaagtcagtgtccaagttgaaaagaaattggcatta gttatttctgtttccacagtgaaggtctag >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_8|76_aa MNSVGEACTDMKREYDQCFNRWFAEKFLKGDSSGDPCTDLFKRYQQCVQKAIKEKEIPIE GLEFMGHGKEKPENSS >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_8|231_bp atgaacagtgtgggggaggcatgcacggacatgaagcgcgagtacgaccagtgcttcaat cgctggttcgccgagaaatttctcaagggggacagctccggggacccgtgcaccgacctc ttcaagcgctaccagcagtgtgttcagaaagcaataaaggagaaagagattcctattgaa ggactggagttcatgggccatggcaaagaaaagcctgaaaattcttcttga >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_9|121_aa MWSRLVWLGLRAPLGGRQGFTSKADPQGSGRITAAVIEHLERLALVDFGSREAVARLEKA IAFADRLRAVDTDGVEPMESVLEDRCLYLRSDNVVEGNCADELLQNSHRVVEEYFVAPPG R >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_9|366_bp atgtggtcgcggttggtgtggctgggccttcgggcccctctgggcgggcgccagggcttc acctccaaggcggatcctcagggcagtggccggatcacggctgcggtgatcgagcacctg gagcgtctagcgcttgtggacttcggcagccgcgaggcagtggcgcgactggagaaagct atcgccttcgccgaccggctacgcgccgtggacacagacggggtggagcccatggaatcg gtcctggaggacagatgtctatacctgagatccgacaatgtggtagaaggcaactgtgct gatgaattactacaaaactcccatcgcgtcgtggaggagtactttgtggcccccccaggt aggtga >gi568815586r:120245594_120469091|GENSCAN_predicted_peptide_10|131_aa XDAEDAIYGRNGYDYGQCRLRVEFPRTYGGRGSWQDLKDHMREAGDVCYADVQKDGVGMV EYLRKEDMEYALRKLDDTKFRSHEGETSYIRVYPERSTSYGYSRSRSGSRGRDSPYQSRG SPHYFSPFRPY >gi568815586r:120245594_120469091|GENSCAN_predicted_CDS_10|396_bp nnagatgcagaggatgctatttatggaagaaatggttatgattatggccagtgtcggctt cgtgtggagttccccaggacttatggaggtcggggcagctggcaggacctgaaggatcac atgcgagaagctggggatgtctgttatgctgatgtgcagaaggatggagtggggatggtc gagtatctcagaaaagaagacatggaatatgccctgcgtaaactggatgacaccaaattc cgctctcatgagggtgaaacttcctacatccgagtttatcctgagagaagcaccagctat ggctactcacggtctcggtctgggtcaaggggccgtgactctccataccaaagcaggggt tccccacactacttctctcctttcaggccctactga