GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:18:54 Sequence gi568815591f:112350689_112575516 : 224828 bp : 40.46% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12940 13036 97 0 1 55 94 84 0.644 4.36 1.02 Intr + 14269 14415 147 2 0 104 48 87 0.497 5.59 1.03 Intr + 14633 14814 182 0 2 83 87 43 0.533 2.37 1.04 Intr + 15528 15596 69 2 0 55 94 74 0.524 3.16 1.05 Intr + 19631 19850 220 1 1 53 63 117 0.378 2.65 1.06 Intr + 22260 22373 114 1 0 47 85 76 0.506 2.70 1.07 Intr + 24204 24423 220 1 1 98 22 119 0.132 2.74 1.08 Intr + 42729 42827 99 0 0 56 75 79 0.000 1.71 1.09 Intr + 47514 47559 46 0 1 87 92 33 0.302 1.09 1.10 Term + 48103 48195 93 0 0 153 42 29 0.781 1.65 1.11 PlyA + 48325 48330 6 1.05 2.00 Prom + 52131 52170 40 -5.25 2.01 Init + 57486 57639 154 2 1 75 82 83 0.083 6.59 2.02 Intr + 58860 59036 177 1 0 40 92 87 0.064 3.27 2.03 Intr + 66928 67058 131 2 2 91 72 29 0.031 1.09 2.04 Term + 70057 70149 93 0 0 86 36 133 0.901 4.75 2.05 PlyA + 70253 70258 6 1.05 3.00 Prom + 76300 76339 40 -4.55 3.01 Init + 100001 100094 94 1 1 98 110 158 0.999 19.59 3.02 Intr + 105075 105179 105 1 0 118 60 107 0.998 10.17 3.03 Intr + 105314 105398 85 0 1 75 80 114 0.973 7.26 3.04 Intr + 106226 106350 125 2 2 13 90 77 0.965 -0.29 3.05 Intr + 108173 108330 158 0 2 88 93 112 0.957 10.51 3.06 Intr + 117293 117427 135 1 0 57 80 118 0.701 7.74 3.07 Intr + 121531 121659 129 0 0 39 111 80 0.973 5.37 3.08 Intr + 122078 122173 96 1 0 41 73 160 0.998 9.09 3.09 Term + 124742 124831 90 1 0 101 33 121 0.993 4.64 3.10 PlyA + 125479 125484 6 1.05 4.09 PlyA - 125492 125487 6 -0.45 4.08 Term - 126167 126098 70 1 1 82 44 82 0.028 -0.37 4.07 Intr - 131194 131112 83 1 2 91 103 34 0.033 2.72 4.06 Intr - 141507 141446 62 1 2 105 96 31 0.425 3.13 4.05 Intr - 141963 141872 92 2 2 92 80 72 0.756 5.52 4.04 Intr - 147486 147399 88 1 1 122 38 116 0.835 8.11 4.03 Intr - 155312 155190 123 0 0 78 69 53 0.577 2.04 4.02 Intr - 157177 157012 166 1 1 6 54 104 0.068 -2.59 4.01 Init - 161869 161672 198 1 0 53 86 155 0.740 10.75 4.00 Prom - 164076 164037 40 -7.85 5.00 Prom + 168325 168364 40 -7.15 5.01 Init + 169800 170183 384 2 0 57 59 364 0.600 27.28 5.02 Intr + 170246 170337 92 1 2 -18 24 265 0.103 7.37 5.03 Intr + 172029 172076 48 0 0 101 106 26 0.116 2.58 5.04 Intr + 188220 188375 156 0 0 56 48 161 0.175 7.10 5.05 Intr + 196466 196661 196 2 1 44 57 176 0.027 8.60 5.06 Term + 220511 220615 105 1 0 97 31 92 0.018 1.83 5.07 PlyA + 221262 221267 6 1.05 6.02 PlyA - 221823 221818 6 1.05 6.01 Term - 224070 223889 182 1 2 57 36 155 0.827 3.99 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 39696 39526 171 1 0 81 71 111 0.813 7.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_1|428_aa GIGSECGRRNTKGVGAELCHGSSVLEPDGSLPEHTMGFHSVNALHPRDWHPILLEECYGP GASAVPSVGCFHAPSGWQLPGGLTGPSVATLLPHWPVGLFGELYHVKGSTLVSINGHSAM VIVSLRPFLFLLKEKAISRSVLATLSGAEGQLLSVFKLHAHTDPSYTFQIHKVLTTGLPP CPRAICVPPTTSPHGHPEATDNYSTTLSYCLFSWATPGTNSHSLDSLEANAEMEFEVQGS ISHNLNDEQDIQKIGGLFKTLPFTSSSLTIGSLALTALATVHLLFLHETESNNPSGVSSD PDKITSPRQYTTKDVLGLIFLLLLPDLLSDPDNYTLANPLNTPPHIKPERILNSVWNFAG TSEMSVGTYCPYLKEQVQIQCGRMKVPGESGFGFVLLEVLDVLLLVPEYCMTFYHVPVSC PLTLVVSV >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_1|1287_bp gggattggcagtgaatgtggaagacgaaatacaaaaggggtgggggcagagctctgtcat gggagctctgtcttggagcctgacggttctctgccagagcacacaatgggctttcattca gtgaatgccttgcatcctagagattggcaccccatccttttagaggaatgctatggccct ggtgcttcagctgtgccctctgtgggctgtttccatgctccctcaggctggcagcttccg ggaggattgacggggcccagtgtagccaccttgctgccacactggccagttggtctcttt ggggaattataccatgttaagggctccacgctggtctctattaatggacattcagcaatg gtgatagttagccttaggccatttctctttctactcaaagaaaaagccatcagcagatca gtcttggccaccctgagtggagctgaagggcagctgctgtctgttttcaaattgcatgca catactgacccatcctacaccttccaaattcacaaagttttaacaactggactgccacct tgccccagggctatctgtgttcctcctactaccagtcctcatggccacccagaggcaact gataactactcaacaaccttatcttattgcctgttttcttgggccactcctggtaccaac tcacatagcttggattctctggaagcaaatgctgagatggagtttgaagtgcaaggctcc atcagccataacctcaatgacgaacaagacatccaaaaaataggaggactgttcaagact ttacccttcacttcctcctcccttactattggtagccttgcacttacagctctagcaact gttcaccttttattcttgcatgaaacagaatctaacaacccttcaggagtttcatcagac cccgacaaaatcacttccccccgacaatatacaaccaaagatgttctaggtttaattttt ctcctcctccttcccgacctcctgagcgacccagataattacactttagccaaccccctc aataccccaccccacattaagccagagcgtatcctgaacagcgtctggaactttgcgggc accagtgaaatgtcagttggaacgtattgcccatacctaaaagagcaggtgcagatacag tgtgggaggatgaaagtaccaggtgagagtgggttcggttttgttttgctggaggttctg gatgtgctcttgcttgttccagaatactgcatgaccttttatcatgtccctgtttcctgt ccactcactctagttgtttcagtctga >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_2|184_aa MDLNIPGSFHWLVSQGFESPSVYKFYITLRLHASESEAPGVKYGVVRQGQVGSSEDGSGL MALCDVNLVHGKQTPLSTIGRNAAGSAIVLCSCSGSSIQSLGSHPFGRESELEKTTLNFI WNQKRACIAKTILSKKNKGGDIMLPDFKLYYKATLAVSGSHDIAQARAMLKVVQTAKDLL SSPF >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_2|555_bp atggacctcaacatcccaggttccttccactggctcgtttctcagggctttgaatctcct tctgtttacaagttttatattactcttaggctacatgcctctgaaagtgaggcccctgga gtgaaatatggtgtggtgagacagggccaggtagggtcctctgaagatggttctggtctg atggccctctgtgatgtcaatttggtccatggaaagcaaacacccttgtccaccattggg aggaatgctgctggctcagccatagttctctgttcttgctctggcagctccatacagtcc ctcggttcacacccttttgggcgagaatcagaactagaaaaaactactttaaatttcata tggaaccaaaaaagagcctgtatagccaagacaatcctaagcaaaaagaacaaaggtgga gacatcatgctacctgacttcaaactatactacaaggctacactggcagtctccggcagc cacgatattgcccaggctcgtgctatgttgaaagtggtccaaactgccaaagatttactc tcctctcccttttag >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_3|338_aa MPKNKKRNTPHRGSSAGGGGSGAAAATAATAGGQHRNVQPFSDEDASIETMSHCSGYSDP SSFAEDGPEVLDEEGTQEDLEYKLKGLIDLTLDKSAKTRQAALEGIKNALASKMLYEFIL ERRMTLTDSIERCLKKGKSDEQRAAAALASVLCIQLGPGIESEEILKTLGPILKKIICDG SASMQARQTDFFYEDMESLTQMLRALATDGNKHRAKVDKRKQRSVFRDVLRAVEERDFPT ETIKFGPERMYIDCWVKKHTYDTFKEVLGSGMQYHLQSNEFLRNVFELGPPVMLDAATLK TMKISRFERHLYNSAAFKARTKARSKCRDKRADVGEFF >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_3|1017_bp atgccgaagaacaagaagcggaacactccccaccgcggtagcagtgctggcggcggcggg tcaggagcagccgcagcgacggcggcgacagcaggtggccagcatcgaaatgttcagcct tttagtgatgaagatgcatcaattgaaacaatgagccattgcagtggttatagcgatcct tccagttttgctgaagatggaccagaagtccttgatgaggaaggaactcaagaagaccta gagtacaagttgaagggattaattgacctaaccctggataagagtgcgaagacaaggcaa gcagctcttgaaggtattaaaaatgcactggcttcaaaaatgctgtatgaatttattctg gaaaggagaatgactttaactgatagcattgaacgctgcctgaaaaaaggtaagagtgat gagcaacgtgcagctgcagcgttagcatctgttctttgtattcagctgggccctggaatt gaaagtgaagagattttgaaaactcttggaccaatcctaaagaaaatcatttgtgatggg tcagctagtatgcaggctaggcaaactgactttttttatgaagacatggagtccttgacg cagatgcttagggccttggcaacagatggaaataaacaccgggccaaagtggacaagaga aagcagcggtcagttttcagagatgtcctgagggcagtggaggaacgggattttccaaca gaaaccattaaatttggtcctgaacgcatgtatattgattgctgggtaaaaaaacacacc tatgacacctttaaggaggttcttggatcagggatgcagtaccacttgcagtcaaatgaa ttccttcgaaatgtatttgaacttggacccccagtgatgcttgatgctgcaacgcttaaa acgatgaagatttctcgtttcgaaaggcatttatataactctgcagccttcaaagctcga accaaagctagaagcaaatgtcgagataagagagcagatgttggagaattcttctag >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_4|293_aa MRSDKNEACFAGELIESFKSQFPIKYAAEQKPAQEVVQQASAFEDSLRKRKDADNFILKD SGTPKQAALAFVYDDILLIFKLNVCRLHSILQTRQEFQPHKDALRTLKRSPQKLSHRFSQ IGFSLHIRLLQGYGKFQETGHLGGSSPFPTPGDTWPARACKEGQQLQSQGEWTLTGRAAS SESTKQWIPPQRLQLSGLTRSQGLLVPGSSKIQQPPGVTAVHVHPVGQALSPGQTDRFLA PSQCANLLIHQDWLKQHPFNRTLSYTLTGNIIYASLPLTATYSKSIKETSLRI >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_4|882_bp atgaggagtgacaagaatgaggcttgctttgctggggaactgatagaatcattcaaatca cagttccctatcaaatatgctgctgaacagaaaccagcccaggaagtagtacaacaggcc tcagcttttgaagatagtctaaggaagagaaaggatgctgataattttattctcaaagat agtgggacccctaagcaggctgccctggcttttgtctacgatgatatcctgttaatattt aagctcaatgtttgccgactccacagcatcctccaaaccaggcaggaatttcagccccac aaagatgctctacgaaccctcaagaggagtcctcaaaaactatcccacaggttctcacag attggtttctccctccatataagactcttacaaggatatgggaagtttcaagaaacaggc catttagggggctcatctcctttccccactccaggagacacttggccagcccgtgcctgc aaggaaggacagcagctgcaaagtcaaggagagtggaccttgactggacgagctgcatca tcagagtccactaagcaatggattcctccacaaaggttgcagctcagtggccttactaga agccagggcctcctagttccaggtagctccaagatccaacagcctccaggtgtcacagca gtgcacgtgcaccctgtgggtcaagcccttagcccagggcagacagacaggttcctggct ccatcgcagtgtgcaaatttgcttattcaccaagattggctaaagcagcatccctttaac agaaccctttcttataccttgactggaaacattatatatgcttccctacccctcacggcg acctacagcaagagtatcaaagaaacaagtctacggatttga >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_5|326_aa MEDSMNMDMSPLRPQTYLFNCEPKADKDDHFKVNDDENKHQLSLRTVSLGAGAKDELHIV EAEAMNYKGSPIKVTLATLKMSVQATVSLGGFEIPLPVVLQSKCGLGPVHISGQHLVAVE EDAESEDEGSMEKSKTCCDDDDDDDDDDEDDEETEEKLRKTFKKWLLIPHAKRTGCCEAD VGHILNSGINSLCFVGAVKLLADSRYECRGGLLCSGHCYQGGFQEDWEALLGEEKWIRVS PNEAVWPWSATAVPWGIPPSANHPVSLAPAGENGRLELQGWRLSPPPTQPTWKSANRNEV SKISISEIEESGVQMKEIEPVHVMKF >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_5|981_bp atggaggattcgatgaacatggatatgagccccttgaggccccagacctatcttttcaat tgtgaaccaaaggccgacaaagatgatcactttaaggtgaatgatgatgaaaataagcac cagttatctttaagaacggtcagtttaggggctggtgcgaaggatgaactgcacattgtt gaagcagaagcaatgaattacaaaggtagtccaattaaagtaacactggcaactttgaaa atgtctgtgcaggcaacagtttcccttgggggctttgaaataccactacctgtggtctta caatcgaagtgtggtttagggccagtgcatattagtggacagcacttagtagctgtggag gaagatgcagaatcagaagatgagggttccatggaaaaaagtaaaacctgctgtgatgat gatgatgatgatgatgatgatgatgaagatgacgaggaaactgaagaaaaactccggaag acttttaagaagtggcttcttattcctcatgcaaagagaactggctgctgtgaagctgac gtgggccacatactgaacagtgggataaattctctctgctttgtaggagctgtgaaactt ctggctgactcgagatatgaatgtagaggaggactgctctgttctggccattgctatcag ggcggcttccaggaagactgggaggccctgcttggtgaggagaaatggatccgggtctca cctaacgaagcagtctggccgtggtctgccacagccgtaccgtggggaattcctcccagt gcaaatcatccggtctccctggcaccggcaggggagaacggccgactggagctgcaggga tggcggctgtccccaccccctacccaaccaacttggaaatctgctaatagaaatgaagtt tcaaaaataagtatcagtgaaattgaagaatctggagtccagatgaaggaaatagagcca gttcatgtaatgaagttctaa >gi568815591f:112350689_112575516|GENSCAN_predicted_peptide_6|60_aa XLYKDSKSDSQRETVRPPGDNQKSSVDHPDDHLLGYDDLYRKMTHSALITSQNRRISRTV >gi568815591f:112350689_112575516|GENSCAN_predicted_CDS_6|183_bp nttttgtacaaagattcaaaatctgactcacaaagagaaacagtgaggccaccaggtgac aatcagaagagctctgttgaccatcccgatgatcatttattgggttatgatgatctgtat aggaaaatgactcactcagctttaataacaagccaaaatagaagaatcagcagaaccgtc taa