GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:16:33 Sequence gi568815584f:19820607_20021575 : 200969 bp : 35.83% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 6834 7766 933 2 0 95 48 543 0.478 42.13 1.02 PlyA + 7789 7794 6 1.05 2.06 PlyA - 7834 7829 6 1.05 2.05 Term - 13908 13591 318 0 0 39 47 171 0.472 2.00 2.04 Intr - 15007 14946 62 2 2 74 91 49 0.194 1.33 2.03 Intr - 27473 27405 69 0 0 40 83 94 0.107 2.34 2.02 Intr - 37712 37619 94 2 1 68 88 60 0.228 2.62 2.01 Init - 45181 45113 69 1 0 112 72 6 0.229 2.52 2.00 Prom - 47239 47200 40 -3.65 3.03 PlyA - 47251 47246 6 1.05 3.02 Term - 48437 47893 545 0 2 93 39 214 0.088 10.44 3.01 Init - 66522 66450 73 0 1 81 42 61 0.104 2.08 3.00 Prom - 70167 70128 40 -5.55 4.03 PlyA - 70482 70477 6 1.05 4.02 Term - 74094 73854 241 2 1 124 39 194 0.980 12.61 4.01 Init - 75230 75163 68 2 2 89 70 10 0.687 -0.10 4.00 Prom - 78125 78086 40 -3.65 5.02 PlyA - 78925 78920 6 1.05 5.01 Sngl - 79716 79444 273 0 0 74 44 189 0.954 8.18 5.00 Prom - 81901 81862 40 -8.25 6.00 Prom + 86623 86662 40 -7.05 6.01 Sngl + 90901 91230 330 0 0 79 37 369 0.921 26.67 6.02 PlyA + 91313 91318 6 1.05 7.00 Prom + 92903 92942 40 -8.25 7.01 Sngl + 93766 94170 405 0 0 60 43 202 0.801 8.93 7.02 PlyA + 94421 94426 6 1.05 8.00 Prom + 106025 106064 40 -6.35 8.01 Sngl + 115301 115996 696 1 0 74 50 178 0.378 8.55 8.02 PlyA + 116782 116787 6 1.05 9.00 Prom + 121024 121063 40 -5.25 9.01 Init + 127751 127757 7 1 1 42 119 0 0.508 -0.42 9.02 Term + 135968 136416 449 0 2 122 43 185 0.973 11.79 9.03 PlyA + 136865 136870 6 1.05 10.00 Prom + 143235 143274 40 -3.65 10.01 Init + 146869 146974 106 0 1 48 86 47 0.237 0.93 10.02 Intr + 152809 152981 173 2 2 98 44 54 0.486 0.74 10.03 Intr + 154955 155914 960 1 0 145 9 363 0.293 24.42 10.04 Intr + 160193 160302 110 1 2 93 57 96 0.216 5.16 10.05 Intr + 165107 165365 259 2 1 35 40 165 0.021 2.94 10.06 Intr + 181542 181975 434 2 2 86 -14 192 0.012 0.32 10.07 Term + 182169 182499 331 0 1 44 53 207 0.394 5.94 10.08 PlyA + 183778 183783 6 1.05 11.02 PlyA - 184048 184043 6 1.05 11.01 Sngl - 194410 193655 756 1 0 24 44 400 0.262 23.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 88096 88569 474 0 0 56 36 211 0.933 6.60 S.002 Term + 101715 101834 120 2 0 101 36 79 0.857 1.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_1|310_aa AREMESENRTVIREFILLGLTQSQDIQLLVFVLVLIFYFIILPGNFLIIFTIKSDPGLTA PLYFFLGNLAFLDASYSFIVAPRMLVDFLSAKKIISYRGCITQLFFLHFLGGGEGLLLVV MAFDRYIAICRPLHYPTVMNPRTCYAMMLALWLGGFVHSIIQVVLILRLPFCGPNQLDNF FCDVPQVIKLACTDTFVVELLMVFNSGLMTLLCFLGLLASYAVILCRIRGSSSEAKNKAM STCITHIIVIFFMFGPGIFIYTRPFRAFPADKVVSLFHTVIFPLLNPVIYTLRNQEVKAS MKKVFNKHIA >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_1|933_bp gccagggaaatggaaagcgagaacagaacagtgataagagaattcatcctccttggtctg acccagtctcaagatattcagctcctggtctttgtgctagttttaatattctacttcatc atcctccctggaaattttctcattattttcaccataaagtcagaccctgggctcacagcc cccctctatttctttctgggcaacttggccttcctggatgcatcctactccttcattgtg gctccccggatgttggtggacttcctctctgcgaagaagataatctcctacagaggctgc atcactcagctctttttcttgcacttccttggaggaggggagggattactccttgttgtg atggcctttgaccgctacatcgccatctgccggcctctgcactatcctactgtcatgaac cctagaacctgctatgcaatgatgttggctctgtggcttgggggttttgtccactccatt atccaggtggtcctcatcctccgcttgcctttttgtggcccaaaccagctggacaacttc ttctgtgatgtcccacaggtcatcaagctggcctgcaccgacacatttgtggtggagctt ctgatggtcttcaacagtggcctgatgacactcctgtgctttctggggcttctggcctcc tatgcagtcattctttgtcgcatacgagggtcttcttctgaggcaaaaaacaaggccatg tccacgtgcatcacccatatcattgttatattcttcatgtttggacctggcatcttcatc tacacgcgccccttcagggctttcccagctgacaaggtggtttctctcttccacacagtg atttttcctttgttgaatcctgtcatttatacccttcgcaaccaggaagtgaaagcttcc atgaaaaaggtgtttaataagcacatagcctga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_2|203_aa MAAVQSSLGWASEGDRAKRARARASTNCHICENARIAMQRNRFLETNRSFQSMGGRKSLG GSCGGVVPDSNKNNDVGAIPVQLKTFGGELVEAFRGKKIWSAIAEAKMRVPREPESALQA GKNQAGALGEVSRPGGTKVVSALSDGKTALQNSCPTVPVGLKSPMKASQVWKDGHPWPCS AIDVPATNPLGSISASVLLLPLF >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_2|612_bp atggcggcagtacagtccagcctcggctgggcatcagagggagaccgtgcaaagagggcg agggcgagggcttccaccaactgccacatctgtgaaaatgcaagaattgcaatgcagagg aacagatttctagaaaccaataggtcatttcagagtatgggaggtagaaagagccttgga ggatcctgtggaggagtagtccctgacagtaataagaacaatgatgtaggagccatccca gttcagttaaaaacttttggtggagagctggtggaggcattcagaggaaagaagatatgg tctgccattgcagaagctaagatgcgggtccccagggaacctgagtctgcactgcaagca ggaaaaaaccaggctggagcactgggagaggtcagcagaccagggggtactaaggttgta tctgccctatctgatgggaagactgccctgcagaattcatgtccgacagttcccgtaggg ctaaagtctcctatgaaagcaagtcaagtctggaaggatgggcatccctggccatgctct gctatagatgttcctgcaacaaaccctctgggctccatatcggctagcgtgctgctccta ccactcttctaa >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_3|205_aa MAFIKSHKTDVGVDAEKRECSYTVVYAATVLGNLLIVVTIASEPHLHSPMYFLLGNLSFI DMSLASFATPKMIADFLREHKAISFEGCMTQMFFLHLLGGAEIVLLISMSFDRYVAICKP LHYLTIMSRRMCVGLVILSWIVGIFHALSQLAFTVNLPFCGPNEVDSFFCDLPLVIKLAC VDTYILGVFMISTSGMIAWCASSSW >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_3|618_bp atggctttcattaaaagtcataaaacagatgttggtgtggatgcagagaaaagggaatgc tcatacactgttgtctatgcagccactgtgctggggaaccttcttattgtggtcaccatt gcatcagagccacaccttcattcccctatgtactttctgctgggcaatctctccttcatt gacatgtccctggcctcatttgccacccccaaaatgattgcagacttccttagagaacac aaagccatctcttttgaaggctgcatgacccagatgttcttcctacatctcttagggggt gctgagattgtactgctgatctccatgtcctttgataggtacgtggctatctgtaagcct ctacattacctaacaatcatgagccgaagaatgtgtgttgggcttgtgatactttcctgg attgtcggcatcttccatgctctgagtcagttagcatttacagtgaatctgcccttctgt ggacccaatgaagtagacagtttcttttgtgacctccctttggtgattaaacttgcttgt gttgacacatatattctgggggtgttcatgatctcaaccagtggcatgattgcctggtgt gcttcatcctcttggtga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_4|102_aa MKPSNCPPRTMQSWGNLPNKGKEPAWDQGGLALLLVPNSEIQCHLCERKMQVCHAPHSCR SPLSQLKNPALTSEMPTAQMPTPPEHFSCGPDHFRKPNPTGT >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_4|309_bp atgaagccaagtaactgcccacccaggaccatgcagagctgggggaatctccctaacaaa ggaaaagagccagcatgggatcaaggaggtttggcacttttgcttgtccctaacagtgaa atccagtgccacttgtgtgagaggaaaatgcaagtgtgccatgctccccacagctgccga tctccattgtcccagctgaagaatcctgccctcaccagtgaaatgcccacagcacagatg cccacccctcctgagcatttcagctgtggcccagatcacttcagaaaacccaaccccaca ggcacatga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_5|90_aa MTEDKWGESASHGKNRSKKEKEESQILKQPDLTERKLTYHQGDATKTFMRDSPPDSITSN QALPPTLEITFRHEIWRGQTSKLYQVGSRS >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_5|273_bp atgacagaggacaagtggggagaaagtgcatcacatggcaaaaacaggagcaagaaagag aaagaggagtctcagattcttaaacaaccagatctcactgaacgaaaactcacttatcat caaggggatgctactaagacattcatgagggactcacctcctgattcaatcacctccaac caagccctacctccaacattggaaatcacatttagacatgagatttggaggggacaaaca tccaaactatatcaagtgggatcaagatcctga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_6|109_aa MRKKQRRKTGNSKNQSTSPPPKEHSSSPATEQSWMENDFDDLREEGFRRSNYSELKEEVR TNGKEVKNLGKKLDEWLTGITNAEKSLKDLMELKTMTLELCDECTSLSS >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_6|330_bp atgaggaaaaaacagagaagaaaaactggaaattctaaaaatcagagcacctctcctcct ccaaaggaacacagctcctcaccagcaacggaacaaagctggatggagaatgactttgat gacttgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga accaatggcaaagaagttaaaaaccttggaaaaaaactagatgaatggctaactggaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatgacactagaacta tgtgatgaatgcacaagcctcagtagctga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_7|134_aa MSELPFTIASKRINYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KKAILPKVIYRLNVIPIKLPMTFFTELEKTTLKFTWNQKRAHIAKSMLSQKNKAGGITLP DFKLYYKPTVTKTA >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_7|405_bp atgagtgaactcccattcacaattgcttcaaagagaataaactacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaaaggccatactgcccaaggtgatttatagattgaatgtcatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcacatggaaccaaaaaaga gcccacattgccaagtcaatgctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaagcctacagtaaccaaaacagcatga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_8|231_aa MLVDFFIERKTISFEGCMAQIFVLHSFVGSEMMLLVAMAYDRFIAICKPLHYSTIMNRRL CVIFVSISWAVGVLHSVSHLAFTVDLPFCGPNEVDSFFCDLPLVIELACMDTYEMEIMTL TNSGLISLSCFLALIISYTIILIGVRCRSSSGSSKALSTLTAHITVVILFFGPCIYFYIW PFSRLPVDKFLSVFYTVCTPLLNPIIYSLRNEDVKAAMWKLRNRHVNSWKN >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_8|696_bp atgcttgtagacttttttattgagcgcaagactatctcctttgagggttgcatggcccag atattcgttcttcacagttttgttgggagtgagatgatgttgcttgtagctatggcatat gacagatttatagccatatgtaagcctctgcactacagtacaattatgaaccggaggctc tgtgtaatttttgtgtctatttcctgggcggtgggcgttcttcattctgtgagccacttg gcttttacagtggacctgccattctgtggtcccaatgaggtggatagcttcttttgtgac cttcccttggtgatagagctggcttgcatggatacatatgaaatggaaattatgacccta acgaacagtggcctgatatcattgagctgtttcctggctttaattatttcctacaccatc attttgatcggtgtccgatgcaggtcctccagtgggtcatctaaggctctttctacatta actgcccacatcacagtggtcattcttttcttcgggccttgcatttatttctatatatgg ccttttagcagacttcctgtggacaaatttctttctgtgttctacactgtttgtactccc ttgttgaaccccatcatctactctctgaggaatgaagatgttaaagcagccatgtggaag ctgagaaaccgtcatgtgaactcctggaaaaactag >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_9|151_aa MSDNFSSSDSVNGWSNKSVVTEFNLLGLSSSWELQVFFFFIFSVFYGAAVLGNILIIITV IIDSHLHSPMYFLLSNLSSIDVCQATFATPKMIADFLNEHKTTTFQGCMSQIFFLHVFGG SEMVLLVAMAYDRYIAICKPLHYMTIMNRRV >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_9|456_bp atgtctgataacttcagcagctcagattcagttaatggatggagtaataaatcagtggtt actgaattcaatttgttggggctgtctagctcttgggaactccaagtcttctttttcttt atcttctctgtgttttatggagctgcagtgttgggaaacatccttatcatcatcacagta attatagactctcatttgcattccccaatgtactttcttcttagcaatctctcttccatc gatgtgtgtcaggctacatttgccactcccaagatgattgcagacttcctcaacgaacac aagaccaccactttccagggatgcatgtcacaaatctttttcttgcatgtttttgggggt agtgagatggtgcttcttgttgccatggcctatgatagatacattgctatatgcaaacct ctgcactacatgaccatcatgaaccggagggtgtga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_10|790_aa MQHKTKILERIKHKINIMINTFSSMEVLSTSQGSKAPEGLRRTLIPCAYGFALSDGVPMN SYSLILLDFTSIHALGIQCSIYVTGLKGQDFWEVAEIKSLPKSMNETNHSRVTEFVLLGL SSSRELQPFLFLTFSLLYLAILLGNFLIILTVTSDSRLHTPMYFLLANLSFIDVCVASFA TPKMIADFLVERKTISFDACLAQIFFVHLFTGSEMVLLVSMAYDRYVAICKPLHYMTVMS RRVCVVLVLISWFVGFIHTTSQLAFTVNLPFCGPNKVDSFFCDLPLVTKLACIDTYVVSL LIVADSGFLSLSSFLLLVVSYTVILVTVRNRSSASMAKARSTLTAHITVVTLFFGPCIFI YVWPFSSYSVDKVLAVFYTIFTLILNPVIYTLRNKEVKAAMSKLKSRYLKPSQGGNLAFL GDLKGCSELKTFQELTNQSALVHPRADVWSRCGGSTPAENETVYALAFTRGGVQLPLPTQ SGSTLTTEDRLQSCVSCTGGRGSTLTLIIVVAIREADPWPTKALCSELKDEDFTNKVEPD QMDKNQTEVMREFFLSGFSQTPSIEAGLFVLFLFFYMSIWVGNVLIMVTVASDKYLNSSP MYFLLGNLSFLDLCYSTVTTPKLLADFFNHEKLISYDQCIVQLFFLHFVGAAEMFLLTVM AYDRYVAICRPLHYTTVMSRGGLISTISFVVLISSYTTILVKIRSKEGRRKALSTCASHL MVVTLFFGPCIFIYARPFSTFSVDKMVSVLYNVITPMLNPLIYTLRNKEVKSAMQKLWVR NGLTWKKQET >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_10|2373_bp atgcaacacaagactaagatccttgagagaataaaacataagattaatattatgatcaac acattttcctccatggaggtactttccacatctcagggcagcaaagcaccggagggtctc cgtaggacactcattccttgtgcttatggctttgctctttctgatggagtacctatgaac tcctattcactgattttgttagattttacttccattcatgccttagggattcaatgcagt atttatgtcacaggattgaagggtcaggacttctgggaagtagctgaaattaagtccctt ccaaaatcgatgaatgagacaaatcattctcgggtgacagaatttgtgttgctgggactg tctagttcaagggagctccaacctttcttgtttcttacattttcactactttatctagca attctgttgggcaactttctcatcatcctcactgtgacctcagattcccgccttcacacc cccatgtactttctgcttgcaaacctgtcatttatagacgtatgtgttgcctcttttgct acccctaaaatgattgcagactttctggttgagcgcaagactatttcttttgatgcctgc ctggcccagattttctttgttcatctcttcactggcagtgaaatggtgctcctagtttcc atggcctatgaccgttatgttgctatatgcaaacctctccactacatgacagtcatgagc cgtcgtgtatgtgttgtgctcgtcctcatttcatggtttgtgggcttcatccatactacc agccagttggcattcactgttaatctgccattttgtggtcctaataaggtagacagtttt ttctgtgaccttcctctagtgaccaagttagcctgcatagacacttatgttgtcagctta ctaatagttgcagatagtggctttctttctctgagttcctttctcctcttggttgtctcc tacactgtaatacttgttacagttaggaatcgctcctctgcaagcatggcgaaggcccgc tccacattgactgctcacatcactgtggtcactttattctttggaccatgcattttcatc tatgtgtggcccttcagcagttactcagttgacaaagtccttgctgtattctacaccatc ttcacgcttattttaaaccctgtaatctacacgctaagaaacaaagaagtgaaggcagct atgtcaaaactgaagagtcggtatctgaagcctagtcaggggggaaacttagcatttctt ggagacctgaaaggatgcagtgagcttaagactttccaagagcttaccaatcagtcagcc cttgttcatccccgagcagatgtatggagcaggtgtggtggatccacccctgctgaaaat gagactgtgtatgctctggctttcacaaggggtggggtccaactcccccttccaacacag agtggcagcaccctgacaacagaggatagactacaaagttgtgtgtcctgtactggagga agaggttctaccctgaccctcattatagtggtagccatcagagaggcagatccatggccc acaaaggcactgtgctcggaactaaaggatgaagattttacaaacaaggtagagccagat caaatggataaaaaccaaacagaagtgatgagagaatttttcttgtcagggttctcacag acaccatctattgaagcagggctatttgtactatttcttttcttctatatgtccatttgg gttggcaatgtcctcatcatggtcacagtagcatctgataaatacctgaattcatcaccc atgtatttccttcttggcaacctctcatttctggacctatgttattcaacagtaacgacc cctaagcttctggctgacttctttaatcatgaaaaactcatttcctatgaccaatgcatt gtgcaactcttcttcctgcattttgtaggggcagctgagatgttcctgctcacagtgatg gcgtacgatcgctatgttgcaatctgtcgcccgctgcactacaccactgtcatgagtcgg ggtgggttgatctccaccatctcctttgtggtgctgatttcctcctacaccactatccta gtcaagattcgctccaaggaaggaaggcgaaaggcactctccacgtgtgcctctcacctc atggtggtaacactgttttttggaccctgtattttcatctacgctcgtcctttctctaca ttttctgtggacaagatggtgtctgtactctacaatgttattaccccaatgctaaacccc ctcatctacacacttcggaacaaagaggtaaagtcagccatgcagaagctctgggtcaga aatgggcttacttggaaaaagcaggagacatga >gi568815584f:19820607_20021575|GENSCAN_predicted_peptide_11|251_aa MYFLLGNLAFLDMWLASFATPKMIRDFLSDQKLISFGGCMAQIFFLHFTGGAEMVLLVSM AYDRYVAICKPLHYMTLMSWQTCIRLVLASWVVGFVHSISQVAFTVNLPYCGPNEVDSFF CDLPLVIKLACMDTYVLGIIMISDSGLLSLSCFLLLLISYTVILLAIRQRAAGSTSKALS TCSAHIMVVTLFFGPCIFVYVRPFSRFSVDKLLSVFYTIFTPLLNPIIYTLRNEEMKAAM KKLQNRRVTFQ >gi568815584f:19820607_20021575|GENSCAN_predicted_CDS_11|756_bp atgtacttcctgctggggaacctagctttcctggacatgtggctggcctcatttgccact cccaagatgatcagggatttccttagtgatcaaaaactcatctcctttggaggatgtatg gctcaaatcttcttcttgcactttactggtggggctgagatggtgctcctggtttccatg gcctatgacagatatgtggccatatgcaaacccttgcattacatgactttgatgagttgg cagacttgcatcaggctggtgctggcttcatgggtcgttggatttgtgcactccatcagt caagtggctttcactgtaaatttgccttactgtggccccaatgaggtagacagcttcttc tgtgacctccctctggtgatcaaacttgcctgcatggacacctatgtcttgggtataatt atgatctcagacagtgggttgctttccttgagctgttttctgctcctcctgatctcctac accgtgatcctcctcgctatcagacagcgtgctgccggtagcacatccaaagcactctcc acttgctctgcacatatcatggtagtgacgctgttctttggcccttgcatttttgtttat gtgcggcctttcagtaggttctctgtggacaagctgctgtctgtgttttataccattttt actccactcctgaaccccattatctacacattgagaaatgaggagatgaaagcagctatg aagaaactgcaaaaccgacgggtgacttttcaatga