GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:58:35 Sequence gi568815584f:19835667_20036599 : 200933 bp : 35.75% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 2948 2943 6 1.05 1.03 Term - 12413 12298 116 0 2 40 44 109 0.402 -0.55 1.02 Intr - 22652 22559 94 2 1 68 88 60 0.226 2.62 1.01 Init - 30121 30053 69 1 0 112 72 6 0.228 2.52 1.00 Prom - 32179 32140 40 -3.65 2.03 PlyA - 32191 32186 6 1.05 2.02 Term - 33377 32833 545 0 2 93 39 214 0.088 10.44 2.01 Init - 51462 51390 73 0 1 81 42 61 0.104 2.08 2.00 Prom - 55107 55068 40 -5.55 3.03 PlyA - 55422 55417 6 1.05 3.02 Term - 59034 58794 241 2 1 124 39 194 0.980 12.61 3.01 Init - 60170 60103 68 2 2 89 70 10 0.687 -0.10 3.00 Prom - 63065 63026 40 -3.65 4.02 PlyA - 63865 63860 6 1.05 4.01 Sngl - 64656 64384 273 0 0 74 44 189 0.954 8.18 4.00 Prom - 66841 66802 40 -8.25 5.00 Prom + 71563 71602 40 -7.05 5.01 Sngl + 75841 76170 330 0 0 79 37 369 0.921 26.67 5.02 PlyA + 76253 76258 6 1.05 6.00 Prom + 77843 77882 40 -8.25 6.01 Sngl + 78706 79110 405 0 0 60 43 202 0.801 8.93 6.02 PlyA + 79361 79366 6 1.05 7.00 Prom + 90965 91004 40 -6.35 7.01 Sngl + 100241 100936 696 1 0 74 50 178 0.378 8.55 7.02 PlyA + 101722 101727 6 1.05 8.00 Prom + 105964 106003 40 -5.25 8.01 Init + 112691 112697 7 1 1 42 119 0 0.508 -0.42 8.02 Term + 120908 121356 449 0 2 122 43 185 0.973 11.79 8.03 PlyA + 121805 121810 6 1.05 9.00 Prom + 128175 128214 40 -3.65 9.01 Init + 131809 131914 106 0 1 48 86 47 0.237 0.93 9.02 Intr + 137749 137921 173 2 2 98 44 54 0.486 0.74 9.03 Intr + 139895 140854 960 1 0 145 9 363 0.293 24.42 9.04 Intr + 145133 145242 110 1 2 93 57 96 0.216 5.16 9.05 Intr + 150047 150305 259 2 1 35 40 165 0.021 2.94 9.06 Intr + 166482 166915 434 2 2 86 -14 192 0.012 0.32 9.07 Term + 167109 167439 331 0 1 44 53 207 0.394 5.94 9.08 PlyA + 168718 168723 6 1.05 10.04 PlyA - 168988 168983 6 1.05 10.03 Term - 179556 178595 962 1 2 65 44 344 0.003 18.93 10.02 Intr - 181096 180988 109 1 1 104 30 31 0.001 -2.16 10.01 Init - 198741 198214 528 0 0 93 83 142 0.151 9.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 73036 73509 474 0 0 56 36 211 0.933 6.60 S.002 Term + 86655 86774 120 2 0 101 36 79 0.857 1.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_1|92_aa MAAVQSSLGWASEGDRAKRARARASTNCHICENARIAMQRNRFLETNRSFQSMGGRKSLG GSCGGVVPDSNKNNDVGGEQKQKTYKSNHTTG >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_1|279_bp atggcggcagtacagtccagcctcggctgggcatcagagggagaccgtgcaaagagggcg agggcgagggcttccaccaactgccacatctgtgaaaatgcaagaattgcaatgcagagg aacagatttctagaaaccaataggtcatttcagagtatgggaggtagaaagagccttgga ggatcctgtggaggagtagtccctgacagtaataagaacaatgatgtaggaggtgagcaa aagcagaaaacttataagagcaatcacaccactggttga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_2|205_aa MAFIKSHKTDVGVDAEKRECSYTVVYAATVLGNLLIVVTIASEPHLHSPMYFLLGNLSFI DMSLASFATPKMIADFLREHKAISFEGCMTQMFFLHLLGGAEIVLLISMSFDRYVAICKP LHYLTIMSRRMCVGLVILSWIVGIFHALSQLAFTVNLPFCGPNEVDSFFCDLPLVIKLAC VDTYILGVFMISTSGMIAWCASSSW >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_2|618_bp atggctttcattaaaagtcataaaacagatgttggtgtggatgcagagaaaagggaatgc tcatacactgttgtctatgcagccactgtgctggggaaccttcttattgtggtcaccatt gcatcagagccacaccttcattcccctatgtactttctgctgggcaatctctccttcatt gacatgtccctggcctcatttgccacccccaaaatgattgcagacttccttagagaacac aaagccatctcttttgaaggctgcatgacccagatgttcttcctacatctcttagggggt gctgagattgtactgctgatctccatgtcctttgataggtacgtggctatctgtaagcct ctacattacctaacaatcatgagccgaagaatgtgtgttgggcttgtgatactttcctgg attgtcggcatcttccatgctctgagtcagttagcatttacagtgaatctgcccttctgt ggacccaatgaagtagacagtttcttttgtgacctccctttggtgattaaacttgcttgt gttgacacatatattctgggggtgttcatgatctcaaccagtggcatgattgcctggtgt gcttcatcctcttggtga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_3|102_aa MKPSNCPPRTMQSWGNLPNKGKEPAWDQGGLALLLVPNSEIQCHLCERKMQVCHAPHSCR SPLSQLKNPALTSEMPTAQMPTPPEHFSCGPDHFRKPNPTGT >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_3|309_bp atgaagccaagtaactgcccacccaggaccatgcagagctgggggaatctccctaacaaa ggaaaagagccagcatgggatcaaggaggtttggcacttttgcttgtccctaacagtgaa atccagtgccacttgtgtgagaggaaaatgcaagtgtgccatgctccccacagctgccga tctccattgtcccagctgaagaatcctgccctcaccagtgaaatgcccacagcacagatg cccacccctcctgagcatttcagctgtggcccagatcacttcagaaaacccaaccccaca ggcacatga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_4|90_aa MTEDKWGESASHGKNRSKKEKEESQILKQPDLTERKLTYHQGDATKTFMRDSPPDSITSN QALPPTLEITFRHEIWRGQTSKLYQVGSRS >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_4|273_bp atgacagaggacaagtggggagaaagtgcatcacatggcaaaaacaggagcaagaaagag aaagaggagtctcagattcttaaacaaccagatctcactgaacgaaaactcacttatcat caaggggatgctactaagacattcatgagggactcacctcctgattcaatcacctccaac caagccctacctccaacattggaaatcacatttagacatgagatttggaggggacaaaca tccaaactatatcaagtgggatcaagatcctga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_5|109_aa MRKKQRRKTGNSKNQSTSPPPKEHSSSPATEQSWMENDFDDLREEGFRRSNYSELKEEVR TNGKEVKNLGKKLDEWLTGITNAEKSLKDLMELKTMTLELCDECTSLSS >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_5|330_bp atgaggaaaaaacagagaagaaaaactggaaattctaaaaatcagagcacctctcctcct ccaaaggaacacagctcctcaccagcaacggaacaaagctggatggagaatgactttgat gacttgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga accaatggcaaagaagttaaaaaccttggaaaaaaactagatgaatggctaactggaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatgacactagaacta tgtgatgaatgcacaagcctcagtagctga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_6|134_aa MSELPFTIASKRINYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KKAILPKVIYRLNVIPIKLPMTFFTELEKTTLKFTWNQKRAHIAKSMLSQKNKAGGITLP DFKLYYKPTVTKTA >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_6|405_bp atgagtgaactcccattcacaattgcttcaaagagaataaactacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaaaggccatactgcccaaggtgatttatagattgaatgtcatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcacatggaaccaaaaaaga gcccacattgccaagtcaatgctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaagcctacagtaaccaaaacagcatga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_7|231_aa MLVDFFIERKTISFEGCMAQIFVLHSFVGSEMMLLVAMAYDRFIAICKPLHYSTIMNRRL CVIFVSISWAVGVLHSVSHLAFTVDLPFCGPNEVDSFFCDLPLVIELACMDTYEMEIMTL TNSGLISLSCFLALIISYTIILIGVRCRSSSGSSKALSTLTAHITVVILFFGPCIYFYIW PFSRLPVDKFLSVFYTVCTPLLNPIIYSLRNEDVKAAMWKLRNRHVNSWKN >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_7|696_bp atgcttgtagacttttttattgagcgcaagactatctcctttgagggttgcatggcccag atattcgttcttcacagttttgttgggagtgagatgatgttgcttgtagctatggcatat gacagatttatagccatatgtaagcctctgcactacagtacaattatgaaccggaggctc tgtgtaatttttgtgtctatttcctgggcggtgggcgttcttcattctgtgagccacttg gcttttacagtggacctgccattctgtggtcccaatgaggtggatagcttcttttgtgac cttcccttggtgatagagctggcttgcatggatacatatgaaatggaaattatgacccta acgaacagtggcctgatatcattgagctgtttcctggctttaattatttcctacaccatc attttgatcggtgtccgatgcaggtcctccagtgggtcatctaaggctctttctacatta actgcccacatcacagtggtcattcttttcttcgggccttgcatttatttctatatatgg ccttttagcagacttcctgtggacaaatttctttctgtgttctacactgtttgtactccc ttgttgaaccccatcatctactctctgaggaatgaagatgttaaagcagccatgtggaag ctgagaaaccgtcatgtgaactcctggaaaaactag >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_8|151_aa MSDNFSSSDSVNGWSNKSVVTEFNLLGLSSSWELQVFFFFIFSVFYGAAVLGNILIIITV IIDSHLHSPMYFLLSNLSSIDVCQATFATPKMIADFLNEHKTTTFQGCMSQIFFLHVFGG SEMVLLVAMAYDRYIAICKPLHYMTIMNRRV >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_8|456_bp atgtctgataacttcagcagctcagattcagttaatggatggagtaataaatcagtggtt actgaattcaatttgttggggctgtctagctcttgggaactccaagtcttctttttcttt atcttctctgtgttttatggagctgcagtgttgggaaacatccttatcatcatcacagta attatagactctcatttgcattccccaatgtactttcttcttagcaatctctcttccatc gatgtgtgtcaggctacatttgccactcccaagatgattgcagacttcctcaacgaacac aagaccaccactttccagggatgcatgtcacaaatctttttcttgcatgtttttgggggt agtgagatggtgcttcttgttgccatggcctatgatagatacattgctatatgcaaacct ctgcactacatgaccatcatgaaccggagggtgtga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_9|790_aa MQHKTKILERIKHKINIMINTFSSMEVLSTSQGSKAPEGLRRTLIPCAYGFALSDGVPMN SYSLILLDFTSIHALGIQCSIYVTGLKGQDFWEVAEIKSLPKSMNETNHSRVTEFVLLGL SSSRELQPFLFLTFSLLYLAILLGNFLIILTVTSDSRLHTPMYFLLANLSFIDVCVASFA TPKMIADFLVERKTISFDACLAQIFFVHLFTGSEMVLLVSMAYDRYVAICKPLHYMTVMS RRVCVVLVLISWFVGFIHTTSQLAFTVNLPFCGPNKVDSFFCDLPLVTKLACIDTYVVSL LIVADSGFLSLSSFLLLVVSYTVILVTVRNRSSASMAKARSTLTAHITVVTLFFGPCIFI YVWPFSSYSVDKVLAVFYTIFTLILNPVIYTLRNKEVKAAMSKLKSRYLKPSQGGNLAFL GDLKGCSELKTFQELTNQSALVHPRADVWSRCGGSTPAENETVYALAFTRGGVQLPLPTQ SGSTLTTEDRLQSCVSCTGGRGSTLTLIIVVAIREADPWPTKALCSELKDEDFTNKVEPD QMDKNQTEVMREFFLSGFSQTPSIEAGLFVLFLFFYMSIWVGNVLIMVTVASDKYLNSSP MYFLLGNLSFLDLCYSTVTTPKLLADFFNHEKLISYDQCIVQLFFLHFVGAAEMFLLTVM AYDRYVAICRPLHYTTVMSRGGLISTISFVVLISSYTTILVKIRSKEGRRKALSTCASHL MVVTLFFGPCIFIYARPFSTFSVDKMVSVLYNVITPMLNPLIYTLRNKEVKSAMQKLWVR NGLTWKKQET >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_9|2373_bp atgcaacacaagactaagatccttgagagaataaaacataagattaatattatgatcaac acattttcctccatggaggtactttccacatctcagggcagcaaagcaccggagggtctc cgtaggacactcattccttgtgcttatggctttgctctttctgatggagtacctatgaac tcctattcactgattttgttagattttacttccattcatgccttagggattcaatgcagt atttatgtcacaggattgaagggtcaggacttctgggaagtagctgaaattaagtccctt ccaaaatcgatgaatgagacaaatcattctcgggtgacagaatttgtgttgctgggactg tctagttcaagggagctccaacctttcttgtttcttacattttcactactttatctagca attctgttgggcaactttctcatcatcctcactgtgacctcagattcccgccttcacacc cccatgtactttctgcttgcaaacctgtcatttatagacgtatgtgttgcctcttttgct acccctaaaatgattgcagactttctggttgagcgcaagactatttcttttgatgcctgc ctggcccagattttctttgttcatctcttcactggcagtgaaatggtgctcctagtttcc atggcctatgaccgttatgttgctatatgcaaacctctccactacatgacagtcatgagc cgtcgtgtatgtgttgtgctcgtcctcatttcatggtttgtgggcttcatccatactacc agccagttggcattcactgttaatctgccattttgtggtcctaataaggtagacagtttt ttctgtgaccttcctctagtgaccaagttagcctgcatagacacttatgttgtcagctta ctaatagttgcagatagtggctttctttctctgagttcctttctcctcttggttgtctcc tacactgtaatacttgttacagttaggaatcgctcctctgcaagcatggcgaaggcccgc tccacattgactgctcacatcactgtggtcactttattctttggaccatgcattttcatc tatgtgtggcccttcagcagttactcagttgacaaagtccttgctgtattctacaccatc ttcacgcttattttaaaccctgtaatctacacgctaagaaacaaagaagtgaaggcagct atgtcaaaactgaagagtcggtatctgaagcctagtcaggggggaaacttagcatttctt ggagacctgaaaggatgcagtgagcttaagactttccaagagcttaccaatcagtcagcc cttgttcatccccgagcagatgtatggagcaggtgtggtggatccacccctgctgaaaat gagactgtgtatgctctggctttcacaaggggtggggtccaactcccccttccaacacag agtggcagcaccctgacaacagaggatagactacaaagttgtgtgtcctgtactggagga agaggttctaccctgaccctcattatagtggtagccatcagagaggcagatccatggccc acaaaggcactgtgctcggaactaaaggatgaagattttacaaacaaggtagagccagat caaatggataaaaaccaaacagaagtgatgagagaatttttcttgtcagggttctcacag acaccatctattgaagcagggctatttgtactatttcttttcttctatatgtccatttgg gttggcaatgtcctcatcatggtcacagtagcatctgataaatacctgaattcatcaccc atgtatttccttcttggcaacctctcatttctggacctatgttattcaacagtaacgacc cctaagcttctggctgacttctttaatcatgaaaaactcatttcctatgaccaatgcatt gtgcaactcttcttcctgcattttgtaggggcagctgagatgttcctgctcacagtgatg gcgtacgatcgctatgttgcaatctgtcgcccgctgcactacaccactgtcatgagtcgg ggtgggttgatctccaccatctcctttgtggtgctgatttcctcctacaccactatccta gtcaagattcgctccaaggaaggaaggcgaaaggcactctccacgtgtgcctctcacctc atggtggtaacactgttttttggaccctgtattttcatctacgctcgtcctttctctaca ttttctgtggacaagatggtgtctgtactctacaatgttattaccccaatgctaaacccc ctcatctacacacttcggaacaaagaggtaaagtcagccatgcagaagctctgggtcaga aatgggcttacttggaaaaagcaggagacatga >gi568815584f:19835667_20036599|GENSCAN_predicted_peptide_10|532_aa MAIDRYVAICKPLHYMTIMSPRVLTGLLLSSYAVGFVHSSSQMAFMLTLPFCGPNVIDSF FCDLPLVIKLACKDTYILQLLVIADSGLLSLVCFLLLLVSYGVIIFSVRYRAASRSSKAF STLSAHITVVTLFFAPCVFIYVWPFSRYSVDKILSVFYTIFTPLLNPIIYTLRNQEFADY GPYFQHIEIFYKSVSKSCQICAICKLANYTYPDYKQSLKPEAMDPQNYSLVSEFVLHGLC TSRHLQNFFFIFFFGVYVAIMLGNLLILVTVISDPCLHSSPMYFLLGNLAFLDMWLASFA TPKMIRDFLSDQKLISFGGCMAQIFFLHFTGGAEMVLLVSMAYDRYVAICKPLHYMTLMS WQTCIRLVLASWVVGFVHSISQVAFTVNLPYCGPNEVDSFFCDLPLVIKLACMDTYVLGI IMISDSGLLSLSCFLLLLISYTVILLAIRQRAAGSTSKALSTCSAHIMVVTLFFGPCIFV YVRPFSRFSVDKLLSVFYTIFTPLLNPIIYTLRNEEMKAAMKKLQNRRVTFQ >gi568815584f:19835667_20036599|GENSCAN_predicted_CDS_10|1599_bp atggcaatagacaggtatgttgccatatgcaaacccctccattacatgaccatcatgagc ccacgggtgctcactgggctactgttatcctcctatgcagttggatttgtgcactcatct agtcaaatggctttcatgttgactttgcccttctgtggtcccaatgttatagacagcttt ttctgtgaccttccccttgtgattaaacttgcctgcaaggacacctacatcctacagctc ctggtcattgctgacagtgggctcctgtcactggtctgcttcctcctcttgcttgtctcc tatggagtcataatattctcagttaggtaccgtgctgctagtcgatcctctaaggctttc tccactctctcagctcacatcacagttgtgactctgttctttgctccgtgtgtctttatc tacgtctggcccttcagcagatactcggtagataaaattctttctgtgttttacacaatt ttcacacctctcttaaatcctattatttatacattaagaaatcaagagtttgctgattac ggcccatatttccagcatattgaaatattctataagtctgtaagtaaatcgtgtcagatt tgtgctatctgtaaacttgcaaattatacatatccagattacaaacaaagtctgaaacct gaggcaatggacccacagaactattccttggtgtcagaatttgtgttgcatggactctgc acttcacgacatcttcaaaattttttctttatatttttctttggggtctatgtggccatt atgctgggtaaccttctcattttggtcactgtaatttctgatccctgcctgcactcctcc cctatgtacttcctgctggggaacctagctttcctggacatgtggctggcctcatttgcc actcccaagatgatcagggatttccttagtgatcaaaaactcatctcctttggaggatgt atggctcaaatcttcttcttgcactttactggtggggctgagatggtgctcctggtttcc atggcctatgacagatatgtggccatatgcaaacccttgcattacatgactttgatgagt tggcagacttgcatcaggctggtgctggcttcatgggtcgttggatttgtgcactccatc agtcaagtggctttcactgtaaatttgccttactgtggccccaatgaggtagacagcttc ttctgtgacctccctctggtgatcaaacttgcctgcatggacacctatgtcttgggtata attatgatctcagacagtgggttgctttccttgagctgttttctgctcctcctgatctcc tacaccgtgatcctcctcgctatcagacagcgtgctgccggtagcacatccaaagcactc tccacttgctctgcacatatcatggtagtgacgctgttctttggcccttgcatttttgtt tatgtgcggcctttcagtaggttctctgtggacaagctgctgtctgtgttttataccatt tttactccactcctgaaccccattatctacacattgagaaatgaggagatgaaagcagct atgaagaaactgcaaaaccgacgggtgacttttcaatga