GENSCAN 1.0	Date run: 4-Nov-116	Time: 17:56:10

Sequence gi568815593f:140014182_140215147 : 200966 bp : 42.74% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4092   4189   98  2  2   78   57   136 0.883   9.33
 1.02 Term +  10753  10870  118  1  1   61   54   101 0.452   1.03
 1.03 PlyA +  11393  11398    6                                 1.05

 2.04 PlyA -  13409  13404    6                                 1.05
 2.03 Term -  15669  15614   56  1  2  137   50    21 0.353  -0.06
 2.02 Intr -  16295  16203   93  0  0   85   83    62 0.204   4.42
 2.01 Init -  28888  28189  700  1  1   62  101   927 0.715  84.99
 2.00 Prom -  33567  33528   40                                -7.55

 3.04 PlyA -  34373  34368    6                                 1.05
 3.03 Term -  37060  36936  125  2  2   74   44    81 0.406  -0.13
 3.02 Intr -  42851  42819   33  0  0   90  105    61 0.634   5.28
 3.01 Init -  44310  44220   91  0  1   79   47   108 0.489   4.57
 3.00 Prom -  49506  49467   40                                -6.25

 4.00 Prom +  65498  65537   40                                -4.05
 4.01 Init +  75299  75370   72  1  0   57   98    32 0.088   2.22
 4.02 Intr +  93503  93720  218  1  2   13   68   219 0.317   8.78
 4.03 Intr +  99533  99677  145  2  1  103    3   124 0.539   4.76
 4.04 Term +  99860 100969 1110  1  0  136   43  1596 0.994 150.96
 4.05 PlyA + 101062 101067    6                                 1.05

 5.00 Prom + 102733 102772   40                                -6.95
 5.01 Init + 116522 116628  107  1  2   49   58   114 0.344   4.24
 5.02 Intr + 126501 126664  164  0  2   76   87   125 0.184   9.90
 5.03 Intr + 126838 126853   16  2  1  108   76     9 0.058  -4.62
 5.04 Term + 129571 129751  181  1  1  123   48    96 0.173   5.20
 5.05 PlyA + 130731 130736    6                                -3.64

 6.12 PlyA - 130774 130769    6                                -0.45
 6.11 Term - 131985 131724  262  2  1   56   49   143 0.876   1.21
 6.10 Intr - 132303 132158  146  0  2   50   80   153 0.716   8.86
 6.09 Intr - 132745 132531  215  2  2   81   24   148 0.763   5.21
 6.08 Intr - 138498 138376  123  1  0   74   85    94 0.974   7.34
 6.07 Intr - 139420 139166  255  2  0   80   81   178 0.984  12.79
 6.06 Intr - 139945 139874   72  2  0   72   82    73 0.756   3.56
 6.05 Intr - 140515 140460   56  0  2   58   51    66 0.792  -2.40
 6.04 Intr - 141583 141415  169  2  1    5   41   195 0.754   4.58
 6.03 Intr - 143244 143127  118  0  1   86   43    57 0.351   0.12
 6.02 Intr - 145543 145418  126  1  0   87   84     9 0.497   0.36
 6.01 Init - 146020 145943   78  1  0   45  103    58 0.952   4.11
 6.00 Prom - 147124 147085   40                                -4.75

 7.08 PlyA - 148415 148410    6                                 1.05
 7.07 Term - 149053 148986   68  2  2  116   50    39 0.267   0.02
 7.06 Intr - 152251 151972  280  1  1  102   86    43 0.403   1.63
 7.05 Intr - 157998 157958   41  1  2  103   81    44 0.490   2.22
 7.04 Intr - 159240 159150   91  0  1   20   90    95 0.520   1.55
 7.03 Intr - 161134 161009  126  1  0   71   61   118 0.756   7.36
 7.02 Intr - 161603 161369  235  1  1   68   76   233 0.709  16.87
 7.01 Init - 169614 169532   83  0  2   54  115    41 0.313   3.89
 7.00 Prom - 173373 173334   40                                -5.05

 8.00 Prom + 176737 176776   40                                -5.45
 8.01 Init + 177253 177262   10  0  1   91   98     5 0.801   2.46
 8.02 Intr + 180265 180471  207  2  0  114   89   156 0.965  16.43
 8.03 Intr + 182408 182544  137  0  2   36   86    78 0.466   1.77
 8.04 Term + 193448 193609  162  1  0  100   35    66 0.193  -0.55
 8.05 PlyA + 194541 194546    6                                 1.05

 9.03 PlyA - 196331 196326    6                                 1.05
 9.02 Term - 197110 196985  126  1  0   37   47    60 0.223  -5.90
 9.01 Intr - 200907 200629  279  0  0   10   41   242 0.612   8.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_1|71_aa
MARHNQVKCGDVIENSNEVEERSQSASLQTRDGLKHSLHKLSQLKKRLQFAENKSPECKY
NLLKAIQKAVD
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_1|216_bp
atggcccggcataaccaggtcaaatgtggggacgtgattgaaaacagcaatgaagtagaa
gagaggtctcaaagtgccagcctgcagacgagggatgggctcaaacactctcttcacaag
ctctcacagctgaagaagagactacaatttgctgaaaacaaaagcccagaatgtaagtac
aacctgctcaaggccatacagaaagcagtggactga
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_2|282_aa
MRQVCCSALPPPPLEKGRCSSYSDSSSSSSERSSSSSSSSSESGSSSRSSSNNSSISRPA
APPEPRPQQQPQPRSPAARRAAARSRAAAAGGMRRDPAPGFSMLLFGVSLACYSPSLKSV
QDQAYKAPVVVEGKVQGLVPAGGSSSNSTREPPASGRVALVKVLDKWPLRSGGLQREQVI
SVGSCVPLERNQRYIFFLEPTEQPLVFKTAFAPLDTNGKNLKKEVGKILCTDCDNVGFAL
LLCGVTASGPLVNGLPYIQSLSYRGHSITMSSFDHFVEGTVP
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_2|849_bp
atgcggcaggtttgctgctcagcgctgccgccgccgccactggagaagggtcggtgcagc
agctacagcgacagcagcagcagcagcagcgagaggagcagcagcagcagcagcagcagc
agcgagagcggcagcagcagcaggagcagcagcaacaacagcagcatctctcgtcccgct
gcgcccccagagccgcggccgcagcaacagccgcagccccgcagccccgcagcccggaga
gccgccgcccgttcgcgagccgcagccgccggcggcatgaggcgcgacccggcccccggc
ttctccatgctgctcttcggtgtgtcgctcgcctgctactcgcccagcctcaagtcagtg
caggaccaggcgtacaaggcacccgtggtggtggagggcaaggtacaggggctggtccca
gccggcggctccagctccaacagcacccgagagccgcccgcctcgggtcgggtggcgttg
gtaaaggtgctggacaagtggccgctccggagcggggggctgcagcgcgagcaggtgatc
agcgtgggctcctgtgtgccgctcgaaaggaaccagcgctacatctttttcctggagccc
acggaacagcccttagtctttaagacggcctttgcccccctcgataccaacggcaaaaat
ctcaagaaagaggtgggcaagatcctgtgcactgactgcgacaatgtgggctttgcgctt
ctcctgtgtggtgttacagcctcagggcctctggtgaatggtctgccttacatccagagc
ttgtcgtacagaggtcattcaatcaccatgtcttcttttgaccactttgtagagggtaca
gtcccttag
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_3|82_aa
MASLSWLAVCVGSQLEAQPGLPARVLGFPLYDEKPELMDNARLKALYKEIIIIIPDGTTQ
TEGLLPFSANWGVEAPRLQTQP
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_3|249_bp
atggcctcactctcatggctggcagtttgtgttggcagccagctggaagctcagccgggg
ctgccagccagagtccttggttttcctctatatgatgaaaaacctgagctgatggacaat
gcccgtctcaaagcactttacaaagaaataattattattatcccagatggaacaactcag
acagaggggctgctgcccttttcagcaaattggggagtggaagcccctcgactccaaaca
cagccatag
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_4|514_aa
MLISKRKEERDSKMKTLTKITRCTLKRRDARAKGASHGVGRRGGERGARADVAWSHARSA
GLRADVWLPRLAVCGGGGGGERIPQGKEEEEGEPRGLPRPPRVCPTATATVVVVVVVVVA
VVAAAAAAAAATLRVLLPLPHPARVISREWLTGCGGCGGSRRSRGGKAAAAEATEAAGGA
AGGGGAAAERSIMADRDSGSEQGGAALGSGGSLGHPGSGSGSGGGGGGGGGGGGSGGGGG
GAPGGLQHETQELASKRVDIQNKRFYLDVKQNAKGRFLKIAEVGAGGNKSRLTLSMSVAV
EFRDYLGDFIEHYAQLGPSQPPDLAQAQDEPRRALKSEFLVRENRKYYMDLKENQRGRFL
RIRQTVNRGPGLGSTQGQTIALPAQGLIEFRDALAKLIDDYGVEEEPAELPEGTSLTVDN
KRFFFDVGSNKYGVFMRVSEVKPTYRNSITVPYKVWAKFGHTFCKYSEEMKKIQEKQREK
RAACEQLHQQQQQQQEETAAATLLLQGEEEGEED
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_4|1545_bp
atgctgatttcaaagagaaaagaagaaagggacagcaaaatgaaaacactgacaaaaata
acaagatgcacactaaaacggagggacgcgagagcgaagggcgcgagccacggagttggg
aggaggggtggggaacgcggagcgcgcgctgacgtcgcctggagtcacgcacggagcgcc
gggttacgcgccgacgtctggctgccacgacttgccgtctgcggcggcggcggcggcggc
gagcggatcccgcaggggaaggaggaggaggagggagagccaagggggctgccgaggccg
cctagggtctgcccaacagcgacagccacggtggtggtggtggtggtggtggtggtagca
gtggtggcggcagcagcggcagcagcagctgcgacgctgcgcgtcctgctccctctcccc
cacccagccagggttatctcgcgagagtggctgactggctgtgggggttgcggcggcagc
aggcggagccggggagggaaagcagcggcggctgaggcgactgaggcggcgggcggagcg
gcaggcggcggcggcgcggcagcggagcgcagcatcatggcggaccgagacagcggcagc
gagcagggtggtgcggcgctgggttcgggcggctccctggggcaccccggctcgggctca
ggctccggcgggggcggtggtggcggcgggggcggcggcggcagtggcggcggcggcggc
ggggccccaggggggctgcagcacgagacgcaggagctggcctccaagcgggtggacatc
cagaacaagcgcttctacctggacgtgaagcagaacgccaagggccgcttcctgaagatc
gccgaggtgggcgcgggcggcaacaagagccgccttactctctccatgtcagtggccgtg
gagttccgcgactacctgggcgacttcatcgagcactacgcgcagctgggccccagccag
ccgccggacctggcccaggcgcaggacgagccgcgccgggcgctcaaaagcgagttcctg
gtgcgcgagaaccgcaagtactacatggatctcaaggagaaccagcgcggccgcttcctg
cgcatccgccagacggtcaaccgggggcctggcctgggctccacgcagggccagaccatt
gcgctgcccgcgcaggggctcatcgagttccgtgacgctctggccaagctcatcgacgac
tacggagtggaggaggagccggccgagctgcccgagggcacctccttgactgtggacaac
aagcgcttcttcttcgatgtgggctccaacaagtacggcgtgtttatgcgagtgagcgag
gtgaagcccacctatcgcaactccatcaccgtgccctacaaggtgtgggccaagttcgga
cacaccttctgcaagtactcggaggagatgaagaagattcaagagaagcagagggagaag
cgggctgcctgtgagcagcttcaccagcagcaacagcagcagcaggaggagaccgccgct
gccaccctgctactgcagggtgaggaagaaggggaagaagattga
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_5|155_aa
MPNGQLDSKRPRGEVLLKGYRRSIPDSQVGTTRQCWDSSIRCVIASVKAKRVRTKVVWLS
DQVSQKPESRKQEPNTGLKFSQRAECALRPGSSEARATMPDKHLTACPAGVVAGESHCSK
PSLWLLALNSQPLTFSTHHLSSPSPPYGSAHIASA
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_5|468_bp
atgcccaatggtcagttggacagcaagaggccccgaggtgaggtcctactgaagggctac
aggaggagcatcccagacagccaggtcggaactacacggcaatgctgggacagcagtatc
cgttgtgttattgccagtgtgaaagccaaaagggtcagaactaaagttgtgtggctgtca
gaccaggtgtcgcagaagccagaaagtaggaaacaggagcccaacacagggttgaagttc
tcccaaagggctgaatgtgctttaaggccaggctcctctgaggccagggccaccatgcct
gataagcacctcacagcttgtcctgctggtgtggttgcaggggagtcccattgctcaaag
ccctctttgtggttgctggctctaaacagtcagcccctaactttctcaactcatcatctc
agcagccccagtcctccctatggaagtgcccacatagccagtgcttga
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_6|539_aa
MAKHRQYRGTSNSSRKWHLDASTEAQPLALPLPRDSLYVESLWVQQHHSGHGCLSLAENG
KVLCLCSGEVTQTYCLRIRLCLKEAAFWILPQTLEAGLLRHRLLGPPGDRAWSTTFPEEV
TSGLGAEGGGRGAIQENPSLKATSPGECCDDPTGPQALPGPVKCSFVTGFCQFGDKEPET
QRVIPANKSDHGYSDVWDDDASQAVPGAAGAVTDANSEPWPRTLWLASRGRSARSAGEPS
SLPGPTNDDTRSLGLMQEDQMSCKGESPSPTSSPAPHPHLTRMQGVLASLQGGATPLPHQ
VAPEAGEEKTLGLQTTVWRILEDLEVPAAAWPGLSPTLAVLRKTHLGSKEQNFSVKTASH
QAERSPAVPRPGSRSLSSAASLLAAAPKDLLIFGPHQELTQRVHPHPEPEDAENGLRTAR
GCFGGVRGYSEKQSCVSLVPSLNSGVLQSQCPSTYGPPPLGLGAQSHREGLRVGLEVSAR
LVCGGGGKLLLPWPGLSGAQCRRARSPRSRRAPHSPPGGAGDTPGAVVAAAAAAAAAAE
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_6|1620_bp
atggcaaaacacaggcaatacagagggaccagcaacagctccaggaagtggcatctggat
gcttccacagaggcccagccactagctctacccctacctagggacagcctttatgtagag
tcactgtgggtccagcagcaccattcaggccatggatgtctttctctggctgagaatggg
aaggttctctgtctctgcagtggggaagtcacccaaacctattgcctccgtattagactc
tgcctgaaggaagccgctttctggatcctaccccagacgctggaggctgggctgctgcgg
catcgtctattaggtccaccaggggacagggcctggtctacgacattccctgaggaagtg
acttctggactgggagctgaaggaggaggaagaggagctatccaagaaaatccctccctt
aaagcaaccagccctggggaatgctgtgatgacccaaccggtcctcaggctcttcctggc
ccagtcaaatgctcctttgtcacaggattctgccagtttggagataaggaacctgagacc
cagagagtaatcccagctaacaaaagcgaccatggctactctgacgtgtgggacgacgat
gctagccaggccgtgccaggggctgctggtgctgttactgatgcaaattctgagccgtgg
ccacgcacactttggctggcctccaggggacggtcagctaggtccgctggggaaccttcc
tccctgcctggtcccactaatgatgacaccaggtccttgggcctgatgcaagaagatcag
atgtcctgcaagggagagtcaccatctcctacctcatccccggcaccccacccccacctt
accagaatgcaaggtgtcctggctagcctgcaaggtggggccacccctctgccccaccaa
gttgccccagaagctggagaggaaaaaactctgggtctccagacaacggtctggagaatc
cttgaggatctggaagtccccgctgctgcctggcctggtttatcaccaactctggcagtt
ttgagaaagactcacttgggttctaaggaacagaatttttctgtgaaaactgcatctcat
caagctgaaagatctccagcagtgccaaggccagggagcaggtctctgtcctcagctgcc
agcttgttggctgcagccccaaaagacctactgatatttggtcctcatcaagagctgacc
cagagagtccatcctcacccagagcccgaggacgcagagaacggcctgcgcaccgcccgc
ggctgcttcggaggggtgagggggtacagcgagaaacagagctgcgtctccctcgtcccc
tccttaaattccggcgtcctccagtcacagtgtcccagcacctacggccccccaccccta
ggcctgggcgcacagagccaccgagagggcttgagggtggggctggaagtctctgcccgg
ctggtctgcggcggcggagggaagttgcttcttccctggccgggcctctccggagcgcag
tgccggcgcgcccggagtccgaggagccggcgggcaccccattcgccccctggcggcgcc
ggggacacgccgggagccgttgtagccgccgcagccgcagccgccgccgcggcggaatga
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_7|307_aa
MYGPLQKLEDCLDAVFPSAFSLWKGPHSYFQSLVTSAPLPPAGLATCPLPPAGPARNRQS
SASEIAAQGEEIQKPPADVPRDGVWRAPLRARRLLVRREPQRRGDPLGRARSAAPHLSSP
RTDWITVGSEPEEPGPARSRLAAPVSEQRLLEKSSIVCLPLKSEGDPCTPQTTSGPSEAF
MERGAMPAALAKEKVQVKRQIYGPIMAPSLISVAPWTGDQESFFCMMTRATPLQFFSCSS
GDTHYLCSQCIQVLILDATFDSLSCFSHSPDESVLVLPPPLPSLEGRESRERACLQVNSG
HDPREQE
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_7|924_bp
atgtatggtcctctgcaaaagctagaggactgcctggatgcagtattcccctctgcattc
agcttatggaagggtccccacagttacttccagtcactagttacttccgctccgctccca
cccgcgggactcgcgacctgccccctgcccccggctggcccagctcggaataggcagagc
agcgcctcagaaatcgcagcccaaggcgaagaaatccagaagcctcccgcggacgtcccc
cgggacggggtgtggcgagccccgctccgagcccgccgactcctcgtgagacgagaaccg
cagcgacgcggagaccctctgggtcgggccaggtccgctgcacctcacctctcctctcca
cgaaccgactggataacggtcgggtctgaacctgaggagcccggaccagcccgcagtaga
cttgccgcacctgtctcagagcaaaggctgttggagaagagcagcattgtctgtttgcct
ctgaagagtgaaggtgatccatgcacaccccaaactacctctggaccatctgaggcgttc
atggaaagaggggccatgccagctgcattggctaaggagaaggtacaggttaaaagacag
atctatggtccgatcatggccccatctcttatttctgtggctccttggactggtgatcag
gagtcatttttctgcatgatgaccagggccactccactgcaattctttagctgcagcagt
ggagacacacactacctttgttcccagtgcatccaggtccttattttagatgctaccttc
gattctctgagctgcttcagccacagtccagatgagagtgtgcttgtcctgccccctccc
ctccccagcctggaaggtagagaaagcagagaaagagcatgcttgcaagttaattcagga
catgatcccagggagcaggaatag
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_8|171_aa
MMRGALYRSPMNQENPPPYPGPGPTAPYPPYPPQPMGPGPMGGPYPPPQGYPYQGYPQYG
WQGGPQEPPKTTVFIMHARKRAGSITSIIGVVGIQACDIRETFSENKPQLLGFLLQNKTS
IPKNLGAFHGLALLLGSVSVSQHHQAPSFHMAFANATSSVWLGKFFAPLLA
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_8|516_bp
atgatgaggggtgcactttacaggtccccgatgaaccaagagaaccctccaccatatcca
ggccctggtccaacggccccatacccaccttatccaccacaaccaatgggtccaggacct
atggggggaccctacccacctcctcaagggtacccctaccaaggatacccacagtacggc
tggcagggtggacctcaggagcctcctaaaaccacagtttttatcatgcatgctcggaag
agggcaggaagtataaccagtataatcggagtagttggtattcaggcctgtgacatcaga
gaaacattttctgaaaataaacctcagcttctgggttttctgcttcaaaataagaccagc
attccaaaaaacttaggggcctttcatggtctggccctgctactaggttcagtttcagtt
tctcagcatcatcaggctccctccttccacatggcttttgcaaatgccacttcctctgtg
tggcttgggaaattctttgctcctctcttggcttag
>gi568815593f:140014182_140215147|GENSCAN_predicted_peptide_9|134_aa
FPLWLTTLTTGKIALGAYGHCATQAAGVVRDIPPYAQALEPGPFLYRNGSLLLGRIYPRG
RSVSPRMVPTPMTYSTGLALQSAADIGRSRQCLLCDHEQVSLSMLQFLHLYDGDKKTYLA
GLFLGLDELIYKSA
>gi568815593f:140014182_140215147|GENSCAN_predicted_CDS_9|405_bp
tttcccctgtggctaaccacactgaccactgggaagattgcactaggtgcctacgggcac
tgtgccacacaggcagctggtgtggtaagagacataccaccctatgcacaggccctggag
cctgggccctttctgtacaggaatggctcattactactgggaagaatatatcctagggga
agatccgtgtcacctcggatggtccccacaccaatgacctacagcactggacttgctctg
cagagtgcagctgacatagggaggtcccgacaatgcctgctctgtgaccatgaacaagtt
agcctttccatgcttcagtttcttcatctatatgatggagataagaagacctaccttgca
ggattgttcttaggactagatgagctaatatacaaaagtgcttga