GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:40:24 Sequence gi568815584f:19875519_20076562 : 201044 bp : 35.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1385 1380 6 1.05 1.02 Term - 19182 18942 241 2 1 124 39 194 0.980 12.61 1.01 Init - 20318 20251 68 2 2 89 70 10 0.687 -0.10 1.00 Prom - 23213 23174 40 -3.65 2.02 PlyA - 24013 24008 6 1.05 2.01 Sngl - 24804 24532 273 0 0 74 44 189 0.954 8.18 2.00 Prom - 26989 26950 40 -8.25 3.00 Prom + 31711 31750 40 -7.05 3.01 Sngl + 35989 36318 330 0 0 79 37 369 0.921 26.67 3.02 PlyA + 36401 36406 6 1.05 4.00 Prom + 37991 38030 40 -8.25 4.01 Sngl + 38854 39258 405 0 0 60 43 202 0.801 8.93 4.02 PlyA + 39509 39514 6 1.05 5.00 Prom + 51113 51152 40 -6.35 5.01 Sngl + 60389 61084 696 1 0 74 50 178 0.378 8.55 5.02 PlyA + 61870 61875 6 1.05 6.00 Prom + 66112 66151 40 -5.25 6.01 Init + 72839 72845 7 1 1 42 119 0 0.508 -0.42 6.02 Term + 81056 81504 449 0 2 122 43 185 0.973 11.79 6.03 PlyA + 81953 81958 6 1.05 7.00 Prom + 88323 88362 40 -3.65 7.01 Init + 91957 92062 106 0 1 48 86 47 0.237 0.93 7.02 Intr + 97897 98069 173 2 2 98 44 54 0.486 0.74 7.03 Intr + 100043 101002 960 1 0 145 9 363 0.293 24.42 7.04 Intr + 105281 105390 110 1 2 93 57 96 0.216 5.16 7.05 Intr + 110195 110453 259 2 1 35 40 165 0.021 2.94 7.06 Intr + 126630 127063 434 2 2 86 -14 192 0.012 0.32 7.07 Term + 127257 127587 331 0 1 44 53 207 0.394 5.94 7.08 PlyA + 128866 128871 6 1.05 8.06 PlyA - 129136 129131 6 1.05 8.05 Term - 139704 138743 962 1 2 65 44 344 0.003 18.93 8.04 Intr - 141244 141136 109 1 1 104 30 31 0.001 -2.16 8.03 Intr - 158789 158362 428 0 2 73 83 143 0.049 4.88 8.02 Intr - 159168 158976 193 0 1 32 74 66 0.105 -2.26 8.01 Init - 164386 163211 1176 1 0 70 90 378 0.138 29.67 8.00 Prom - 169225 169186 40 -5.75 9.05 PlyA - 169563 169558 6 1.05 9.04 Term - 173216 173042 175 1 1 87 49 113 0.533 3.55 9.03 Intr - 181976 181820 157 0 1 41 86 122 0.054 5.45 9.02 Intr - 183568 183492 77 0 2 126 37 1 0.026 -2.96 9.01 Intr - 188590 188415 176 1 2 48 71 116 0.006 3.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 33184 33657 474 0 0 56 36 211 0.933 6.60 S.002 Term + 46803 46922 120 2 0 101 36 79 0.857 1.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_1|102_aa MKPSNCPPRTMQSWGNLPNKGKEPAWDQGGLALLLVPNSEIQCHLCERKMQVCHAPHSCR SPLSQLKNPALTSEMPTAQMPTPPEHFSCGPDHFRKPNPTGT >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_1|309_bp atgaagccaagtaactgcccacccaggaccatgcagagctgggggaatctccctaacaaa ggaaaagagccagcatgggatcaaggaggtttggcacttttgcttgtccctaacagtgaa atccagtgccacttgtgtgagaggaaaatgcaagtgtgccatgctccccacagctgccga tctccattgtcccagctgaagaatcctgccctcaccagtgaaatgcccacagcacagatg cccacccctcctgagcatttcagctgtggcccagatcacttcagaaaacccaaccccaca ggcacatga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_2|90_aa MTEDKWGESASHGKNRSKKEKEESQILKQPDLTERKLTYHQGDATKTFMRDSPPDSITSN QALPPTLEITFRHEIWRGQTSKLYQVGSRS >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_2|273_bp atgacagaggacaagtggggagaaagtgcatcacatggcaaaaacaggagcaagaaagag aaagaggagtctcagattcttaaacaaccagatctcactgaacgaaaactcacttatcat caaggggatgctactaagacattcatgagggactcacctcctgattcaatcacctccaac caagccctacctccaacattggaaatcacatttagacatgagatttggaggggacaaaca tccaaactatatcaagtgggatcaagatcctga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_3|109_aa MRKKQRRKTGNSKNQSTSPPPKEHSSSPATEQSWMENDFDDLREEGFRRSNYSELKEEVR TNGKEVKNLGKKLDEWLTGITNAEKSLKDLMELKTMTLELCDECTSLSS >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_3|330_bp atgaggaaaaaacagagaagaaaaactggaaattctaaaaatcagagcacctctcctcct ccaaaggaacacagctcctcaccagcaacggaacaaagctggatggagaatgactttgat gacttgagagaagaaggcttcagacgatcaaactactccgagctaaaggaggaagttcga accaatggcaaagaagttaaaaaccttggaaaaaaactagatgaatggctaactggaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccatgacactagaacta tgtgatgaatgcacaagcctcagtagctga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_4|134_aa MSELPFTIASKRINYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIV KKAILPKVIYRLNVIPIKLPMTFFTELEKTTLKFTWNQKRAHIAKSMLSQKNKAGGITLP DFKLYYKPTVTKTA >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_4|405_bp atgagtgaactcccattcacaattgcttcaaagagaataaactacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaaaggccatactgcccaaggtgatttatagattgaatgtcatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagttcacatggaaccaaaaaaga gcccacattgccaagtcaatgctaagccaaaagaacaaagctggaggcatcacgctacct gacttcaaactatactacaagcctacagtaaccaaaacagcatga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_5|231_aa MLVDFFIERKTISFEGCMAQIFVLHSFVGSEMMLLVAMAYDRFIAICKPLHYSTIMNRRL CVIFVSISWAVGVLHSVSHLAFTVDLPFCGPNEVDSFFCDLPLVIELACMDTYEMEIMTL TNSGLISLSCFLALIISYTIILIGVRCRSSSGSSKALSTLTAHITVVILFFGPCIYFYIW PFSRLPVDKFLSVFYTVCTPLLNPIIYSLRNEDVKAAMWKLRNRHVNSWKN >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_5|696_bp atgcttgtagacttttttattgagcgcaagactatctcctttgagggttgcatggcccag atattcgttcttcacagttttgttgggagtgagatgatgttgcttgtagctatggcatat gacagatttatagccatatgtaagcctctgcactacagtacaattatgaaccggaggctc tgtgtaatttttgtgtctatttcctgggcggtgggcgttcttcattctgtgagccacttg gcttttacagtggacctgccattctgtggtcccaatgaggtggatagcttcttttgtgac cttcccttggtgatagagctggcttgcatggatacatatgaaatggaaattatgacccta acgaacagtggcctgatatcattgagctgtttcctggctttaattatttcctacaccatc attttgatcggtgtccgatgcaggtcctccagtgggtcatctaaggctctttctacatta actgcccacatcacagtggtcattcttttcttcgggccttgcatttatttctatatatgg ccttttagcagacttcctgtggacaaatttctttctgtgttctacactgtttgtactccc ttgttgaaccccatcatctactctctgaggaatgaagatgttaaagcagccatgtggaag ctgagaaaccgtcatgtgaactcctggaaaaactag >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_6|151_aa MSDNFSSSDSVNGWSNKSVVTEFNLLGLSSSWELQVFFFFIFSVFYGAAVLGNILIIITV IIDSHLHSPMYFLLSNLSSIDVCQATFATPKMIADFLNEHKTTTFQGCMSQIFFLHVFGG SEMVLLVAMAYDRYIAICKPLHYMTIMNRRV >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_6|456_bp atgtctgataacttcagcagctcagattcagttaatggatggagtaataaatcagtggtt actgaattcaatttgttggggctgtctagctcttgggaactccaagtcttctttttcttt atcttctctgtgttttatggagctgcagtgttgggaaacatccttatcatcatcacagta attatagactctcatttgcattccccaatgtactttcttcttagcaatctctcttccatc gatgtgtgtcaggctacatttgccactcccaagatgattgcagacttcctcaacgaacac aagaccaccactttccagggatgcatgtcacaaatctttttcttgcatgtttttgggggt agtgagatggtgcttcttgttgccatggcctatgatagatacattgctatatgcaaacct ctgcactacatgaccatcatgaaccggagggtgtga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_7|790_aa MQHKTKILERIKHKINIMINTFSSMEVLSTSQGSKAPEGLRRTLIPCAYGFALSDGVPMN SYSLILLDFTSIHALGIQCSIYVTGLKGQDFWEVAEIKSLPKSMNETNHSRVTEFVLLGL SSSRELQPFLFLTFSLLYLAILLGNFLIILTVTSDSRLHTPMYFLLANLSFIDVCVASFA TPKMIADFLVERKTISFDACLAQIFFVHLFTGSEMVLLVSMAYDRYVAICKPLHYMTVMS RRVCVVLVLISWFVGFIHTTSQLAFTVNLPFCGPNKVDSFFCDLPLVTKLACIDTYVVSL LIVADSGFLSLSSFLLLVVSYTVILVTVRNRSSASMAKARSTLTAHITVVTLFFGPCIFI YVWPFSSYSVDKVLAVFYTIFTLILNPVIYTLRNKEVKAAMSKLKSRYLKPSQGGNLAFL GDLKGCSELKTFQELTNQSALVHPRADVWSRCGGSTPAENETVYALAFTRGGVQLPLPTQ SGSTLTTEDRLQSCVSCTGGRGSTLTLIIVVAIREADPWPTKALCSELKDEDFTNKVEPD QMDKNQTEVMREFFLSGFSQTPSIEAGLFVLFLFFYMSIWVGNVLIMVTVASDKYLNSSP MYFLLGNLSFLDLCYSTVTTPKLLADFFNHEKLISYDQCIVQLFFLHFVGAAEMFLLTVM AYDRYVAICRPLHYTTVMSRGGLISTISFVVLISSYTTILVKIRSKEGRRKALSTCASHL MVVTLFFGPCIFIYARPFSTFSVDKMVSVLYNVITPMLNPLIYTLRNKEVKSAMQKLWVR NGLTWKKQET >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_7|2373_bp atgcaacacaagactaagatccttgagagaataaaacataagattaatattatgatcaac acattttcctccatggaggtactttccacatctcagggcagcaaagcaccggagggtctc cgtaggacactcattccttgtgcttatggctttgctctttctgatggagtacctatgaac tcctattcactgattttgttagattttacttccattcatgccttagggattcaatgcagt atttatgtcacaggattgaagggtcaggacttctgggaagtagctgaaattaagtccctt ccaaaatcgatgaatgagacaaatcattctcgggtgacagaatttgtgttgctgggactg tctagttcaagggagctccaacctttcttgtttcttacattttcactactttatctagca attctgttgggcaactttctcatcatcctcactgtgacctcagattcccgccttcacacc cccatgtactttctgcttgcaaacctgtcatttatagacgtatgtgttgcctcttttgct acccctaaaatgattgcagactttctggttgagcgcaagactatttcttttgatgcctgc ctggcccagattttctttgttcatctcttcactggcagtgaaatggtgctcctagtttcc atggcctatgaccgttatgttgctatatgcaaacctctccactacatgacagtcatgagc cgtcgtgtatgtgttgtgctcgtcctcatttcatggtttgtgggcttcatccatactacc agccagttggcattcactgttaatctgccattttgtggtcctaataaggtagacagtttt ttctgtgaccttcctctagtgaccaagttagcctgcatagacacttatgttgtcagctta ctaatagttgcagatagtggctttctttctctgagttcctttctcctcttggttgtctcc tacactgtaatacttgttacagttaggaatcgctcctctgcaagcatggcgaaggcccgc tccacattgactgctcacatcactgtggtcactttattctttggaccatgcattttcatc tatgtgtggcccttcagcagttactcagttgacaaagtccttgctgtattctacaccatc ttcacgcttattttaaaccctgtaatctacacgctaagaaacaaagaagtgaaggcagct atgtcaaaactgaagagtcggtatctgaagcctagtcaggggggaaacttagcatttctt ggagacctgaaaggatgcagtgagcttaagactttccaagagcttaccaatcagtcagcc cttgttcatccccgagcagatgtatggagcaggtgtggtggatccacccctgctgaaaat gagactgtgtatgctctggctttcacaaggggtggggtccaactcccccttccaacacag agtggcagcaccctgacaacagaggatagactacaaagttgtgtgtcctgtactggagga agaggttctaccctgaccctcattatagtggtagccatcagagaggcagatccatggccc acaaaggcactgtgctcggaactaaaggatgaagattttacaaacaaggtagagccagat caaatggataaaaaccaaacagaagtgatgagagaatttttcttgtcagggttctcacag acaccatctattgaagcagggctatttgtactatttcttttcttctatatgtccatttgg gttggcaatgtcctcatcatggtcacagtagcatctgataaatacctgaattcatcaccc atgtatttccttcttggcaacctctcatttctggacctatgttattcaacagtaacgacc cctaagcttctggctgacttctttaatcatgaaaaactcatttcctatgaccaatgcatt gtgcaactcttcttcctgcattttgtaggggcagctgagatgttcctgctcacagtgatg gcgtacgatcgctatgttgcaatctgtcgcccgctgcactacaccactgtcatgagtcgg ggtgggttgatctccaccatctcctttgtggtgctgatttcctcctacaccactatccta gtcaagattcgctccaaggaaggaaggcgaaaggcactctccacgtgtgcctctcacctc atggtggtaacactgttttttggaccctgtattttcatctacgctcgtcctttctctaca ttttctgtggacaagatggtgtctgtactctacaatgttattaccccaatgctaaacccc ctcatctacacacttcggaacaaagaggtaaagtcagccatgcagaagctctgggtcaga aatgggcttacttggaaaaagcaggagacatga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_8|955_aa MDKFLDTYTLPRLNQEEVESLHRPITGAEIVAIINSLPTKKSPGPDGFTPKFYQRYKGEL VPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRTKDKNHIIISIDAEKAFDKI QQPFMLKTLNKLGIDVTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIEGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKV SGYKINVQNSQAFSYTNNSQTESQIMSELPFTIASKRIKYLGIHLTRDVKDLFKENYKPL LSEIKEDIKKWKNIPCSWVGRINIMKMAILPKILFFLGFSVVFVGIVLGNLLILVTVTFD SLLHTPMYFLLSNLSCIDMILASFATPKMIVDFLRELGFVHSSSQMAFMLTLPFCGPNVI DSFFCDLPLVIKLACKDTYILQLLVIADSGLLSLVCFLLLLVSYGVIIFSVRYRAASRSS KAFSTLSAHITVVTLFFAPCVFIYVWPFSRYSVDKILSVFYTIFTPLLNPIIYTLRNQEF ADYGPYFQHIEIFYKSVSKSCQICAICKLANYTYPDYKQSLKPEAMDPQNYSLVSEFVLH GLCTSRHLQNFFFIFFFGVYVAIMLGNLLILVTVISDPCLHSSPMYFLLGNLAFLDMWLA SFATPKMIRDFLSDQKLISFGGCMAQIFFLHFTGGAEMVLLVSMAYDRYVAICKPLHYMT LMSWQTCIRLVLASWVVGFVHSISQVAFTVNLPYCGPNEVDSFFCDLPLVIKLACMDTYV LGIIMISDSGLLSLSCFLLLLISYTVILLAIRQRAAGSTSKALSTCSAHIMVVTLFFGPC IFVYVRPFSRFSVDKLLSVFYTIFTPLLNPIIYTLRNEEMKAAMKKLQNRRVTFQ >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_8|2868_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgcatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacacccaaattctaccagaggtacaagggcgaactg gtaccattccttctgaaactattccaatcaatagaaaaagaaggaatcctccctaactca ttttatgaggccagcatcatcctgataccaaagccaggcagagacacaacaaaaaaagag aattttagaccaatatccttgatgaacatcgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcatatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcatataaacaga accaaagacaaaaaccacataattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacctttcatgctaaaaactctcaataaattaggtattgatgtgacgtatttcaaa ataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcttattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaatagagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtacaaaactcacaagcattctcatacaccaacaacagtcaa acagagagccaaatcatgagtgaactcccatttacaattgcttcaaagagaataaaatac ctaggaatccaccttacaagggacgtgaaggacctcttcaaggaaaactacaaaccactg ctcagtgaaataaaagaggatataaagaaatggaagaacattccatgctcatgggtagga agaatcaatatcatgaaaatggccatactgcccaagattttattcttcttgggattctct gtggtcttcgtggggattgtgttaggaaacctgctcatcttggtgactgtgacctttgat tcgctccttcacacaccaatgtattttctgcttagcaacctctcctgcattgatatgatc ctggcttcttttgctacccctaagatgattgtagatttcctccgagaacttggatttgtg cactcatctagtcaaatggctttcatgttgactttgcccttctgtggtcccaatgttata gacagctttttctgtgaccttccccttgtgattaaacttgcctgcaaggacacctacatc ctacagctcctggtcattgctgacagtgggctcctgtcactggtctgcttcctcctcttg cttgtctcctatggagtcataatattctcagttaggtaccgtgctgctagtcgatcctct aaggctttctccactctctcagctcacatcacagttgtgactctgttctttgctccgtgt gtctttatctacgtctggcccttcagcagatactcggtagataaaattctttctgtgttt tacacaattttcacacctctcttaaatcctattatttatacattaagaaatcaagagttt gctgattacggcccatatttccagcatattgaaatattctataagtctgtaagtaaatcg tgtcagatttgtgctatctgtaaacttgcaaattatacatatccagattacaaacaaagt ctgaaacctgaggcaatggacccacagaactattccttggtgtcagaatttgtgttgcat ggactctgcacttcacgacatcttcaaaattttttctttatatttttctttggggtctat gtggccattatgctgggtaaccttctcattttggtcactgtaatttctgatccctgcctg cactcctcccctatgtacttcctgctggggaacctagctttcctggacatgtggctggcc tcatttgccactcccaagatgatcagggatttccttagtgatcaaaaactcatctccttt ggaggatgtatggctcaaatcttcttcttgcactttactggtggggctgagatggtgctc ctggtttccatggcctatgacagatatgtggccatatgcaaacccttgcattacatgact ttgatgagttggcagacttgcatcaggctggtgctggcttcatgggtcgttggatttgtg cactccatcagtcaagtggctttcactgtaaatttgccttactgtggccccaatgaggta gacagcttcttctgtgacctccctctggtgatcaaacttgcctgcatggacacctatgtc ttgggtataattatgatctcagacagtgggttgctttccttgagctgttttctgctcctc ctgatctcctacaccgtgatcctcctcgctatcagacagcgtgctgccggtagcacatcc aaagcactctccacttgctctgcacatatcatggtagtgacgctgttctttggcccttgc atttttgtttatgtgcggcctttcagtaggttctctgtggacaagctgctgtctgtgttt tataccatttttactccactcctgaaccccattatctacacattgagaaatgaggagatg aaagcagctatgaagaaactgcaaaaccgacgggtgacttttcaatga >gi568815584f:19875519_20076562|GENSCAN_predicted_peptide_9|194_aa AGKPVPMAQMTVGQLFGCHRERLKPRLLQNPLLDRGKQTHPGGPECMSVGGYSQAVAERP LYSLRYNNIGFRPINSFAIASKCSNATKNIYDSWEEIKTSTLTGVWKKLIPVLMDYFEEF KTYMEKTSHEEGFPRRSLLVQQSFVPEKASQDDQRLDSSQCPLREQAYHLGRSGLLSLLL CTGTEVLPRKKGMP >gi568815584f:19875519_20076562|GENSCAN_predicted_CDS_9|585_bp gctggcaagcctgtaccaatggctcagatgacagtggggcagctctttggctgccacaga gaaaggctgaagccaaggctcctgcagaatccactgttggacagaggcaagcaaacccac cctggtggtccagagtgtatgtctgtgggagggtattcccaggctgtagcagagaggcct ctctattctttgagatacaacaatattggatttaggccaattaatagctttgcaatagcc tctaagtgttcaaatgccactaagaacatttatgattcatgggaggagatcaaaacctca acattaacaggagtttggaagaagctgattccagtcctcatggattactttgaagagttc aagacttacatggagaagacttcccatgaagaaggctttccacggagaagcttgcttgta cagcagtcgtttgtgccagaaaaggcaagccaagatgaccagagactcgactcttctcag tgcccactcagagagcaggcatatcatcttgggagaagtgggctgctgtccctgctcttg tgcactggcacagaagttctgcccaggaagaaaggcatgccataa