GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:39:47 Sequence gi568815580f:59039949_59258682 : 218734 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11636 11722 87 1 0 61 101 64 0.894 5.65 1.02 Intr + 13216 13295 80 0 2 83 87 96 0.995 7.35 1.03 Intr + 17092 17205 114 1 0 123 95 79 0.999 11.66 1.04 Intr + 17242 17387 146 1 2 67 82 39 0.815 1.13 1.05 Intr + 17924 18002 79 0 1 89 73 39 0.807 1.11 1.06 Intr + 19537 19669 133 1 1 78 86 141 0.979 13.55 1.07 Intr + 22885 23046 162 0 0 76 61 70 0.533 3.27 1.08 Intr + 23408 23453 46 1 1 45 66 59 0.414 -2.62 1.09 Intr + 28424 28566 143 0 2 102 101 127 0.696 15.47 1.10 Term + 28893 29015 123 2 0 92 41 114 0.972 5.48 1.11 PlyA + 29223 29228 6 1.05 2.00 Prom + 29748 29787 40 -9.46 2.01 Init + 30400 30604 205 0 1 94 31 133 0.159 7.21 2.02 Intr + 37448 37540 93 0 0 56 100 50 0.096 2.94 2.03 Intr + 39261 39340 80 1 2 35 77 49 0.022 -2.23 2.04 Term + 44837 45115 279 1 0 54 38 149 0.104 1.95 2.05 PlyA + 45947 45952 6 1.05 3.00 Prom + 55639 55678 40 -3.26 3.01 Init + 56003 56177 175 1 1 96 49 114 0.620 7.81 3.02 Intr + 59918 60033 116 0 2 100 72 72 0.331 6.97 3.03 Term + 75783 77333 1551 2 0 -5 43 533 0.023 29.94 3.04 PlyA + 77466 77471 6 1.05 4.00 Prom + 78230 78269 40 -2.96 4.01 Init + 79121 79169 49 1 1 86 89 33 0.103 2.31 4.02 Intr + 90870 91032 163 1 1 100 78 80 0.654 7.23 4.03 Intr + 98459 98525 67 2 1 58 109 7 0.063 -1.29 4.04 Term + 99983 100180 198 1 0 20 42 212 0.123 7.20 4.05 PlyA + 101021 101026 6 1.05 5.00 Prom + 107943 107982 40 0.14 5.01 Init + 108206 108283 78 1 0 43 78 41 0.868 -0.34 5.02 Intr + 109565 109674 110 1 2 82 92 142 0.914 13.08 5.03 Intr + 112588 112737 150 1 0 103 37 72 0.686 2.88 5.04 Term + 115740 115863 124 0 1 49 55 116 0.642 2.26 5.05 PlyA + 116092 116097 6 1.05 6.04 PlyA - 116514 116509 6 1.05 6.03 Term - 119578 119518 61 0 1 108 49 34 0.806 -1.22 6.02 Intr - 126806 126685 122 2 2 91 41 93 0.147 4.19 6.01 Init - 133451 133383 69 2 0 75 98 48 0.560 5.55 6.00 Prom - 134779 134740 40 -0.86 7.03 PlyA - 135695 135690 6 1.05 7.02 Term - 137060 137018 43 1 1 120 47 27 0.262 -1.57 7.01 Init - 138222 138158 65 0 2 83 85 52 0.458 5.03 7.00 Prom - 143892 143853 40 -3.96 8.03 PlyA - 145512 145507 6 1.05 8.02 Term - 161921 161866 56 0 2 76 44 80 0.835 0.12 8.01 Init - 163150 163075 76 1 1 80 116 73 0.987 10.60 8.00 Prom - 165405 165366 40 -6.06 9.00 Prom + 180174 180213 40 -4.26 9.01 Init + 180318 180456 139 2 1 92 92 277 0.662 26.80 9.02 Intr + 185544 185786 243 1 0 108 93 174 0.976 17.37 9.03 Intr + 206271 206327 57 1 0 44 95 94 0.436 4.56 9.04 Intr + 207179 207200 22 0 1 91 102 8 0.355 -0.90 9.05 Intr + 210871 210970 100 1 1 83 82 46 0.420 3.61 9.06 Term + 217791 217904 114 2 0 65 46 150 0.454 7.07 9.07 PlyA + 218138 218143 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 75960 77333 1374 2 0 70 43 449 0.960 34.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_1|370_aa MPKFHWDNCRQAWWTNLLLLNNFVSVKNACNGWTWYLANDFQFHLTTPVIIFIHVKSTQI LILLGAMLFLASFTATALITLAYKLPVVAPSETRTSRGGLLNARLFTLCPLVHGKSGYET FGLDGKADCLLASKLLNLSTCTGVCANVTFSEQPLPIFQNKMARTVPGIEEAIVLYFVEY YTKPYCRFGPVLVGLFLSIYMHQNHQENILRTKLQLSTKPSTGPCGRRLWAESSLRATED MEVWKRLQALLSGSHPVPLKVTNRTHRRAKQIKGFNGKESSPGLVNRVLSWDIWSFLSSI SYARYLVHPILIILYNGLQETLIHHTDTNMFYLFSGHRVLTFVTGLALTLFIEKPCQELK QHLLGHECSG >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_1|1113_bp atgcccaaattccactgggataactgccggcaagcatggtggacgaatctgctgttgcta aataactttgtgtcggtcaagaatgcgtgcaatggctggacctggtaccttgccaatgac ttccagttccacctcaccacaccagtgattatcttcatccatgtaaagagtacacagatc ctcatcctccttggggccatgctgttcttggcatctttcacagccactgctctgatcacc ttggcatataaacttcctgtcgtggctccatcagaaaccaggacttcccggggagggctg ctgaatgccaggctgttcaccctgtgccctttggttcatggaaaaagtgggtatgaaact tttggtctggatgggaaagctgattgccttcttgcttccaaacttctgaacctttcaacc tgcactggtgtctgtgcaaatgtcaccttctcagagcagcctttgccgatcttccaaaat aaaatggccagaaccgttcctggtattgaagaggcgattgtattgtatttcgtggagtac tacacaaagccctactgccgatttgggccagttcttgtgggcctctttctgagcatttac atgcaccaaaaccaccaggaaaacattctcagaaccaagctgcagctctctaccaagccc tccaccggaccctgtgggcggcggctgtgggctgagtcctctttgcgtgccacggaggat atggaggtatggaagcggctccaggctttgctgtcgggttcacaccctgttcctttaaag gtgacaaatcgaacacacaggagagccaagcagataaaaggcttcaatggaaaagaatct tctccaggtctggtgaaccgtgtgctttcttgggacatctggagtttcctgtccagcatc agttatgctcgctacttggtgcatccgattctgatcatcctttacaatggccttcaggaa acacttattcaccacactgacaccaacatgttctatcttttctctggacaccgtgtgctg accttcgtcactgggctggccctgacgctgttcattgagaaaccatgtcaggaactgaag cagcacctgctgggccatgaatgttctggttaa >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_2|218_aa MTNVGGEGKAERFTHSDGHGQLTLALPASHGDHLPEDDEMRDLISLRSGPSAEEANLREW GAKAQQPRCRGFMSNSFQKFDCEKQERDRWKLECDVGLEEVEDTKTAPLCGAGCGEGSDD DEDAAKLRDSASSLVREIRVEVSGIPGSVDGWENCSAPGSRASDQSQPWGPARDIANLRT NVESTPALGVKNKQGAFRDGFGMRCPRLSAPGIAQLSS >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_2|657_bp atgaccaatgtgggtggggaggggaaggctgaacgtttcacacactctgatggccacgga cagctaaccttggccttgccagccagccacggggatcatctccctgaggatgatgagatg agagacctcatctccctgaggtctggtccctctgctgaagaagccaacctgagggagtgg ggcgccaaggctcaacagccgcgctgcagggggtttatgagcaattctttccagaagttt gactgtgaaaagcaggagagagatcgctggaagctggagtgtgatgtgggcttggaagag gtagaggatacgaaaacagcaccattgtgtggagcaggatgtggagaagggagcgatgac gatgaggatgctgctaagctaagggacagtgctagctctctggtgagggagatccgtgtg gaggtctctggcattcctgggtcagtggatggctgggagaactgctcagctccaggttcc agggcctctgatcaaagccaaccctgggggccagccagagatatcgcaaacctaaggact aacgtggaatctacacctgccttgggagtaaagaacaagcagggtgctttccgggatggc tttgggatgagatgcccaagacttagtgcccctggcattgctcagctctcaagctga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_3|613_aa MGGRSRSQGTPRSPAKHQKPGERQGTESPSQALAGTTSADTLFSDFRYPELGDNPFLLFC GSDRSCCVLAPLALEQAWTPDPGWANGKTLAVVDGLGINKIDRPLASLIKKRREKNQIDA IKNDKGDITTDTTEIQTTIREYYKHLYANKLENLEEMDKFLDTYILPRLNQEEVESLNRP ITGSEIEAINNSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKWGILPNSFYEA SIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANQIQQHIKKLIHHDQVGFIPGMQG WFNIRKSINIIQHINRTKDKNHMTISIDAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRA IYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVPARAIRQEKEIKHIQSGK EEVKLSLFADDMIVYLENPIISAQNLLKLIINFSKVSGYKINVQKSQAFLYTNNRQTESQ IMSELPFTIASKRIKYLGIQLTRNVKDLSKENYKPLLNEIKEDTNKWKNIPCSWIGRINI VKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFNQKRARITKSILSQKNKAGGITLPD FKLYYKATVTKTA >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_3|1842_bp atggggggacgcagcagaagccaaggaacaccaagatcaccagccaagcaccagaagcca ggggagaggcaaggaacagaatctccctcgcaggccttggcaggaaccacctctgctgac accttgttctcagacttccggtatccagaactgggagataatccatttctgttgttctgt ggctccgacaggagctgctgtgtccttgcacccctggccctggaacaggcttggactcct gacccaggttgggccaatggtaaaaccctggccgtggttgatggactagggatcaacaaa attgatagaccgctagcaagtctaataaagaagaggagagagaagaatcaaatagatgca ataaaaaatgataaaggggatatcaccactgataccacagaaatacaaactaccatcaga gaatactataaacacctctacgcaaataaactagaaaatctagaagaaatggataaattc ctggacacatacatcctcccaagactaaaccaggaagaagttgaatctctaaatagaccg ataacaggctctgaaattgaggcgataaataatagcctaccaaccaaaaaaagtccagga ccagacggattcacagccgaattctaccagaggtacaaggaggagctggtaccattcctt ctgaaactattccaatcaatagaaaaatggggaatcctccctaactcattttatgaggcc agcatcatcctgataccaaagccaggcagagacacaacaaaaaaagagaactttagacca atatccctgatgaatattgatgcaaaaatcctcaataaaatactggcaaatcaaatccag cagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggc tggttcaacatacgcaaatcaataaacataatccagcatataaacagaaccaaagacaaa aaccacatgactatctcaatagatgcagaaaaggcctttgacaaaatccaacagcccttc atgctaaaaactctcaataaattaggtattgatgggacgtatctcaaaataataagagct atttatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccct ttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtgttg gaagttccggccagggcaatcaggcaagagaaagaaataaagcatattcaatcaggaaaa gaggaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatc atctcagcccaaaatctccttaagctgataatcaacttcagcaaagtctcaggatacaaa atcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaa atcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaa cttacaaggaatgtgaaggacctctccaaggagaactacaaaccattgctcaatgaaata aaagaggacacaaacaagtggaagaacattccatgctcatggataggaagaatcaatatt gtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaagttcaaccaaaaaagagcc cgcattacgaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_4|158_aa MGFHRVAQAGLKLLTSGMQYCFWFTILVKRELSMDFYLVPPTIMALTYGSEKELGDSESD KYINSKYTSKRSSAAEHFHIRCHFQWDFLPRCQVPGDPAMVRAGAVGAHLPASGLDIFGD LKKMNKRQVTEEGGERAVAGTACASGESGLPSQGSGSQ >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_4|477_bp atggggtttcaccgtgttgcccaggctggtctcaaactcctgacctcagggatgcagtat tgcttctggtttactatcttggtgaagagggaactttctatggacttctacttagtccct cctaccataatggcccttacctatgggtcagagaaagaacttggggattctgagagtgac aagtatatcaattccaaatatacatccaagaggtcaagtgctgcagaacactttcatatc agatgccacttccagtgggacttcctgccacggtgccaggtccccggagaccctgctatg gtgcgtgcgggcgccgtgggggctcatctccccgcgtccggcttggatatcttcggggac ctgaagaagatgaacaagcgccaggtgacggaggagggtggtgagcgcgccgtggccggg accgcctgtgctagcggggagtcggggcttcccagtcagggctccggctcccagtga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_5|153_aa MDHPPGSADHPNNCRIVKRKIEAGTKLYYQVLNFAMIVSSALMIWKGLIVLTGSESPIVV VLSGSMEPAFHRGDLLFLTNFREDPIRAGEIVVFKVEGRDIPIVHRVIKVHEKDNGDIKF LTKGDNNEVDDRGLYKEGQNWLEKKDVVGRARG >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_5|462_bp atggaccacccacctggctctgcagatcaccccaacaactgcaggattgtgaagaggaag attgaggcaggtaccaaactctattaccaggttttaaacttcgccatgatcgtgtcttct gcactcatgatatggaaaggcttgatcgtgctcacaggcagtgagagccccatcgtggtg gtgctgagtggcagtatggagccggcctttcacagaggagacctcctgttcctcacaaat ttccgggaagacccaatcagagctggtgaaatagttgtttttaaagttgaaggacgagac attccaatagttcacagagtaatcaaagttcatgaaaaagataatggagacatcaaattt ctgactaaaggagataataatgaagttgatgatagaggcttgtacaaagaaggccagaac tggctggaaaagaaggacgtggtgggaagagcaagagggtga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_6|83_aa MTDIMDVKDIKRAFMNLFLILKKFSHFSSCSFVDFSTYPFNVGAPQTSVFFASEKQEIIV DNPSNDQYLKSQARVLSVPPIPV >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_6|252_bp atgacagacataatggatgtcaaggacattaaaagagcttttatgaatctgtttcttata ctcaagaagttcagccacttctcgtcatgctcctttgtggacttctccacttaccctttc aatgttggtgctccccaaacttctgtcttctttgcttctgagaaacaggaaattattgta gataaccccagcaatgaccagtacctgaaatcacaggctcgtgtgttatctgttcctcct attccagtgtga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_7|35_aa MTPSAKECAPLEVLAAIYRGCWLIGPGLSQLCIET >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_7|108_bp atgactccaagtgcaaaggagtgtgcaccactcgaggtcctagcagccatctaccgtggc tgctggctgatcggcccaggcttgtcacagctgtgcattgagacttga >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_8|43_aa MGPSPSKERGAVASRLLTRSILKKEGTKSSFISGVKEKQQDSC >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_8|132_bp atgggtcccagccccagcaaggagcggggagcagtggcaagccggctgctcaccaggtcc atcctgaaaaaggaagggaccaagtcctccttcatcagtggtgtcaaagagaaacagcag gacagctgttag >gi568815580f:59039949_59258682|GENSCAN_predicted_peptide_9|224_aa MRGRELPLVLLALVLCLAPRGRAVPLPAGGGTVLTKMYPRGNHWAVGHLMGKKSTGESSS VSERGSLKQQLREYIRWEEAARNLLGLIEAKENRNHQPPQPKALGNQQPSWDSEDSSNFK DVGSKGKVLECGKSKIKIKVMANVVSDQTNVHLTAGILGLSLILDKKALSTSSNSIEVNS MNFNAVKEHNSSPATEQTWTENDFDDLREEGFRRSNYSELKEEV >gi568815580f:59039949_59258682|GENSCAN_predicted_CDS_9|675_bp atgcgcggccgtgagctcccgctggtcctgctggcgctggtcctctgcctggcgccccgg gggcgagcggtcccgctgcctgcgggcggagggaccgtgctgaccaagatgtacccgcgc ggcaaccactgggcggtggggcacttaatggggaaaaagagcacaggggagtcttcttct gtttctgagagagggagcctgaagcagcagctgagagagtacatcaggtgggaagaagct gcaaggaatttgctgggtctcatagaagcaaaggagaacagaaaccaccagccacctcaa cccaaggccctgggcaatcagcagccttcgtgggattcagaggatagcagcaacttcaaa gatgtaggttcaaaaggcaaagttctcgagtgtggaaagtccaagatcaagatcaaggtg atggcaaatgtggtgtctgaccaaaccaatgttcatcttacagctggcattttaggcttg tctctaattttggataaaaaggcattgtccacatcaagcaatagtatagaggtcaatagt atgaacttcaacgctgtgaaggaacacaactcctcaccagcaacggaacaaacctggacg gagaatgactttgatgacttgagagaagaaggcttcagacgatcaaactactccgagcta aaggaggaagtttga