GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:53:32 Sequence gi568815595f:138272884_138502266 : 229383 bp : 45.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 84 233 150 2 0 85 95 1 0.516 0.86 1.02 Intr + 1566 1661 96 2 0 91 98 44 0.909 5.91 1.03 Intr + 11548 11643 96 0 0 126 89 72 0.993 11.31 1.04 Intr + 16165 16237 73 0 1 91 116 -14 0.849 0.68 1.05 Intr + 17663 17756 94 0 1 66 89 98 0.674 6.72 1.06 Term + 25803 25938 136 0 1 126 48 84 0.577 5.69 1.07 PlyA + 27632 27637 6 1.05 2.12 PlyA - 28139 28134 6 1.05 2.11 Term - 30246 30137 110 1 2 13 48 63 0.259 -6.53 2.10 Intr - 30760 30624 137 0 2 53 110 148 0.998 13.71 2.09 Intr - 32144 31990 155 2 2 90 59 218 0.996 17.97 2.08 Intr - 33597 33515 83 1 2 15 116 84 0.785 3.26 2.07 Intr - 41524 41449 76 1 1 20 92 129 0.872 5.59 2.06 Intr - 42760 42644 117 1 0 64 38 113 0.836 4.56 2.05 Intr - 56523 56420 104 1 2 30 115 100 0.111 6.79 2.04 Intr - 73553 73488 66 0 0 106 66 53 0.046 3.78 2.03 Intr - 78533 78491 43 2 1 100 92 23 0.140 1.71 2.02 Intr - 86265 86220 46 2 1 119 86 21 0.034 3.41 2.01 Init - 96695 96571 125 2 2 80 72 79 0.148 3.31 2.00 Prom - 97979 97940 40 -11.14 3.00 Prom + 98502 98541 40 -6.66 3.01 Init + 100001 100193 193 1 1 86 93 274 0.997 26.83 3.02 Intr + 104884 105033 150 2 0 74 13 113 0.735 2.53 3.03 Term + 105426 105547 122 1 2 111 44 56 0.819 2.14 3.04 PlyA + 105974 105979 6 -1.95 4.00 Prom + 106804 106843 40 -6.36 4.01 Init + 111440 111506 67 1 1 101 69 57 0.535 6.34 4.02 Intr + 124441 124594 154 2 1 69 99 340 0.744 32.33 4.03 Intr + 125586 125689 104 0 2 78 75 87 0.964 6.22 4.04 Intr + 126160 126305 146 2 2 35 51 115 0.928 2.50 4.05 Intr + 127600 127730 131 0 2 14 71 150 0.774 5.39 4.06 Intr + 130768 130855 88 1 1 121 77 14 0.544 3.57 4.07 Intr + 137929 138024 96 0 0 62 47 90 0.130 2.51 4.08 Term + 151280 151336 57 1 0 103 55 26 0.074 -1.51 4.09 PlyA + 154022 154027 6 1.05 5.00 Prom + 159688 159727 40 -1.66 5.01 Init + 161916 162242 327 2 0 64 80 505 0.994 42.62 5.02 Intr + 166413 166610 198 2 0 32 43 125 0.725 1.95 5.03 Intr + 167255 167341 87 1 0 73 105 6 0.447 0.97 5.04 Intr + 179165 179206 42 1 0 85 116 71 0.976 8.04 5.05 Intr + 182311 182445 135 0 0 94 98 160 0.858 18.36 5.06 Intr + 184685 184761 77 1 2 75 93 118 0.991 9.31 5.07 Intr + 185847 186005 159 0 0 74 105 54 0.910 4.80 5.08 Intr + 186304 186370 67 1 1 96 94 169 0.999 17.21 5.09 Intr + 187068 187151 84 2 0 89 117 128 0.533 15.82 5.10 Intr + 187701 187783 83 2 2 72 58 86 0.942 2.44 5.11 Intr + 189203 189323 121 2 1 63 56 188 0.794 13.60 5.12 Intr + 191462 191632 171 1 0 40 94 210 0.995 16.94 5.13 Intr + 192456 192538 83 2 2 94 94 162 0.999 15.84 5.14 Intr + 194678 194726 49 2 1 94 77 53 0.928 3.48 5.15 Intr + 195222 195311 90 2 0 98 105 70 0.779 9.89 5.16 Intr + 195772 195834 63 0 0 97 117 58 0.976 8.51 5.17 Intr + 195936 195998 63 2 0 108 70 81 0.989 7.21 5.18 Intr + 196553 196621 69 1 0 66 116 13 0.704 1.28 5.19 Intr + 197177 197263 87 1 0 122 42 68 0.973 5.77 5.20 Intr + 197994 198143 150 2 0 34 66 176 0.926 10.36 5.21 Intr + 199480 199976 497 0 2 106 1 395 0.315 24.39 5.22 Intr + 200653 200751 99 1 0 38 103 83 0.374 4.03 5.23 Intr + 201338 201469 132 2 0 22 105 112 0.424 6.06 5.24 Intr + 206759 206889 131 2 2 -6 58 87 0.011 -3.36 5.25 Intr + 211723 211886 164 2 2 60 95 28 0.080 0.39 5.26 Term + 212006 212266 261 1 0 84 36 127 0.081 2.63 5.27 PlyA + 212767 212772 6 1.05 6.04 PlyA - 213396 213391 6 1.05 6.03 Term - 217604 217515 90 2 0 55 43 71 0.592 -2.98 6.02 Intr - 219811 219691 121 0 1 74 96 31 0.640 3.00 6.01 Intr - 227341 227227 115 2 1 121 53 39 0.223 3.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_1|214_aa NMAFQAEQKIKADILRSLSTEQLFRLLSDSDLNVLMKTLGLLRNLLSTRPHIDKIMSTHG KQIMQAVTLILEGEHNIEVKEQTLCILANIADGTTAKDLIMTNDDILQKIKYYMGHSHVK LQLAAMFCISNLIWNEEEGSQERQDKLRDMGIVDILHKLSQSPDSNLCDKYAPSPLPPAC ISTTTVCKSLLSLCKDTQVLFVYYALPPFPISEL >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_1|645_bp aatatggcatttcaggctgaacaaaaaataaaagcagatattttacgaagcttgagtact gaacagctattccggttattatcagattcagatttgaatgtgctgatgaagacattggga cttcttagaaatctcctctccactcgtcctcatatagataaaataatgagtactcatgga aagcaaattatgcaagccgtcactcttattctagaaggggaacataacattgaggtcaaa gagcagacactgtgcatcttagccaacatagcggatgggacaacagcaaaagatcttatt atgaccaatgatgatatcctacagaaaatcaagtattacatgggccattcacatgttaaa ctgcagcttgcagccatgttttgtatatcaaacctcatatggaatgaagaggaaggttca caagaacgccaggataaattacgagacatgggcatcgtagatattctacacaaactgagt cagtcaccagattcaaacctttgtgacaagtatgccccatctccactaccaccagcttgc atctccactaccacagtatgcaagagcttgctatccctctgcaaggacacacaggtgctc ttcgtctactacgcactgccaccttttcccatctctgaattgtag >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_2|353_aa MLRRKVQAPSQDIQCLSLAATADLSSHTRAAAPYTLLSPPTSNQLAAPFLRGKACPKIKV TVPSMPAQGLAELGKAGRDLYILVACQPLVGSPSVCREAAGERPTAAVTGLRLLCRAMGS RKKEIALQGGELVAVVRGANAPLLQKTILDQLEAEKKVLAEGRERKVIKDEALSDEDECV SHGKNNGEDEDMVSSERTCTLAIIKPDAVAHGKTDEIIMKEAFEKLVHHMCSGPSHLLIL TRTEGFEDVVTTWRTVMGPRDPNVARREQPESLRAQYGTEMPFNAVHGSRDREDADRELA LLFPSLKFSDKDTEAPQAPKGSLRNAAIRKGPQPGACCDFISLNTTFQCMTVV >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_2|1062_bp atgctgaggaggaaagtccaggccccctcacaggacattcaatgcctgtcgctggctgcc actgctgacctctccagccacacccgtgccgcagccccttatactctactgtcaccccca accagtaaccagcttgctgctccattcctgcgaggcaaggcctgtccaaagataaaggtg actgtaccttctatgcctgcccagggcctggcagaacttgggaaggcgggcagggatctg tacatcctggtggcctgtcagcccctcgtgggctccccatcggtttgtcgcgaggctgcg ggggaacggcccacggccgcggtaacagggcttcgtcttctttgcagagccatgggcagc aggaagaaggaaattgccctgcagggaggagaactggtggctgtggttagaggagcaaat gccccactgctgcagaaaaccatcctagaccagctggaggccgaaaagaaagtgctggct gaaggcagagaacggaaagtgattaaagatgaggctctttctgatgaagatgaatgtgtt tcccatggaaagaataatggtgaagatgaggacatggtttcatcagagaggacctgtacc ttggccatcattaaaccagatgcagtggcccatggaaagactgatgagattatcatgaag gaggcatttgagaagctggtacatcacatgtgcagtggaccaagccacctcctgatcctc accaggactgagggcttcgaggacgtggtcactacctggcgaaccgtcatgggcccccgt gaccccaatgtggccaggagggagcagccagaaagtctccgagctcagtacggcacagaa atgcccttcaatgccgtccatggaagccgggacagagaagatgctgacagagaactggca ttgctcttccccagtttgaaattttcagacaaagatacagaagcccctcaggcccctaag ggctcactgagaaacgcagctatacggaagggccctcaacctggagcctgctgtgacttc atctccctcaacaccacatttcaatgcatgaccgttgtttag >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_3|154_aa MATSAVPSDNLPTYKLVVVGDGGVGKSALTIQFFQKIFVPDYDPTIEDSYLKHTEIDNQW AILDERIKTWQHDDEEKEENDNFDDGVLNGQKLCGRPVLSTLHLLSLNPYKCYRGSHIAC VLPEPKRQELAGLRGQQPGSPVAVGPPGFGASLL >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_3|465_bp atggcaaccagcgccgtccccagtgacaacctccccacatacaagctggtggtggtgggg gatgggggtgtgggcaaaagtgccctcaccatccagtttttccagaagatctttgtgcct gactatgaccccaccattgaagactcctacctgaaacatacggagattgacaatcaatgg gccatcttggacgaaagaataaaaacgtggcagcacgatgatgaggagaaggaagagaat gataattttgatgacggggtattgaatgggcagaaactgtgtggaaggcccgtgctaagc acattgcatttgctgtctttgaatccttacaaatgctatcgtggaagtcacattgcctgt gtcctcccagaacctaagagacaagagctggctggcctcaggggccagcagccaggctct cctgtggctgtgggacccccaggctttggagcaagcctgctctga >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_4|280_aa MAESERPVKKLLMMGLGIGGGGVLDTAGQEEFSAMREQYMRTGDGFLIVYSVTDKASFEH VDRFHQLILRVKDRESFPMILVANKVDLMHLRKITREQGKEMATKHNVGASAMAGFGVQP GRGSKADLSTPYQGQEKVESVPSSNPDSLPAGVDLEQGFEDLCATLIKLSFAFQIPYIET SAKDPPLNVDKAFHDLVRVIRCCWCEPKTGVFPAGVGVERQQEQRCTKEKFPSVTKRVTH MTPSSPPLLWLTPPASRALNTLVNTPKAKNKSQLVKSSHS >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_4|843_bp atggcagaatcggaaagaccagttaagaagctcctgatgatgggcttgggcattggtgga ggaggtgttctggacacagctgggcaggaggaattcagcgccatgcgggagcaatacatg cgcacgggggatggcttcctcatcgtctactccgtcactgacaaggccagctttgagcac gtggaccgcttccaccagcttatcctgcgcgtcaaagacagggagtcattcccgatgatc ctcgtggccaacaaggtcgatttgatgcacttgaggaagatcaccagggagcaaggaaaa gaaatggcgaccaaacacaatgtaggggcatctgccatggccggatttggtgtccaacct ggaagaggctccaaagcagacctcagcaccccttaccagggccaggagaaagtggaaagt gtgcccagttcaaaccctgattctctgccagccggtgttgaccttgagcaaggctttgag gatctctgtgccactttgatcaagttgagctttgccttccagattccgtacatagaaacc agtgccaaggacccacctctcaatgtcgacaaagccttccatgacctcgttagagtaatt aggtgctgctggtgtgagccaaagactggagtttttccagctggggtgggagtggagaga caacaggaacaacgctgcaccaaagaaaagttccccagcgtgactaagcgtgtaacgcac atgactccctcatctccacccctgctgtggttaacaccccctgcctctcgggccctcaac accctggtcaacaccccaaaggccaagaacaagagccagcttgtcaagtcctcccattct tga >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_5|1162_aa MRAEEPCAPGAPSALGAQRTPGPELRLSSQLLPELCTFVVRVLFYLGPVYLAGYLGLSIT WLLLGALLWMWWRRNRRGKLGRLAAAFEFLDNEREFISRELRGQHLPAWHLDDCQGVERV EGTRRALLWNPLHPGFSWSLLLWVLGVPAVLPALELPSDLQSPSFVLWAPKTLLGDPTRC LSCLSPSNWVRTGAAGRSKHQGKKIHFPDVERVEWANKIISQTWPYLSMIMESKFREKLE PKIREKSIHLRTFTFTKLYFGQKCPRVNGVKAHTNTCNRRRVTVDLQICPSSTWDVSSGG CFCVPMKDTWAEMGQGDSRGGKVGSVFTKSPSFSSSGYRGVSYIGDCEISVELQKIQAGV NGIQGTLRVILEPLLVDKPFVGAVTVFFLQKPPNSFPLPLKHLQINWTGLTNLLDAPGIN DVSDSLLEDLIATHLVLPNRVTVPVKKGLDLTNLRFPLPCGVIRVHLLEAEQLAQKDNFL GLRGKSDPYAKVSIGLQHFRSRTIYRNLNPTWNEVFEFMVYEVPGQDLEVDLYDEDTDRD DFLGSLQICLGDVMTNRVVDEWFVLNDTTSGRLHLRLEWLSLLTDQEVLTEDHGGLSTAI LVVFLESACNLPRNPFDYLNGEYRAKKLSRFARNKVSKDPSSYVKLSVGKKTHTSKTCPH NKDPVWSQVFSFFVHNVATERLHLKVLDDDQECALGMLEVPLCQILPYADLTLEQRFQLD HSGLDSLISMRLVLRFLQVEERELGSPYTGPEALKKGPLLIKKVATNQGPKAQPQEEGPT DLPCPPDPASDTKDVSRSTTTTTSATTVATEPTSQETGPEPKGKDSAKRFCEPIGEKKSP ATIFLTVPGPHSPGPIKSPRPMKCPASPFAWPPKRLAPSMSSLNSLASSCFDLADISLNI EGGDLRRRQLGEIQLTVRYVCLRRCLSVLINGCRNLTPCTSSGADPYVRVYLLPERKWAC RKKTSVKRKTLEPLFDETTPTVYPWLMAFMFYQQATPFGKFLSHKWVAWSYGCQRKSKSQ PGLAKAGAGSLSLQRGVEGEARAGTGPVLAGQLEFWVGVGLAGPTLGAAGRPCQPQGRAR DLQPTMPEPPTPSMGSCVARASPTSTAPCSTAPSPIDHPRAEECRHTAWDWQAAPPAALV WDPLGEASCAPESGGALEKLYV >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_5|3489_bp atgcgagcagaggagccctgcgcccccggggcccccagcgccctgggagcccagcgcacg ccgggccccgagctgcgcctgtccagccagctgctgcccgagctctgtaccttcgtggtg cgcgtgctgttctacctggggcctgtctacctagctggctacctggggctcagcataacc tggttgctgctcggcgccctgctgtggatgtggtggcgcaggaaccgccgcgggaagctt gggcgcctggccgccgccttcgaattccttgacaatgaacgcgagttcatcagccgcgag ctgcggggccagcacctgccagcctggcatctggatgactgtcaaggggtggagcgagtg gaagggacgcgcagagcacttctgtggaaccccctgcaccctggcttttcctggagcctg ctcctgtgggttctcggagtcccggctgtgctgcccgcgctagagctgccttctgatctt cagagcccatccttcgtgctctgggctcctaagacccttcttggggaccccactcgctgc ctttcatgcctgtccccatctaactgggtgcgcactggggctgccggcagaagcaagcac caggggaaaaagatccacttcccggacgtggagcgggtcgagtgggccaacaagatcatc tctcagacctggccctacctaagcatgatcatggaaagcaagttccgggagaaacttgag cccaagatccgagagaagagcatccacctgaggacctttacctttaccaagctctacttt ggacagaagtgtcccagggtcaacggtgtcaaggcacacactaatacgtgcaaccgaaga cgtgtgactgtggacctgcagatctgccccagcagcacctgggatgtaagcagtgggggc tgcttctgtgtccccatgaaagacacctgggcagagatgggacagggggacagcaggggt ggaaaagtgggcagcgtgtttaccaagagcccctccttttcatcttcagggtatcgtggg gtgagctacatcggggactgtgagatcagtgtggagctgcagaagattcaggctggtgtg aacgggatccagggcaccctgcgggtcatcctggagcccctcctagtggacaagcccttt gtgggagccgtgactgtgttcttccttcagaagccgcctaatagcttccctctgcccctg aagcacctacagatcaactggactggcctgaccaacctgctggatgcgccgggaatcaat gatgtgtcagacagcttactggaggacctcattgccacccacctggtgctgcccaaccgt gtgactgtgcctgtgaagaaggggctggatctgaccaacctgcgcttccctctgccctgt ggggtgatcagagtgcacttgctggaggcagagcagctggcccagaaggacaactttctg gggctccgaggcaagtcagatccctacgccaaggtgagcatcggcctacagcatttccgg agtaggaccatctacaggaacctgaaccccacctggaacgaagtgtttgagttcatggtg tacgaagtccctggacaggacctggaggtagacctgtatgatgaggataccgacagggat gacttcctgggcagcctgcagatctgccttggagatgtcatgaccaacagagtggtggat gagtggtttgtcctgaatgacacaaccagcgggcggctgcacctgcggctggagtggctt tcattgcttactgaccaagaagttctgactgaggaccatggtggcctttccactgccatt ctcgtggtcttcttggagagtgcctgcaacttgccgagaaacccttttgactacctgaat ggtgaatatcgagccaaaaaactctccaggtttgccagaaacaaggtcagcaaagaccct tcttcctatgtcaaactatctgtaggcaagaagacacatacaagtaagacctgtccccac aacaaggaccctgtgtggagccaggtgttctccttctttgtgcacaatgtggccactgag cggctccatctgaaggtgcttgatgatgaccaggagtgtgctctgggaatgctggaggtc cccctgtgccagatcctcccctatgctgacctcactcttgagcagcgctttcagctggac cactcaggcctggacagcctcatctccatgaggctggtgcttcggttcctgcaagtggag gaacgagagctggggagcccatacacaggacctgaagccctaaagaaaggccctctgctc atcaagaaagtggctaccaaccagggtcccaaagcccaacctcaggaagaaggccctaca gatttgccatgtcccccagaccctgcttctgatactaaggacgtatccaggagtaccaca accaccaccagtgctaccaccgttgccactgagcccacatcccaagagacaggcccagag cctaaaggcaaggacagtgccaaaaggttctgtgagcccatcggggagaagaagagtcca gccaccatcttcctgactgtcccaggtccccactctccagggcccatcaagtcacccaga cccatgaaatgccctgcctccccattcgcatggccgcccaagaggctggctcccagcatg tcctcgctcaactccttggcctcttcttgctttgacctggcagatatcagcctcaacatt gaaggtggggacctcaggcgacggcagctgggtgagattcagctcacagtgcgctatgtg tgtctgcggcgctgcctcagcgtgctaatcaatggctgcagaaacctaacaccatgtacc agcagtggagctgatccctacgtccgtgtctacttgttgccagaaaggaagtgggcatgt cgtaagaagacttcagtgaagcggaagaccttggaacccctgtttgatgagactaccccc acagtttacccttggctcatggccttcatgttttatcagcaggcaacaccttttggcaag tttcttagccacaagtgggttgcatggtcctatggctgccagcggaaatctaaatctcag cctgggctggccaaggccggagccggctccctcagcttgcagagaggtgtggagggagag gcgcgagcaggaactgggccggtgcttgcgggccagctggagttctgggtgggtgtgggc ttggcgggccccacactcggagcagccggccggccctgccagccccagggcagggctcgg gacctgcagcccaccatgcctgagcctcccaccccctccatgggctcctgtgtggcccga gcctccccgacgagcaccgccccctgctccacagcacccagtcccatcgaccacccaagg gctgaggagtgcaggcacacggcgtgggactggcaggcagctccacctgcagccctggtg tgggacccactgggtgaagccagctgtgctcctgagtctggtggggccttagagaagctt tatgtctag >gi568815595f:138272884_138502266|GENSCAN_predicted_peptide_6|108_aa XSSSSLCVLVSTVGKLCRLINEDVNEQVMQVLGPEDLQRYQLSHNRAFGVPEAFGVPESK PSLLDSISGPCPGCPLPFRVLAEDSFDESDFSEIDDSYDSNDSDVSFV >gi568815595f:138272884_138502266|GENSCAN_predicted_CDS_6|327_bp natagttcatcctcattgtgtgtgctagtaagcactgttggaaaactctgtaggctgatt aatgaagatgtgaatgagcaggttatgcaggtattaggacctgaagacctccagaggtac cagctcagccacaatagagcttttggggtccctgaggcttttggggtccctgaatccaag cctagtctcttggacagcatttctggaccctgccctgggtgcccactgcccttcagggtg ttggctgaagattcatttgacgaatctgatttttctgaaatagatgattcttatgattcc aatgattctgacgttagttttgtttag