GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:39:30 Sequence gi568815594f:39598328_39878431 : 280104 bp : 42.31% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 3309 3348 40 -3.45 1.01 Init + 13681 13722 42 0 0 81 121 24 0.635 5.49 1.02 Intr + 20479 20550 72 0 0 53 87 52 0.143 0.28 1.03 Intr + 31416 31510 95 2 2 74 96 11 0.183 -1.66 1.04 Intr + 36802 36971 170 1 2 47 116 95 0.937 6.87 1.05 Intr + 39320 39579 260 0 2 35 30 216 0.899 6.76 1.06 Intr + 40187 40370 184 1 1 65 84 278 0.901 23.54 1.07 Term + 50609 50976 368 0 2 21 44 267 0.054 9.38 1.08 PlyA + 51612 51617 6 1.05 2.04 PlyA - 52015 52010 6 1.05 2.03 Term - 74891 74322 570 2 0 36 37 263 0.774 9.45 2.02 Intr - 81511 81407 105 1 0 93 75 49 0.630 3.59 2.01 Init - 88076 88074 3 2 0 98 81 0 0.487 0.35 2.00 Prom - 91634 91595 40 -8.15 3.00 Prom + 91901 91940 40 -5.85 3.01 Init + 99352 99548 197 0 2 11 49 270 0.020 13.85 3.02 Intr + 120152 120340 189 2 0 49 85 145 0.023 8.08 3.03 Term + 131451 131532 82 0 1 81 42 83 0.229 -0.81 3.04 PlyA + 133334 133339 6 1.05 4.03 PlyA - 133663 133658 6 1.05 4.02 Term - 172641 171754 888 0 0 81 42 918 0.981 77.97 4.01 Init - 173044 172682 363 1 0 82 44 481 0.851 40.20 4.00 Prom - 174328 174289 40 -10.05 5.00 Prom + 175143 175182 40 -9.65 5.01 Init + 176517 176606 90 2 0 72 110 105 0.967 11.64 5.02 Intr + 179355 179483 129 2 0 51 110 82 0.980 6.67 5.03 Term + 180033 180107 75 2 0 129 54 41 0.966 1.66 5.04 PlyA + 180271 180276 6 1.05 6.14 PlyA - 180637 180632 6 1.05 6.13 Term - 220651 220267 385 0 1 46 50 159 0.142 1.28 6.12 Intr - 239881 239529 353 1 2 65 90 314 0.513 22.50 6.11 Intr - 243729 243621 109 2 1 57 94 102 0.943 7.07 6.10 Intr - 246474 246284 191 0 2 77 -57 176 0.898 -0.54 6.09 Intr - 250643 250524 120 2 0 109 86 -3 0.714 1.37 6.08 Intr - 251325 251193 133 2 1 33 63 86 0.020 0.33 6.07 Intr - 264006 263892 115 1 1 45 89 68 0.375 1.19 6.06 Intr - 264746 264542 205 2 1 73 119 138 0.997 13.25 6.05 Intr - 265132 265009 124 0 1 73 101 68 0.986 6.27 6.04 Intr - 268670 268534 137 2 2 62 94 72 0.966 3.65 6.03 Intr - 271135 271067 69 1 0 42 94 89 0.887 3.36 6.02 Intr - 274817 274659 159 2 0 61 110 49 0.780 3.66 6.01 Intr - 278826 278740 87 0 0 96 64 48 0.472 2.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_1|396_aa MEMRPELERLTFQKKGLHTFFNLHVGPCRGVKPPVEKKAPEQLSPQVHAITLSQFFYFFV ERGGLDVLPSKIWEKSTEAPAFSKCKGPTARTSLASSRDKNASGIRIQEETHMARLQGTE LSRRIQVLSTAAESGKFGAGYTKGICAASHPKWKGMQSLLQPPQRWAARMPPKRPNTPTP GRSYAFQVIICTAAVSDNNTHKLEGLFCKCSYYRIPNCRLFAGDAGSAAAELLPVQNRQP GRGNAKPEKQPCSAAAAAVPESLRSPQEKLGRGEGCWFGWLLGRNHGGGAPGVWSPPGGT AVAAVSVGLQSPQVPTGSLALGEEGACFFCGHPGHIKRRCYGCQPWLQLNLGEACPPGPL EWLWSCPGEILRTFLVLGAGPLGELEVAVADETVQF >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_1|1191_bp atggagatgcggcctgagctggagaggctcactttccagaagaaaggactgcacactttt ttcaacctgcatgttggaccttgcaggggtgtcaagccccctgttgagaagaaagctccg gagcagctgagcccacaggtgcatgccatcacactcagccaatttttttacttttttgta gagagagggggtctcgatgtgttgcccagcaagatctgggagaagagtactgaagcccca gcgttcagcaaatgcaaaggccctacagcaagaaccagcttggcaagttccagggacaaa aatgcaagtgggattagaatacaggaagaaactcacatggccaggctccagggtactgaa ctaagtagaaggatacaagtgctgtcgacggccgcggagagcggcaagttcggagcaggc tacactaaaggaatctgcgcggccagccaccctaagtggaagggaatgcaaagcctcctg caacctccacagcgctgggcagcaagaatgcctcctaagcgtcccaacactccaacacct ggaaggtcttacgctttccaagttataatatgtacagctgctgtttctgacaataacaca cacaaactggaaggtctcttctgcaaatgtagctattacagaatacccaattgcaggctc ttcgcgggggacgcgggctcggcagcagcggagcttctcccggtgcagaatcgccagccc ggccgaggaaacgccaagcccgagaaacagccctgttctgccgcagccgccgcagtcccc gagagcctgaggagcccccaggaaaaactggggcgaggggagggttgctggtttggctgg ctgcttggcaggaaccatggagggggagcaccaggagtgtggagcccgcctggtggcact gctgttgcagcagtttcagtgggcctccagtctcctcaggtgccaactggttctttggct ctgggtgaagagggtgcctgcttcttttgtgggcaccctgggcatatcaagaggcgctgc tatggatgtcagccatggctgcagttgaatcttggtgaggcttgtccacctggaccttta gagtggctgtggagctgccctggggaaattctgcggacatttcttgtactcggagctggt cctttgggagagctggaggttgctgttgctgatgagactgtgcagttttag >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_2|225_aa MYQQRRTGTFTWFSSVDTVSGRRDILPPTLTHPVSEACLTRAPEGSTKYGKEKPVPATAK TSSEIEAVISILPTKKSPGPDGFTAKFYQRYKEKLVLFLVKLFQTIDKQGLFPNSSYEAS IILIPKPDKDTTKKENFRPISLKNIDVKILNKILANQIQQHIKKLIHHDQVGFIPGIQVW FTIHKSIKVIHHINRSNDNNHRIISIDAEKAFDKNSTTLHAKNPQ >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_2|678_bp atgtatcaacagaggagaaccgggactttcacctggttctcctctgttgatactgtgagc ggaagaagggacattcttccacccaccttaacccaccctgtgtcagaggcctgccttaca agagctcctgaaggaagcactaaatatggaaaggaaaaaccggtaccagccactgcaaaa acaagttctgaaattgaggcagtaataagtatcctaccaaccaaaaaaagtccaggacca gatggattcacagccaaattctaccagaggtacaaagagaagctggtactgttccttgtg aaactattccaaacaatagataaacagggactcttccctaactcatcttatgaggccagc atcatcctgataccaaaacctgacaaagacacaacaaaaaaagaaaatttcaggccaata tctctgaagaacattgatgtgaaaatcctcaataaaatactggcaaaccaaatccagcag cacatcaaaaagcttatccaccatgatcaagtcggtttcatccctgggatacaagtctgg ttcaccatacacaaatcaataaaagtaatccatcacataaacagaagcaatgacaacaac cacaggattatctcaatagatgcagaaaaggcctttgataaaaattcgacaacccttcat gctaaaaaccctcaataa >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_3|155_aa MLQNKNDCSRFSGATPTENGPDRDPGERFPALDTAFSLRVPGLAESGRRSSRVRRRKPVR LRPNLRWSCPPVLRHAPALLSPWAVDGTGCRGAGDNAQATQEPTAGGQDSGMAGCRSRAL PRGEAAKACHIGGISIPKPVKSSFASILHSVYALP >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_3|468_bp atgttgcagaacaaaaacgactgcagtagattttctggggctacaccaactgaaaatggg ccggaccgagatccgggagagcgttttcctgcgctagacacggcgttcagcctccgggtt ccgggtctagctgagtcagggcggcgttccagccgagtgcggcgtcggaaacccgttcgg ttgcgcccgaaccttcggtggagctgcccgccagtcctgcgccatgcgcctgcactcctc agcccttgggcagtcgatggaactgggtgccgtggagcaggggacaatgctcaggccacg caggagcccacggcaggggggcaggactcaggcatggcgggctgcaggtcccgagccctg ccccgtggggaggcagctaaggcctgtcacattggcggcatttcaattcctaaacctgtc aagtcctcgtttgcctccattttgcacagcgtgtacgctctgccttaa >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_4|416_aa MNKLRAEEWFCDVTIVADSLKFRGHKVILAACSPFLRDQFLLNPSSELQVSLMHSARIVA DLLLSCYTGTLEFAVRDIVNYLTATSYLQMEHVVEKCQNALSQFTEPKIGLKEDGVREAS LPGPQSQPRSPHPPPPLSPPLLRPVKLEFPLDEDLELKAEEEDEDEDVSDICIVKVESAL DIAHRLKPPGGLGGGLGIGGSVGGHLGELAQSSVPPSTVAPPQGVVKACYSLSENAEGES LLLTPGGRASVGATSGLVEAAAAAMVARGAGGSQGPLPGSFSGGNPLKNIKCTKCPEVFQ GVEKLVFHMRQQHFIFMCPRCGKEFNHSNNLNHHRNVHRGVKSHSCGICGKCFTQKSTLH DHLNLHSGAQPYRCSYCDMRFAHKPAIRRHLKEQHGKTTAENVLETSVAEINVLIR >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_4|1251_bp atgaacaagctccgggcagaggagtggttctgcgacgtgaccattgtggccgacagcctc aagtttcgaggccacaaggtcatcttagccgcctgctcaccgttcctgcgggaccagttc ctgcttaaccccagctcggagctgcaggtctccctgatgcacagtgcacgcatcgtggcc gacctgcttctctcctgctacacgggcaccctggaattcgctgttagggacatcgtcaac tatcttacagccacctcctacctgcagatggagcacgtggtggagaaatgccagaatgcc ctcagccagttcactgagcccaaaataggcctcaaagaggatggggtccgtgaggctagc cttccaggaccccaaagccagccccgaagcccccaccccccacctcctctatcccctcca ctcctgcggccagtgaagctggagttcccactggatgaggacttggagctgaaagccgag gaagaggatgaggatgaggacgtatctgacatctgcatcgtcaaggtggagtcggccctg gacatcgcacaccggctcaagccccctggaggcctgggagggggcctgggcattggaggc tccgtgggtggccaccttggggagctggcccagagcagcgtgccccccagcactgtggcc ccaccgcagggtgtggtgaaggcctgctatagcctgtcggagaacgcagaaggggagagc ctgctgttgactccgggaggccgggccagcgtgggggccacctcgggcctggtggaagca gcagcggcggccatggttgcccggggggcggggggcagccagggacccctgcctgggagc ttctcaggtggaaaccccttaaagaacatcaagtgcaccaagtgcccggaagtgttccag ggcgtggagaagctggtcttccacatgcggcagcagcacttcatcttcatgtgccctcgc tgtggcaaggagttcaaccacagcaacaacctcaaccaccacaggaacgtgcatcgtggt gtcaagtcacactcgtgcggcatctgcggcaagtgcttcacacagaagtccaccctgcac gaccacctcaacctgcactcgggagcgcagccctaccgctgctcctactgcgacatgcgc ttcgcccacaagcctgccattaggcggcatctcaaggagcaacacggcaagaccaccgcc gagaacgtgctggagaccagtgtggccgagattaatgtcctcatccgctag >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_5|97_aa MTLRTVLLSLQALLAAAEPDDPQDAVVANQYKQNPEMFKQTARLWAHVYAGAPVSSPEYT KKIENLCAMGFDRNAVIVALSSKSWDVETATELLLSN >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_5|294_bp atgactctccgcacggtattattgtcattgcaagcactattggcagctgcagagccagat gatccacaggatgctgtagtagcaaatcagtacaaacaaaatcccgaaatgttcaaacag acagctcgactttgggcacatgtgtatgctggagcaccagtttctagtccagaatacacc aaaaaaatagaaaacctatgtgctatgggctttgataggaatgcagtaatagtggccttg tcttcaaaatcatgggatgtagagactgcaacagaattgcttctgagtaactga >gi568815594f:39598328_39878431|GENSCAN_predicted_peptide_6|728_aa VLSFTHPTSFHSAETYESLLQCLRMEDDKPLSRSLNADVPEQLITPLVSLGHISMLAPDQ FASPMKSVVANFIVKDLLMNDRSTGEKNGKLWSPDEEVSPEVLAKVQAIKLLVRWLLGMK NNQSKSANSTLRLLSAMLVSEGDLTEQKRISKSDMSRLRLAAGSAIMKLAQEPCYHEIIT PEQFQLCALVINDECYQVRQIFAQKLHKALVKLLLPLEYMAIFALCAKDPVKERRAHARQ CLLKNISIRREYIKQNPMATEKLLSLLPEYVVPYMIHLLAHDPDFTRSQDVDQLRDIKEC LWFMLEVLMTKNENNSHAFMKKMAENIKLTRDAQSPDESKTNEKLYTVCDVALCVINSKS ALCNADSPKDPVLPMKFFTQPEKPKPAGVLGAVNKPLSATGRKPYVRSTGTETGSNINVN SELNPSTGNRSRQLFSPIILLLALTTWEQSSEAAETGVSENEENPVRIISVTPVKNIDPV KNKEINSDQATQGNISSDRGKKRTVTAAGAENIQQKTDEKVDESGPPAPSKPRRGRRPKS ESQGNATKNDDLNKPINKGRKRAAVGQESPGGLEAGNAKAPKLQDLAKKAAPAERQIDLQ RSLVDFSKARQPAKHPFFRAFLKSSTLVAPQPLPQENPGSLAFWMHLACPLQEEGQPQGL KGSCPFATHLRPKAEKQNTGYPKTALLLDVGRTQGLRALHFLTIWQLGCPWKLIITSTGS VFLEAMAH >gi568815594f:39598328_39878431|GENSCAN_predicted_CDS_6|2187_bp gttctgtcttttacacatcctacctcgttccactctgcagagacatatgagtccttgtta cagtgcctaagaatggaggatgacaagccactcagtaggagtctgaatgctgatgtgcca gaacaacttataactccattagtttcattgggccacatttctatgttagcaccagatcag tttgcttccccaatgaaatctgtagtagcaaattttattgtgaaagatctgctaatgaat gacaggtcaacaggtgaaaagaatggaaaactgtggtctccagatgaagaggtttcccct gaagtactagcaaaggtacaggcaattaaacttctggtaaggtggctgttgggtatgaaa aacaaccagtctaaatctgccaattcaacccttcggttattatcagcgatgttggttagt gagggtgacctgacagagcaaaagaggatcagtaaatctgatatgtctcgcttgcgatta gctgctggtagtgccataatgaagcttgctcaggaaccttgttaccatgaaattattacc ccagaacagtttcagctctgtgcacttgttattaatgatgagtgttaccaagtaaggcag atatttgctcagaagctgcataaggcacttgtgaagttactgctcccattggagtatatg gcgatctttgccttgtgtgccaaagatcctgtgaaggagagaagagcacacgcacgacaa tgtttactgaaaaatatcagtatacgcagggaatacattaagcagaatcctatggctact gagaaattattatcactgttgcctgaatatgtagttccatacatgattcacctgctagcc catgatccagattttacaagatcacaagatgttgatcagcttcgtgatatcaaagagtgc ctatggttcatgcttgaagttttaatgacaaagaatgaaaacaatagccatgcctttatg aagaagatggcagagaacatcaagttaaccagagatgcccagtctccagatgaatccaag acaaatgaaaaactgtatacagtatgtgatgtggctctctgtgttataaatagtaaaagt gctttgtgcaatgcagattcaccaaaggacccagtcctcccaatgaaattttttacacaa cctgaaaagccaaagcctgctggagtactaggtgcagtaaataagcctttatcagcaacg ggaaggaaaccctatgttagaagcactggcactgagactggaagcaatattaatgtaaat tcagagctgaacccttcaaccggaaatcgatcaaggcaacttttctctccaattattttg ttactagcactcactacttgggaacagagttcagaggcagcagaaactggagttagtgaa aatgaagagaaccctgtgaggattatttcagtcacacctgtaaagaatattgacccagta aagaataaggaaattaattctgatcaggctacccagggcaacatcagcagtgaccgagga aagaaaagaacagtaacagcagctggtgcagagaatatccaacaaaaaacagatgagaaa gtagatgaatcgggacctcccgccccttccaaacccaggagaggacgtcgacccaagtct gaatctcagggcaatgctaccaaaaatgatgatctaaataaacctattaacaagggaagg aagagagctgcagtgggtcaggagagccctgggggtttggaagcaggtaatgccaaagca cccaaactgcaagatttagccaaaaaggcagcaccagcagaaagacaaattgacttacaa aggagcttagtggacttctccaaagcaaggcagcccgcgaagcacccatttttcagagcc tttttgaagagctccaccctggtggccccccaacccctaccccaagaaaaccctggcagt ttagccttctggatgcatctggcatgtcccctccaagaggaaggtcaaccgcagggactg aaaggaagctgtccctttgcaacccacctcaggccaaaagctgaaaagcaaaatactggt taccccaaaactgccttacttttagatgtaggccggacccagggactaagggctctgcat tttttgaccatttggcagctcggctgcccatggaagcttataattacgagcacaggatct gtctttttagaagccatggctcattag