GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:02:35 Sequence gi568815586f:69259973_69490313 : 230341 bp : 39.50% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 72 225 154 1 1 148 93 113 0.992 16.01 1.02 Term + 2398 2587 190 1 1 8 43 377 0.903 21.14 1.03 PlyA + 2987 2992 6 1.05 2.00 Prom + 10775 10814 40 -6.45 2.01 Init + 27778 27819 42 0 0 70 75 39 0.204 1.28 2.02 Intr + 33275 33671 397 1 1 42 53 190 0.030 4.13 2.03 Intr + 47574 49097 1524 1 0 15 47 364 0.148 13.60 2.04 Term + 52733 52890 158 0 2 37 41 144 0.165 1.71 2.05 PlyA + 53321 53326 6 1.05 3.02 PlyA - 53485 53480 6 1.05 3.01 Sngl - 56577 56266 312 0 0 99 37 230 0.614 14.68 3.00 Prom - 58557 58518 40 -0.95 4.03 PlyA - 58674 58669 6 1.05 4.02 Term - 64033 63891 143 2 2 36 49 132 0.398 1.21 4.01 Init - 71327 71105 223 2 1 63 60 250 0.122 18.46 4.00 Prom - 71812 71773 40 -4.45 5.00 Prom + 80040 80079 40 -4.45 5.01 Init + 88437 88572 136 2 1 100 93 92 0.902 11.57 5.02 Intr + 90136 90300 165 2 0 27 83 93 0.725 1.71 5.03 Intr + 92248 92326 79 2 1 77 80 105 0.912 6.29 5.04 Intr + 95939 96008 70 2 1 82 100 14 0.526 0.27 5.05 Intr + 100382 100474 93 1 0 63 23 105 0.497 0.74 5.06 Intr + 102816 102935 120 2 0 74 80 103 0.887 7.87 5.07 Intr + 105661 105727 67 0 1 67 72 39 0.693 -2.24 5.08 Intr + 105818 105912 95 0 2 83 76 72 0.688 4.26 5.09 Intr + 110734 110826 93 0 0 92 95 61 0.809 6.44 5.10 Term + 111686 111802 117 1 0 49 45 104 0.390 -0.24 5.11 PlyA + 116396 116401 6 1.05 6.02 PlyA - 117011 117006 6 1.05 6.01 Sngl - 122446 121733 714 1 0 69 49 278 0.498 17.98 6.00 Prom - 126886 126847 40 -5.45 7.00 Prom + 143302 143341 40 -4.45 7.01 Init + 151786 151900 115 0 1 72 64 83 0.398 4.72 7.02 Intr + 155561 155748 188 0 2 40 94 85 0.588 2.79 7.03 Intr + 162779 163024 246 1 0 33 32 157 0.439 1.33 7.04 Term + 164029 164085 57 0 0 114 54 88 0.932 4.71 7.05 PlyA + 164135 164140 6 1.05 8.00 Prom + 164696 164735 40 -5.05 8.01 Init + 169161 169424 264 2 0 54 75 156 0.209 7.96 8.02 Term + 178595 178645 51 1 0 90 49 69 0.088 -0.35 8.03 PlyA + 179626 179631 6 1.05 9.08 PlyA - 180381 180376 6 1.05 9.07 Term - 187177 187103 75 1 0 64 48 76 0.297 -1.94 9.06 Intr - 199231 199154 78 1 0 45 94 69 0.038 2.03 9.05 Intr - 210732 210548 185 1 2 45 53 246 0.637 15.39 9.04 Intr - 211132 210925 208 1 1 26 -3 190 0.408 1.43 9.03 Intr - 211615 211442 174 1 0 107 92 7 0.414 2.21 9.02 Intr - 216515 216347 169 1 1 93 30 102 0.648 3.93 9.01 Init - 221135 221086 50 2 2 69 95 14 0.503 0.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_1|114_aa XDYGSAIETLVTAISLIKQSKVSADDRCKVLISSLQDCLHGIESKSYGSGSRRRERSRER DHSRSREKSRRHKSRSRDRHDDYYRERSRERERHRDRDRDRDRERDREREYRHR >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_1|345_bp ngtgattatgggagtgctattgagacactggtaactgcaatttctttaattaaacaatcc aaagtatctgctgatgatcgttgcaaagttcttattagttctttgcaagattgccttcat ggaattgagtccaagtcttatggttctggatcaagaagacgtgaacgatcaagagagagg gaccatagtagatcacgagaaaagagtcgacgtcataaatcccgtagtagagaccgtcat gacgattattacagagagagaagcagagaacgagagaggcaccgggatcgtgaccgagac cgtgaccgagagcgtgaccgagagcgcgaatatcgtcatcgttag >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_2|706_aa MTSIGACFLEKLAKNNHTEDRQGNRRHEHSIDQLAVTDVHSTTAEYTFLSTRGGFSKIDH MLGCKTNLNKYKEIEIIQSIFSNHNEIKQEINSRRKTRKSTNLWELSSALLMNGSEKKTQ VNLDNIWKLMKAKMQHITLMECSESRKNKIPRNPTYKGREGPLQGELQTTAQGNKRGYKQ MEEHPMLMGGRINTVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAK SILSQKNKAGGLTLPDFKLFYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYVIFD KPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLTVRPKTIKTLE ENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTGKETTIRVNRQPTKWKK IFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSPS LAVREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVW QFLRDLELEIPFDPAIPLLGIYPRDYKSCCYKDTCTRMFIAALFTIAKTWNQPKCPTMID WIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEIIFLSKLLQEQKTKHRIFSLIDSKPAP QDSGAIIGLQEPEPMTLAQDLLHLKSSATEAACRAISPDTKRDPLG >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_2|2121_bp atgacctcaattggagcctgcttcctggagaaattggccaagaacaaccatacagaagat cgacaaggaaacagaagacatgaacacagtatagatcagttggctgtaactgatgtccac tcaacaacagcagaatacacattcttaagtactcgtggaggcttctccaagatagaccac atgttaggctgtaaaacaaatcttaacaaatataaggagattgaaatcatacaaagtatc ttctccaatcacaatgaaataaaacaagaaatcaatagcagaaggaaaacaagaaagtcc acgaatttgtgggaactaagcagtgccctcttaatgaatgggtcagagaagaaaacacaa gtgaatttagataatatctggaaactaatgaaagcaaaaatgcaacatatcacacttatg gaatgcagtgaaagtagaaagaataaaatacctaggaatccaacttacaagggacgtgaa ggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaa atggaagaacatcccatgctcatgggtggaagaatcaataccgtgaaaatggccatactg cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag tcaatcctaagccaaaagaacaaagctggaggcctcacactacctgacttcaaactattc tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa tggaacagaacagagccctcagaaataatgccgcatatctacaactatgtgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa atcaattcaagatggattaaagacttaaccgttagacctaaaaccataaaaaccctagaa gaaaacctgggcattaccattcaggacataggcatgggcaaggacttcatgtctaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacaggaaaagaaactaccatcagagtgaacaggcaacctacaaagtggaagaaa atttttgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggacatgaacaga cacttctcgaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctcaccatca ctggccgtcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttaga atggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgtggagaaatagga acacttttacactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtgg caattcctcagggatctagaactagaaataccatttgacccagccatcccattactgggt atatacccaagagactataaatcatgctgctataaagacacatgcacacgtatgtttatt gcagcattattcacaatagcaaagacttggaaccaacccaaatgtccaacaatgatagac tggattaagaaaatgtggcacatatacaccatggaatactatgcagccataaaaaatgat gagttcatgtcctttgtagggacgtggatgaaattggaaatcatctttctcagtaaacta ttgcaagaacaaaaaaccaaacaccgcatattctcactcatagacagcaaacctgcacca caagattcaggtgccataataggtttgcaagaacctgagcctatgaccctagcccaagac ctactgcatctgaaatccagtgccactgaagctgcttgcagggccatcagtcctgacacc aagagggatccccttggctaa >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_3|103_aa MAKKGQHKTQAIASEGASPKAWWLTCGVGPVGAEKARIEIWEPPPRFQRMYRNAWVSRQK CAAGVEPSWRTSVRAVQKGNVGLEPPHSVPTGALPSGAVRRGP >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_3|312_bp atggctaaaaagggccaacataaaactcaagccattgcttcagagggtgcaagccccaag gcttggtggctgacatgtggtgttgggcctgtgggtgcagagaaggcaagaattgagatt tgggaacctccacctagatttcagaggatgtatagaaatgcctgggtgtccaggcagaag tgtgctgcaggggtggaaccttcatggagaacctctgtgagggcagtgcagaagggaaat gtggggctggagcccccacacagcgtccccactggggcactccctagtggagctgtgaga agagggccatag >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_4|121_aa MQNRQGNQVYVYKKIRKKALGAESRAAEQAVNKKKGSAARKFTETEPAQAKGSAFTRVKQ LPPRDPPAQLSSSAVLEAGSLRSELASLGSCKGSLPGFADCHLLPVSSYGEERVLNLFFF L >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_4|366_bp atgcagaatcggcaaggcaatcaggtgtatgtttataaaaagataagaaaaaaggcatta ggagccgaaagccgtgcagccgagcaagcagtaaataaaaagaaaggctcggcagcccgg aagttcacagaaacagagccagcacaggcaaaaggcagcgcttttacaagagtaaaacag ctaccgcccagggacccgccggctcagctctctagctctgcagttttagaggctggaagt ctgagatcggaactagcatcattaggttcttgtaagggctctcttcctggctttgcagac tgccaccttctccctgtatcttcctatggtgaagaaagagttctgaatctcttcttcttc ttataa >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_5|344_aa MKALIVLGLVLLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRA TNYNAGDRSTDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRD PQGIRAWWHSVFAVLICPIIDDVHLDHLIKVIGEAVTPPSWCEDNNNCSKQSLSAYPQYF QGVTIVKPIVYGNVARYFGKKREEDGHTHQWTVYVKPYRNEDMSAYVKKIQFKLHESYGN PLRVVTKPPYEITETGWGEFEIIIKIFFIDPNERPVTLYHLLKLFQSDTNAMLGKKTVVS EFYDEMVLAAEVRAVPESDKKPYRHGSYILVGGKQVLNKQNKEV >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_5|1035_bp atgaaggctctcattgttctggggcttgtcctcctttctgttacggtccagggcaaggtc tttgaaaggtgtgagttggccagaactctgaaaagattgggaatggatggctacagggga atcagcctagcaaactggatgtgtttggccaaatgggagagtggttacaacacacgagct acaaactacaatgctggagacagaagcactgattatgggatatttcagatcaatagccgc tactggtgtaatgatggcaaaaccccaggagcagttaatgcctgtcatttatcctgcagt gctttgctgcaagataacatcgctgatgctgtagcttgtgcaaagagggttgtccgtgat ccacaaggcattagagcatggtggcattcagtttttgcagttttgatttgccccattatt gatgatgttcacttggatcacttgattaaggttattggtgaagcggtgacgccaccctcc tggtgtgaggataacaataactgtagcaaacagtcgctgagcgcttacccgcagtatttc caaggtgttactatcgttaaaccaatagtttacggtaatgttgctcggtattttggaaag aaaagagaagaagatgggcacactcatcagtggacagtatatgtgaaaccatatagaaat gaggatatgtcagcatatgtgaagaaaatccagtttaaattacatgaaagctatggcaat cctttaagagttgttactaaacctccatatgaaattactgaaacaggatggggtgaattc gaaataatcatcaaaatatttttcattgaccctaatgaaagacctgtaaccctgtatcat ttgctaaagctgtttcaatcagacaccaatgcaatgctggggaaaaagacagtggtttca gagttctatgatgaaatggtgctagctgctgaggttagagcagtgcctgaatcagacaaa aagccctatcgtcatgggtcctacattctagtgggaggaaaacaggtattaaataaacag aacaaggaagtataa >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_6|237_aa MLSCSLGASSATVKQSTRIIPKVSNSRPWLLDRISGPTRSPGKLSALKGWTQANQINDLR QSVIWLGDWVVSLEHRMQMQCNWNTLDFCIIPYSYNETDYSWEMVKGRLLGREDNLSLDI TKLKKQIFEASQAHLSIVPGAEALDQVAENLYGLNPTTWIKSIGGSTVVNCGITFLCLIS LFLVCRTSQRILGKNQENEQAFITMAHLYKKKGRDVAGSQEPRTEGTAEAMAEEHKL >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_6|714_bp atgctgtcttgtagcttgggtgccagctcagccacagtaaaacagagcaccaggattatt cctaaggtttccaactccaggccctggctcctggacagaatctctggacctacccgcagc ccagggaaactcagtgcccttaagggatggacacaagctaatcaaattaatgatttaaga cagtctgttatttggcttggagattgggtggtgagtctcgaacatcgcatgcaaatgcag tgcaactggaatactttggatttctgtatcatcccctattcctataatgagactgattat tcatgggaaatggtcaaaggacgccttctgggtagggaagataatttatcattggacata actaaattaaagaaacaaatttttgaagcctctcaagctcacttatccattgtgcctgga gctgaggcgttagatcaggtggcagaaaatctttatggattaaaccccacgacttggatt aagtctattgggggctccactgtagtaaattgtggaattacatttctctgtttaatcagc ttgtttttagtgtgccggaccagtcaaagaatcctgggtaaaaatcaagagaatgaacaa gccttcatcaccatggcacatttatataaaaagaaagggagagatgttgcgggaagtcag gaaccccgaacggagggaacggctgaagccatggcagaagaacataaattgtga >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_7|201_aa MVQAEETARANSLKRKHSWCVWGVPQGPMWPMPSEQGEAWATKGDHVLKTNKQVFGASNG PSDSNNFKASRDPMAKMGEEQPQCVGKGGYNYKSQDCVWGQDKAIKFVIKNKVEAIAIRA ISEVSVFNTCASQLYEKLHCDMSCAIHGKVVRSHSHEAQKDPIPLPDLDLMVLFHDLRES PYKILLYEDVMFGAAAIILES >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_7|606_bp atggtccaggcagaagaaactgccagggcaaactctctgaagcgaaagcattcctggtgt gtctggggagtaccccaggggccaatgtggccgatgccaagtgagcaaggggaagcctgg gcaacaaagggagatcatgtcttaaaaacaaacaaacaggtatttggagcaagcaatgga cccagtgacagtaacaatttcaaagcaagtagagatccaatggcaaagatgggagaggaa cagcctcaatgtgtaggaaaaggtggctataactacaaaagccaagactgtgtatgggga caggacaaggccattaagttcgtcattaaaaacaaagtagaggccatagccatcagggcc atttctgaagtgagtgtctttaacacctgtgcttcccagctgtatgagaaactgcattgt gatatgagttgtgccattcatggaaaagtagtcaggagtcattctcatgaagcccagaag gatccaatacccctccctgatttagacctaatggtgctgttccatgacctccgcgaaagc ccgtataagatattactctatgaagatgtaatgtttggagctgcagcgatcattttggaa tcatga >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_8|104_aa MTKQVKKQKYVTHNQKNNHSKETDPEMTDTELAEKNIKTPIIIIVDYLNENMNKVRKEIE GIKKREREKFQENDTILKKERSEEDKIKMRKQSPEMVKLERCER >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_8|315_bp atgactaaacaagtcaagaagcagaaatatgtgacccataaccagaaaaataatcactca aaggaaacagacccagaaatgacagatacagaattagcagagaagaatattaaaacacct attataattattgttgactatttaaatgaaaacatgaacaaagtgaggaaagaaattgaa ggcattaaaaaaagagagagagagaagtttcaggagaatgacaccatactgaaaaaagaa aggagtgaagaagacaaaattaagatgagaaaacagagtccagagatggtgaagctagag cgttgtgaacgctga >gi568815586f:69259973_69490313|GENSCAN_predicted_peptide_9|312_aa MDKGYTEVYCTILATFLRSEMESQFLLLHYVKGELFILLANQQSNNLKQHPQAAPTVQQF SPEALVEKHSLKQEILSLSEVNTSWNKFKQNISAHLCETLHCSNNFNSVRNLSRKSKNNF SLRKMTVPNMKDRFRAQSKSIPDNPEANLDEKPCKRSLACTGSAPPWREKTTPLGPGLIT DPRTDPEEAKSRELEPKIESASSSRPAGPEKSAAVDNPLPRTPAAEPSARIPKRAAGARE RQLRRGPAAVPVRFPSHSPSAQLALRHMLKKKKEEEKKRRGKKEKKESKKEMLLSGEIEP EQLWTLQLLGIQ >gi568815586f:69259973_69490313|GENSCAN_predicted_CDS_9|939_bp atggataaagggtatacagaggtctactgcactattcttgcaactttcctaagaagtgaa atggaaagccaatttctgctgcttcactatgtcaaaggcgagttatttatcttgttggct aatcaacagagtaataacttaaagcagcacccccaggcagcgcccacagttcaacagttt tctccagaggctctggttgagaaacacagtcttaaacaagaaatcctcagtttatccgaa gtaaacacttcatggaataagtttaaacaaaacatctctgcccacctgtgtgaaactctg cattgcagcaacaattttaacagtgtaaggaatctctcaagaaaatccaagaacaatttt tccttaagaaaaatgacagttccaaatatgaaggaccggttccgagcccagtccaaatcg atcccggacaaccctgaagcaaatctggatgaaaagccctgcaaaagatccctggcatgt acaggcagcgccccaccctggagggaaaagacgacacccctgggacccgggttaattact gaccccaggacagacccagaggaggcgaaaagtcgggaactggagcccaagattgagtct gccagctcttcccgtcccgctggcccggagaagtccgcggcggtcgacaatccccttccc cggacgccggcagccgagccctcagcccggatcccgaagcgggcagcgggggcacgagaa cgccagctccgacgcgggcccgcggccgtcccagtccgtttccctagccactcaccgtcc gcccagctggccttgaggcacatgctaaagaaaaagaaggaggaggagaaaaagaggaga ggaaaaaaagagaaaaaagaaagcaaaaaagaaatgcttctatctggagagattgaaccc gaacaactctggaccctgcaactattgggcattcagtga