GENSCAN 1.0	Date run: 4-Nov-116	Time: 23:52:49

Sequence gi568815586f:69248409_69453216 : 204808 bp : 39.40% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2721   2930  210  2  0   47  106   199 0.686  15.79
 1.02 Intr +   4643   4746  104  1  2    9   98    83 0.993  -0.55
 1.03 Intr +   8289   8434  146  0  2   60   80   121 0.998   7.51
 1.04 Intr +   9324   9497  174  1  0   47   98   108 0.969   6.69
 1.05 Intr +  10182  10686  505  1  1  111  115   438 0.969  39.91
 1.06 Intr +  11020  11135  116  1  2   94   18   110 0.996   3.77
 1.07 Intr +  11636  11789  154  0  1  148   93   113 0.992  16.01
 1.08 Term +  13962  14151  190  0  1    8   43   377 0.903  21.14
 1.09 PlyA +  14551  14556    6                               1.05

 2.00 Prom +  22339  22378   40                              -6.45
 2.01 Init +  39342  39383   42  2  0   70   75    39 0.204   1.28
 2.02 Intr +  44839  45235  397  0  1   42   53   190 0.030   4.13
 2.03 Intr +  59138  60661 1524  0  0   15   47   364 0.148  13.60
 2.04 Term +  64297  64454  158  2  2   37   41   144 0.165   1.71
 2.05 PlyA +  64885  64890    6                               1.05

 3.02 PlyA -  65049  65044    6                               1.05
 3.01 Sngl -  68141  67830  312  2  0   99   37   230 0.614  14.68
 3.00 Prom -  70121  70082   40                              -0.95

 4.03 PlyA -  70238  70233    6                               1.05
 4.02 Term -  75597  75455  143  1  2   36   49   132 0.398   1.21
 4.01 Init -  82891  82669  223  1  1   63   60   250 0.122  18.46
 4.00 Prom -  83376  83337   40                              -4.45

 5.00 Prom +  91604  91643   40                              -4.45
 5.01 Init + 100001 100136  136  1  1  100   93    92 0.902  11.57
 5.02 Intr + 101700 101864  165  1  0   27   83    93 0.725   1.71
 5.03 Intr + 103812 103890   79  1  1   77   80   105 0.912   6.29
 5.04 Intr + 107503 107572   70  1  1   82  100    14 0.526   0.27
 5.05 Intr + 111946 112038   93  0  0   63   23   105 0.497   0.74
 5.06 Intr + 114380 114499  120  1  0   74   80   103 0.887   7.87
 5.07 Intr + 117225 117291   67  2  1   67   72    39 0.693  -2.24
 5.08 Intr + 117382 117476   95  2  2   83   76    72 0.688   4.26
 5.09 Intr + 122298 122390   93  2  0   92   95    61 0.809   6.44
 5.10 Term + 123250 123366  117  0  0   49   45   104 0.390  -0.24
 5.11 PlyA + 127960 127965    6                               1.05

 6.02 PlyA - 128575 128570    6                               1.05
 6.01 Sngl - 134010 133297  714  0  0   69   49   278 0.498  17.98
 6.00 Prom - 138450 138411   40                              -5.45

 7.00 Prom + 154866 154905   40                              -4.45
 7.01 Init + 163350 163464  115  2  1   72   64    83 0.398   4.72
 7.02 Intr + 167125 167312  188  2  2   40   94    85 0.588   2.79
 7.03 Intr + 174343 174588  246  0  0   33   32   157 0.439   1.33
 7.04 Term + 175593 175649   57  2  0  114   54    88 0.932   4.71
 7.05 PlyA + 175699 175704    6                               1.05

 8.03 PlyA - 176034 176029    6                               1.05
 8.02 Term - 182711 182610  102  2  0  111   34    48 0.243  -1.00
 8.01 Init - 189103 188960  144  1  0   66   56   143 0.496   9.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_1|532_aa
EAEYGGHDQIDLYDDVISPSANNGDAPEDRDYMDTLPPTVGDDVGKGAAPNVVYTYTGKR
IALYIGNLTWWTTDEDLTEAVHSLGVNDILEIKFFENRANGQSKGFALVGVGSEASSKKL
MDLLPKRELHGQNPVVTPCNKQFLSQFEMQSRKTTQSGQMSGEGKAGPPGGSSRAAFPQG
GRGRGRFPGAVPGGDRFPGPAGPGGPPPPFPAGQTPPRPPLGPPGPPGPPGPPPPGQVLP
PPLAGPPNRGDRPPPPVLFPGQPFGQPPLGPLPPGPPPPVPGYGPPPGPPPPQQGPPPPP
GPFPPRPPGPLGPPLTLAPPPHLPGPPPGAPPPAPHVNPAFFPPPTNSGMPTSDSRGPPP
TDPYGRPPPYDRGDYGPPGREMDTARTPLSEAEFEEIMNRNRAISSSAISRAVSDASAGD
YGSAIETLVTAISLIKQSKVSADDRCKVLISSLQDCLHGIESKSYGSGSRRRERSRERDH
SRSREKSRRHKSRSRDRHDDYYRERSRERERHRDRDRDRDRERDREREYRHR
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_1|1599_bp
gaagctgaatatggtgggcatgatcagatagatttgtatgacgatgtcatatctccatct
gcaaataatggagatgccccagaagaccgagattacatggatactctcccaccaactgtt
ggtgatgatgtgggtaaaggagcagcaccaaatgttgtctatacatatactggaaagaga
attgcattatatattggaaatctaacatggtggacaacagatgaagacttaactgaagca
gttcattctttgggagtaaatgatattttggagataaaattttttgaaaatcgggcaaat
ggccagtcaaaggggtttgcccttgttggtgttggatctgaagcatcttcaaaaaagtta
atggatctgttacctaaaagagaacttcatggtcagaatcctgttgtaactccatgcaat
aaacagttcctgagtcaatttgaaatgcagtccaggaaaactacacaatcaggacaaatg
tctggggaaggtaaagctggtcctccaggaggcagttcccgtgcagcatttccacaaggt
ggtagaggacggggccgttttccaggggctgttcctggtggggacagatttcctgggcca
gcaggaccaggagggccacccccaccttttccagctggacagactccaccacgtccaccc
ttaggtcctccaggcccacctggtccaccaggtcctccacctcctggtcaggttctgcct
cctcctctagctgggcctcctaatcgaggagatcgccctccaccaccagttctttttcct
ggacaaccttttgggcagcctccattgggtccacttcctcctggccctccacctccagtt
ccaggctacggcccccctcctggcccaccacctccacaacagggaccacctccacctcca
ggcccctttccacctcgtccacccggtccacttgggccaccccttacactagctcctcct
ccgcatcttcctggaccacctccaggtgccccaccgccagctccgcatgtgaacccagct
ttctttcctccaccaactaacagtggcatgcctacatcagatagccgaggtccaccacca
acagatccatatgggcgacctccaccatatgataggggtgactatggcccccctggaagg
gaaatggatactgcaagaacgccattgagtgaagctgaatttgaagaaatcatgaataga
aatagggcaatctcaagcagtgctatttcgagagctgtgtctgatgccagtgctggtgat
tatgggagtgctattgagacactggtaactgcaatttctttaattaaacaatccaaagta
tctgctgatgatcgttgcaaagttcttattagttctttgcaagattgccttcatggaatt
gagtccaagtcttatggttctggatcaagaagacgtgaacgatcaagagagagggaccat
agtagatcacgagaaaagagtcgacgtcataaatcccgtagtagagaccgtcatgacgat
tattacagagagagaagcagagaacgagagaggcaccgggatcgtgaccgagaccgtgac
cgagagcgtgaccgagagcgcgaatatcgtcatcgttag
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_2|706_aa
MTSIGACFLEKLAKNNHTEDRQGNRRHEHSIDQLAVTDVHSTTAEYTFLSTRGGFSKIDH
MLGCKTNLNKYKEIEIIQSIFSNHNEIKQEINSRRKTRKSTNLWELSSALLMNGSEKKTQ
VNLDNIWKLMKAKMQHITLMECSESRKNKIPRNPTYKGREGPLQGELQTTAQGNKRGYKQ
MEEHPMLMGGRINTVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAK
SILSQKNKAGGLTLPDFKLFYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYVIFD
KPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLTVRPKTIKTLE
ENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTGKETTIRVNRQPTKWKK
IFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSPS
LAVREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVW
QFLRDLELEIPFDPAIPLLGIYPRDYKSCCYKDTCTRMFIAALFTIAKTWNQPKCPTMID
WIKKMWHIYTMEYYAAIKNDEFMSFVGTWMKLEIIFLSKLLQEQKTKHRIFSLIDSKPAP
QDSGAIIGLQEPEPMTLAQDLLHLKSSATEAACRAISPDTKRDPLG
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_2|2121_bp
atgacctcaattggagcctgcttcctggagaaattggccaagaacaaccatacagaagat
cgacaaggaaacagaagacatgaacacagtatagatcagttggctgtaactgatgtccac
tcaacaacagcagaatacacattcttaagtactcgtggaggcttctccaagatagaccac
atgttaggctgtaaaacaaatcttaacaaatataaggagattgaaatcatacaaagtatc
ttctccaatcacaatgaaataaaacaagaaatcaatagcagaaggaaaacaagaaagtcc
acgaatttgtgggaactaagcagtgccctcttaatgaatgggtcagagaagaaaacacaa
gtgaatttagataatatctggaaactaatgaaagcaaaaatgcaacatatcacacttatg
gaatgcagtgaaagtagaaagaataaaatacctaggaatccaacttacaagggacgtgaa
ggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggatacaaacaa
atggaagaacatcccatgctcatgggtggaagaatcaataccgtgaaaatggccatactg
cccaaggtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcaca
gaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaag
tcaatcctaagccaaaagaacaaagctggaggcctcacactacctgacttcaaactattc
tacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaa
tggaacagaacagagccctcagaaataatgccgcatatctacaactatgtgatctttgac
aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa
aactggctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaa
atcaattcaagatggattaaagacttaaccgttagacctaaaaccataaaaaccctagaa
gaaaacctgggcattaccattcaggacataggcatgggcaaggacttcatgtctaaaaca
ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc
ttctgcacaggaaaagaaactaccatcagagtgaacaggcaacctacaaagtggaagaaa
atttttgcaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaa
caaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggacatgaacaga
cacttctcgaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctcaccatca
ctggccgtcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttaga
atggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgtggagaaatagga
acacttttacactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtgg
caattcctcagggatctagaactagaaataccatttgacccagccatcccattactgggt
atatacccaagagactataaatcatgctgctataaagacacatgcacacgtatgtttatt
gcagcattattcacaatagcaaagacttggaaccaacccaaatgtccaacaatgatagac
tggattaagaaaatgtggcacatatacaccatggaatactatgcagccataaaaaatgat
gagttcatgtcctttgtagggacgtggatgaaattggaaatcatctttctcagtaaacta
ttgcaagaacaaaaaaccaaacaccgcatattctcactcatagacagcaaacctgcacca
caagattcaggtgccataataggtttgcaagaacctgagcctatgaccctagcccaagac
ctactgcatctgaaatccagtgccactgaagctgcttgcagggccatcagtcctgacacc
aagagggatccccttggctaa
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_3|103_aa
MAKKGQHKTQAIASEGASPKAWWLTCGVGPVGAEKARIEIWEPPPRFQRMYRNAWVSRQK
CAAGVEPSWRTSVRAVQKGNVGLEPPHSVPTGALPSGAVRRGP
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_3|312_bp
atggctaaaaagggccaacataaaactcaagccattgcttcagagggtgcaagccccaag
gcttggtggctgacatgtggtgttgggcctgtgggtgcagagaaggcaagaattgagatt
tgggaacctccacctagatttcagaggatgtatagaaatgcctgggtgtccaggcagaag
tgtgctgcaggggtggaaccttcatggagaacctctgtgagggcagtgcagaagggaaat
gtggggctggagcccccacacagcgtccccactggggcactccctagtggagctgtgaga
agagggccatag
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_4|121_aa
MQNRQGNQVYVYKKIRKKALGAESRAAEQAVNKKKGSAARKFTETEPAQAKGSAFTRVKQ
LPPRDPPAQLSSSAVLEAGSLRSELASLGSCKGSLPGFADCHLLPVSSYGEERVLNLFFF
L
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_4|366_bp
atgcagaatcggcaaggcaatcaggtgtatgtttataaaaagataagaaaaaaggcatta
ggagccgaaagccgtgcagccgagcaagcagtaaataaaaagaaaggctcggcagcccgg
aagttcacagaaacagagccagcacaggcaaaaggcagcgcttttacaagagtaaaacag
ctaccgcccagggacccgccggctcagctctctagctctgcagttttagaggctggaagt
ctgagatcggaactagcatcattaggttcttgtaagggctctcttcctggctttgcagac
tgccaccttctccctgtatcttcctatggtgaagaaagagttctgaatctcttcttcttc
ttataa
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_5|344_aa
MKALIVLGLVLLSVTVQGKVFERCELARTLKRLGMDGYRGISLANWMCLAKWESGYNTRA
TNYNAGDRSTDYGIFQINSRYWCNDGKTPGAVNACHLSCSALLQDNIADAVACAKRVVRD
PQGIRAWWHSVFAVLICPIIDDVHLDHLIKVIGEAVTPPSWCEDNNNCSKQSLSAYPQYF
QGVTIVKPIVYGNVARYFGKKREEDGHTHQWTVYVKPYRNEDMSAYVKKIQFKLHESYGN
PLRVVTKPPYEITETGWGEFEIIIKIFFIDPNERPVTLYHLLKLFQSDTNAMLGKKTVVS
EFYDEMVLAAEVRAVPESDKKPYRHGSYILVGGKQVLNKQNKEV
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_5|1035_bp
atgaaggctctcattgttctggggcttgtcctcctttctgttacggtccagggcaaggtc
tttgaaaggtgtgagttggccagaactctgaaaagattgggaatggatggctacagggga
atcagcctagcaaactggatgtgtttggccaaatgggagagtggttacaacacacgagct
acaaactacaatgctggagacagaagcactgattatgggatatttcagatcaatagccgc
tactggtgtaatgatggcaaaaccccaggagcagttaatgcctgtcatttatcctgcagt
gctttgctgcaagataacatcgctgatgctgtagcttgtgcaaagagggttgtccgtgat
ccacaaggcattagagcatggtggcattcagtttttgcagttttgatttgccccattatt
gatgatgttcacttggatcacttgattaaggttattggtgaagcggtgacgccaccctcc
tggtgtgaggataacaataactgtagcaaacagtcgctgagcgcttacccgcagtatttc
caaggtgttactatcgttaaaccaatagtttacggtaatgttgctcggtattttggaaag
aaaagagaagaagatgggcacactcatcagtggacagtatatgtgaaaccatatagaaat
gaggatatgtcagcatatgtgaagaaaatccagtttaaattacatgaaagctatggcaat
cctttaagagttgttactaaacctccatatgaaattactgaaacaggatggggtgaattc
gaaataatcatcaaaatatttttcattgaccctaatgaaagacctgtaaccctgtatcat
ttgctaaagctgtttcaatcagacaccaatgcaatgctggggaaaaagacagtggtttca
gagttctatgatgaaatggtgctagctgctgaggttagagcagtgcctgaatcagacaaa
aagccctatcgtcatgggtcctacattctagtgggaggaaaacaggtattaaataaacag
aacaaggaagtataa
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_6|237_aa
MLSCSLGASSATVKQSTRIIPKVSNSRPWLLDRISGPTRSPGKLSALKGWTQANQINDLR
QSVIWLGDWVVSLEHRMQMQCNWNTLDFCIIPYSYNETDYSWEMVKGRLLGREDNLSLDI
TKLKKQIFEASQAHLSIVPGAEALDQVAENLYGLNPTTWIKSIGGSTVVNCGITFLCLIS
LFLVCRTSQRILGKNQENEQAFITMAHLYKKKGRDVAGSQEPRTEGTAEAMAEEHKL
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_6|714_bp
atgctgtcttgtagcttgggtgccagctcagccacagtaaaacagagcaccaggattatt
cctaaggtttccaactccaggccctggctcctggacagaatctctggacctacccgcagc
ccagggaaactcagtgcccttaagggatggacacaagctaatcaaattaatgatttaaga
cagtctgttatttggcttggagattgggtggtgagtctcgaacatcgcatgcaaatgcag
tgcaactggaatactttggatttctgtatcatcccctattcctataatgagactgattat
tcatgggaaatggtcaaaggacgccttctgggtagggaagataatttatcattggacata
actaaattaaagaaacaaatttttgaagcctctcaagctcacttatccattgtgcctgga
gctgaggcgttagatcaggtggcagaaaatctttatggattaaaccccacgacttggatt
aagtctattgggggctccactgtagtaaattgtggaattacatttctctgtttaatcagc
ttgtttttagtgtgccggaccagtcaaagaatcctgggtaaaaatcaagagaatgaacaa
gccttcatcaccatggcacatttatataaaaagaaagggagagatgttgcgggaagtcag
gaaccccgaacggagggaacggctgaagccatggcagaagaacataaattgtga
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_7|201_aa
MVQAEETARANSLKRKHSWCVWGVPQGPMWPMPSEQGEAWATKGDHVLKTNKQVFGASNG
PSDSNNFKASRDPMAKMGEEQPQCVGKGGYNYKSQDCVWGQDKAIKFVIKNKVEAIAIRA
ISEVSVFNTCASQLYEKLHCDMSCAIHGKVVRSHSHEAQKDPIPLPDLDLMVLFHDLRES
PYKILLYEDVMFGAAAIILES
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_7|606_bp
atggtccaggcagaagaaactgccagggcaaactctctgaagcgaaagcattcctggtgt
gtctggggagtaccccaggggccaatgtggccgatgccaagtgagcaaggggaagcctgg
gcaacaaagggagatcatgtcttaaaaacaaacaaacaggtatttggagcaagcaatgga
cccagtgacagtaacaatttcaaagcaagtagagatccaatggcaaagatgggagaggaa
cagcctcaatgtgtaggaaaaggtggctataactacaaaagccaagactgtgtatgggga
caggacaaggccattaagttcgtcattaaaaacaaagtagaggccatagccatcagggcc
atttctgaagtgagtgtctttaacacctgtgcttcccagctgtatgagaaactgcattgt
gatatgagttgtgccattcatggaaaagtagtcaggagtcattctcatgaagcccagaag
gatccaatacccctccctgatttagacctaatggtgctgttccatgacctccgcgaaagc
ccgtataagatattactctatgaagatgtaatgtttggagctgcagcgatcattttggaa
tcatga
>gi568815586f:69248409_69453216|GENSCAN_predicted_peptide_8|81_aa
MHLRSKEHRILPANQQIPGEKHRRDSPSQPPESTNSGSTLISRLPSRTLAFSVAPCPRPA
ADDQLSAPKVTFLFKQSVQIE
>gi568815586f:69248409_69453216|GENSCAN_predicted_CDS_8|246_bp
atgcatctacgaagcaaagaacacagaatactgccagcaaaccagcagataccaggagag
aagcacagaagagattctccctcacagcccccagaaagcacaaattctggcagcaccttg
atctcaagactaccttccagaactctggctttctctgtggcaccatgccctcggccagca
gcagatgaccaactcagtgctccaaaggttacatttctgttcaagcaatcagtccaaatt
gaatag