GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:53:51 Sequence gi568815583f:78240429_78447974 : 207546 bp : 47.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 794 934 141 0 0 80 59 129 0.374 9.52 1.02 Intr + 5720 5899 180 0 0 54 68 87 0.126 3.14 1.03 Intr + 5985 6169 185 1 2 41 52 80 0.147 -0.79 1.04 Intr + 19235 19324 90 1 0 64 103 16 0.299 0.89 1.05 Intr + 23778 23965 188 2 2 55 99 187 0.564 14.99 1.06 Intr + 24329 24467 139 2 1 121 86 364 0.994 39.97 1.07 Intr + 30069 30249 181 2 1 84 84 75 0.675 6.14 1.08 Intr + 32667 32771 105 1 0 59 93 49 0.856 2.69 1.09 Intr + 33769 33996 228 2 0 103 100 402 0.961 40.74 1.10 Intr + 35070 35300 231 1 0 20 95 170 0.954 8.64 1.11 Intr + 39617 39717 101 0 2 116 83 79 0.999 10.03 1.12 Term + 39817 40032 216 0 0 108 45 168 0.942 11.64 1.13 PlyA + 40541 40546 6 1.05 2.09 PlyA - 41174 41169 6 1.05 2.08 Term - 47037 47028 10 2 1 112 44 5 0.273 -3.83 2.07 Intr - 47954 47861 94 0 1 39 75 128 0.556 5.62 2.06 Intr - 49284 49191 94 0 1 61 71 72 0.908 2.34 2.05 Intr - 49671 49523 149 1 2 90 64 71 0.831 4.85 2.04 Intr - 52360 52188 173 0 2 26 99 143 0.869 8.79 2.03 Intr - 52844 52739 106 0 1 46 101 65 0.829 2.87 2.02 Intr - 54545 54518 28 2 1 90 115 17 0.647 2.29 2.01 Init - 59217 59134 84 0 0 96 89 73 0.531 7.38 2.00 Prom - 61332 61293 40 -6.66 3.00 Prom + 61910 61949 40 -4.66 3.01 Init + 70539 70603 65 2 2 114 107 23 0.617 7.43 3.02 Intr + 76851 77013 163 0 1 94 62 38 0.158 1.78 3.03 Term + 78819 78836 18 2 0 132 29 33 0.324 0.12 3.04 PlyA + 81062 81067 6 1.05 4.00 Prom + 83170 83209 40 -3.06 4.01 Init + 100001 100070 70 1 1 114 94 199 0.829 24.31 4.02 Intr + 100615 100793 179 2 2 76 72 372 0.641 34.04 4.03 Intr + 103071 103184 114 2 0 98 58 142 0.866 12.84 4.04 Term + 107499 107549 51 2 0 113 49 101 0.728 6.13 4.05 PlyA + 111675 111680 6 1.05 5.06 PlyA - 113224 113219 6 1.05 5.05 Term - 121290 121119 172 2 1 97 43 47 0.061 -1.60 5.04 Intr - 134216 134078 139 0 1 87 53 72 0.359 3.12 5.03 Intr - 135965 135763 203 1 2 8 47 125 0.505 -0.67 5.02 Intr - 137270 136193 1078 0 1 -43 53 418 0.119 14.43 5.01 Init - 138294 137718 577 0 1 62 40 239 0.208 11.81 5.00 Prom - 138918 138879 40 -2.16 6.07 PlyA - 139404 139399 6 1.05 6.06 Term - 144457 144359 99 1 0 61 37 77 0.082 -1.97 6.05 Intr - 148733 148626 108 2 0 5 60 136 0.210 2.98 6.04 Intr - 156979 156826 154 0 1 115 64 8 0.062 1.17 6.03 Intr - 164870 164795 76 0 1 102 65 56 0.110 3.27 6.02 Intr - 170344 170251 94 1 1 69 50 135 0.920 7.34 6.01 Init - 172271 172269 3 2 0 108 81 0 0.794 1.30 6.00 Prom - 194029 193990 40 -0.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 180440 180559 120 1 0 103 85 43 0.917 6.19 S.002 Term + 182519 182650 132 1 0 80 43 113 0.962 4.09 S.003 Term + 190822 190912 91 1 1 96 45 96 0.821 3.29 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_1|661_aa XGLRQPPQCALDIESGARLGDPCAPEGAEALGCICVPSGPVYIEWVGSLGCSMGLGAVEQ GVALVEEPRAAQKLTEGVGGSGMAGCRSRALPHGKAAKAQREIERSAGPAGCSECRARQA HAHPELQLACKRLAVAPVPAGASPSTPPDKLREPAPALASPERGSHSAAHRARHIAAVHK SCCDEINKFHRENTCFLHMPAGSTDPRKGKGAASGRRDSCRRAPSRPKPPSPAPAMARGG SQSWSSGESDGQPKEQTPEKPRHKMVKETQYYDILGVKPSASPEEIKKAYRKLALKYHPD KNPDEGEKFKLISQAYEVLSDPKKRDVYDQGGEQAIKEGGSGSPSFSSPMDIFDMFFGGG GRMARERRGKNVVHQLSVTLEDLYNGVTKKLALQKNVICEKCEGVGGKKGSVEKCPLCKG RGMQIHIQQIGPGMVQQIQTVCIECKGQGERINPKDRCESCSGAKVIREKKIIEVHVEKG MKDGQKILFHGEGDQEPELEPGDVIIVLDQKDHSVFQRRGHDLIMKMKIQLSEALCGFKK TIKTLDNRILVITSKAGEVIKHGDLRCVRDEGMPIYKAPLEKGILIIQFLVIFPEKHWLS LEKLPQLEALLPPRQKVRITDDMDQVELKEFCPNEQNWRQHREAYEEDEDGPQAGVQCQT A >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_1|1986_bp naaggcctccggcagccaccacagtgtgctctggacatcgagtctggggcccggctggga gacccttgtgctccagagggcgctgaggctctgggctgcatttgtgtcccctctggccct gtctacatcgagtgggtggggtcccttgggtgttcgatgggactgggcgctgtggagcag ggggtggcgctcgtcgaagagcctcgggccgcacagaagctcacggagggggtgggaggc tcaggcatggcgggctgcaggtcccgagccctgccccacgggaaggcagctaaggcccag cgagaaatcgagcgcagcgccgggccggccggctgctctgagtgcagggcccgccaagcc cacgcccacccggaactacagctggcctgcaagcgcctcgccgtagccccggttcccgct ggcgcctctccttccacacctcccgacaagctgagggagccggctccggccttggccagc ccagaaaggggctcccacagtgcagcgcacagagccagacacatagctgctgtgcacaaa tcctgttgtgatgaaatcaataaatttcatcgtgagaacacgtgctttctgcacatgcca gccggctccacggacccacggaagggcaagggggcggcctcggggcggcgggacagttgt cggagggcgccctccaggcccaagccgccttctccggcccccgccatggcccggggcggc agtcagagctggagctccggggaatcagacgggcagccaaaggagcagacgcccgagaag cccagacacaagatggtgaaggagacccagtactatgacatcctgggcgtgaagcccagc gcgtccccggaggagatcaagaaggcctatcggaagctggcgctcaagtaccacccggac aagaacccggatgagggcgagaagtttaaactcatatcccaggcatatgaagtgctttca gatccaaagaaaagggatgtttatgaccaaggcggagagcaggcaattaaagaaggaggc tcaggcagccccagcttctcttcacccatggacatctttgacatgttctttggtggtggt ggacggatggctagagagagaagaggcaagaatgttgtacaccagttatctgtaactctt gaagatctatataatggagtcacgaagaaattggccctccagaaaaatgtaatttgtgag aaatgtgaaggtgttggtgggaagaagggatcggtggagaagtgcccgctgtgcaagggg cgggggatgcagatccacatccagcagatcgggccgggcatggtacagcagatccagacc gtgtgcatcgagtgcaagggccagggtgagcgcatcaaccccaaggaccgctgcgagagc tgcagcggggccaaggtgatccgtgagaagaagattatcgaggtacatgttgaaaaaggt atgaaagatgggcaaaagatactatttcatggagaaggagatcaggagcctgagctggag cctggtgatgtcataattgtgcttgatcagaaggatcatagtgtctttcagagacgaggc catgacttgatcatgaaaatgaaaattcagctttctgaagctctttgtggcttcaagaag acgataaaaacattggacaatcgaattcttgttattacatccaaagcaggtgaggtgata aagcacggggacctgagatgcgtgcgcgatgaaggaatgcccatctacaaagcacccctg gaaaaagggattctgatcatacagtttttagtaatctttcctgaaaaacactggctttct ctggaaaagcttcctcagctggaagctttactccctcctcgacagaaagtgaggattaca gatgacatggatcaggtggagctgaaggagttttgtcccaatgagcagaactggcgtcag cacagggaggcctacgaggaggacgaagacgggccccaggctggagtgcagtgccagacg gcatga >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_2|245_aa MPRPFDSSSLRVAVARSPASDVQPGSAVYGILFKQEQAHDDAIWSVAWGTNKKENSETVV TGSLDDLVKVWKWRDERLDLQWSLEGHQLGVVSVDISHTLPIAASSSLDAHIRLWDLENG KQIKSIDAGPVDAWTLAFSPDSQYLATGTHVGKVNIFGVESGKKEYSLDTRGKFILSIAY SPDGKYLASGAIDGIINIFDIATGKLLHTLEGHAMPIRSLTFSPDSQLLVTASDDGYIKI YDVGL >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_2|738_bp atgccccgccccttcgatagctcatctttgcgcgtcgcagtcgcgcggagcccggcttcc gacgtgcagcctggcagtgcagtgtacggtattctcttcaaacaagagcaagcccatgat gatgccatttggtcagttgcttgggggacaaacaagaaggaaaactctgagacagtggtc acaggctccctagatgacctggtgaaggtctggaaatggcgtgatgagaggctggaccta cagtggagtctggagggacatcagctgggagtggtgtctgtggacatcagccacaccctg cccattgctgcatccagctctcttgatgctcatattcgtctttgggacttggaaaatggc aaacagataaagtccatagatgcaggacctgtggatgcctggactttggccttttctcct gattcccagtatctggccacaggaactcatgtcgggaaagtgaacatttttggtgtggaa agtgggaaaaaggaatattctttggacacgagaggaaaattcattcttagtattgcatat agtcctgatgggaaatacctagccagtggagccatagatggaatcatcaatatttttgat attgcaactggaaaacttctgcataccctggaaggccatgccatgcccattcgctccttg accttttccccggactcccagctccttgtcactgcttcagatgatggctacatcaagatc tatgatgtgggcctgtga >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_3|81_aa MPHGLFHATDRKLLTNMYNLLWAQSQLLSLASEALCDLPLLLLTPISRRQPSLNVSTLGS PLLDTHSVLLRALPCQKVGNA >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_3|246_bp atgccccatgggctatttcacgccacggacaggaagttgctgaccaatatgtacaatctg ctctgggcacagtcccaacttctcagcctggcatctgaggccctctgtgatctgcctctg cttcttctcacacccatctccagaaggcagcccagtctaaatgtcagcaccctggggagc cctcttctggacactcacagcgtgctgctccgagcactgccatgccagaaagttggcaat gcctag >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_4|137_aa MPNFAGTWKMRSSENFDELLKALGVNAMLRKVAVAAASKPHVEIRQDGDQFYIKTSTTVR TTEINFKVGEGFEEETVDGRKCRSLATWENENKIHCTQTLLEGDGPKTYWTRELANDELI LTFGADDVVCTRIYVRE >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_4|414_bp atgcccaacttcgccggcacctggaagatgcgcagcagcgagaatttcgacgagctgctc aaggcactgggtgtgaacgccatgctgaggaaagtggccgtagcggctgcgtccaagccg cacgtggagatccgccaggacggggatcagttctacatcaagacatccaccacggtgcgc accactgagatcaacttcaaggtcggagaaggctttgaggaggagaccgtggacggacgc aagtgcaggagtttagccacttgggagaatgagaacaagatccactgcacgcaaactctt cttgaaggggacggccccaaaacctactggacccgtgagctggccaacgatgaacttatc ctgacgtttggcgccgatgacgtggtctgcaccagaatttatgtccgagagtga >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_5|722_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAIIKSLPTKKSPGPDGFTAEFYQRYNEEL VPFLLKLFQSIEKEGILSNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHNQVSFIPGMQGWFNIHKSMKVIQHINRTKDKNYMIISIDAEKAFDKI QQPFMLKTLKKLGIQLTKDVKDLFKENYKALLNEIKEDTNKWKNIPCSWIGRINIMKMAI LPKVIYRFNAIPIKPPMTFFTELEKTTLKFIWNQKRVCIAKTILSQKNKAGGITLPDFKL YYKATVTKAAWYCYQNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKQWGKDSLFNKWCW ENWLAIWRKLKLDPFLTPYTKINSRWIKDLNVRPKIIKTLEENLGNTIQDIDMGKDFMTK TPKAMATKAKIDKWDLVKLKSFFTAKETTIGVNRQPTEWEKIFTIYPSDKVLISRIHKEL KQIYKKKPKNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSPLAIREMQIKTTMRYHLTPV RMAIIKKSGNNSKDLEPTQMSINDDWIKKMWHIYIMEYDAAIKKDEFLSFVGTWMKLETI ILSKLSQGEKTKHRMFSLIGQGPVQPQHLVLAWHRVECSFPENPHKVPTPGLLSGSENES IVGPGRQSLPQSTSDDPTVLGLPDPRSQQTQTKESTFCLNAPAAVPERALIGLAWITCLS QN >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_5|2169_bp atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatcc ctaaatagaccaataacaggctctgaaattgaggcaataattaagagcctaccaaccaaa aaaagtccaggaccagacggattcacagctgaattctaccagaggtataacgaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctctctaactca ttttatgaagccagcatcatcctgataccaaagcctggaagagacacaacaaaaaaagag aattttagaccaatatccctaatgaacatcgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccacaatcaagttagcttcatccct gggatgcaaggctggttcaacatacacaaatcaatgaaagtaatccagcatataaacaga accaaagacaaaaactacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacagcccttcatgctaaaaactctcaaaaaactaggaatccaacttacaaaggatgtg aaggacctcttcaaggagaactacaaagcactgctcaacgaaataaaagaggacacaaac aaatggaagaacattccatgctcatggataggaagaatcaatatcatgaaaatggccata ctgcccaaggtaatttatagattcaatgccatccccatcaagccaccaatgactttcttc acagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagtctgcattgcc aagacaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaacta tactacaaggctacagtaaccaaagcagcatggtactgctaccaaaacagagatatagat caatggaacagaacagagccctcagaaataataccacacatctacaaccatctgatcttt gacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgg gaaaactggctagccatatggagaaagctgaaactggatcctttccttacaccttataca aaaattaattcaagatggattaaagacttaaatgttagacctaaaatcataaaaacccta gaagaaaacctaggcaataccattcaggacatagacatgggcaaggacttcatgactaaa acaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctagttaaactaaag agcttctttacagcaaaagaaactaccatcggagtgaacagacaacctacagaatgggag aaaatttttacaatctacccatctgacaaagtgctaatatccagaatccacaaagaactt aaacaaatttacaagaaaaaaccaaagaaccccatcaaaaagtgggcaaaggatatgaac agacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaatgctcatca ccactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagtt agaatggcgatcattaaaaagtcaggaaacaacagcaaagacttggaaccaacccaaatg tccatcaatgatgactggattaagaaaatgtggcacatatacatcatggaatacgatgca gccataaaaaaggatgagttcttgtcctttgtagggacatggatgaagctggaaaccatt attctgagcaaactatcgcaaggagagaaaaccaaacaccgcatgttctcactcataggg cagggccctgtgcagcctcagcacctggtgctggcctggcaccgagttgagtgctcattc cctgagaacccgcataaggtgcccacaccaggccttctcagtggctctgaaaatgagagc atagtgggacctgggaggcagagtctcccgcaaagtaccagtgatgaccccacagttcta ggcttacctgatccccgtagccagcaaacccagacgaaagagagcaccttctgcctaaat gctccggctgcggtcccagagagggctctgattggcttagcttggatcacatgcctgtct cagaactga >gi568815583f:78240429_78447974|GENSCAN_predicted_peptide_6|177_aa MLIEFGVVAAPEGQQSTAVYHWCNTAVLDPLALAGMMAVAAPDDPPPPSVSSSKPSTVGT RQLSGPWHARLCPREADIMELIKPTWTQVPGIQEAVLSHPPQQSPADAGLPGSGQRSSKG AVPPCRLHIPEVMAAVITDKDVPMLPQSRLLQDAWGLGDPWTAVRATLTDYGKEQSS >gi568815583f:78240429_78447974|GENSCAN_predicted_CDS_6|534_bp atgctgattgaatttggtgtggtcgccgcacctgagggccagcagtccacagctgtctac cactggtgcaataccgcggtgctggacccactggcattggctggcatgatggcagtggct gctcctgacgacccgccaccgccatcggtatcatcttccaagccatcaacagtgggcacc aggcagctgtcagggccctggcatgcaaggctgtgcccaagggaagcggacatcatggag ctaattaaaccaacgtggacccaagtgcctggaattcaggaggctgtcctctcccaccca ccccaacagagccccgcagacgctggattaccaggaagtggacaaagaagcagcaaaggg gcagttcctccatgtcgtctgcacatcccagaagtgatggccgcggtcattactgacaaa gatgtacccatgttgccacagtctcggctgctgcaggatgcctggggtctgggcgatcca tggacagcagtgagggctacattgacagattacggcaaagagcagagcagttag