GENSCAN 1.0	Date run: 5-Nov-116	Time: 16:08:07

Sequence gi568815578f:18042661_18287456 : 244796 bp : 43.03% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   2978   2973    6                               1.05
 1.02 Term -  14217  13649  569  1  2   68   41   509 0.662  38.78
 1.01 Init -  14974  14875  100  1  1  104   99   121 0.977  15.24
 1.00 Prom -  26443  26404   40                              -3.76

 2.00 Prom +  67692  67731   40                              -2.86
 2.01 Init +  83366  84492 1127  1  2   60   41   339 0.070  20.07
 2.02 Intr +  95157  95391  235  0  1   59   67   216 0.048  14.29
 2.03 Term +  99548  99697  150  1  0   88   38   119 0.324   4.81
 2.04 PlyA +  99816  99821    6                               1.05

 3.00 Prom +  99828  99867   40                             -13.15
 3.01 Init + 100001 100259  259  1  1   66   72   327 0.980  26.20
 3.02 Intr + 102573 102691  119  1  2  126   89    16 0.975   5.68
 3.03 Intr + 108161 108282  122  1  2  119   63    42 0.759   4.09
 3.04 Intr + 116424 116605  182  0  2   32   79    90 0.729   2.01
 3.05 Intr + 119163 119579  417  1  0   74   95   172 0.669  10.50
 3.06 Intr + 119717 120285  569  0  2   96   68   422 0.847  33.50
 3.07 Intr + 139050 139186  137  2  2   82   76   122 0.968   9.77
 3.08 Intr + 140529 140638  110  0  2   61   84    42 0.918   1.03
 3.09 Intr + 141942 142132  191  1  2   84  111    95 0.999  10.70
 3.10 Term + 144626 144799  174  1  0   71   44   110 0.799   2.66
 3.11 PlyA + 145353 145358    6                               1.05

 4.00 Prom + 169616 169655   40                               0.24
 4.01 Init + 173882 173956   75  1  0    7  115    59 0.276   1.59
 4.02 Term + 177413 177853  441  1  0   57   42   217 0.535   9.26
 4.03 PlyA + 178005 178010    6                               1.05

 5.00 Prom + 179637 179676   40                              -5.66
 5.01 Init + 179766 179770    5  2  2   49  105     0 0.315  -2.73
 5.02 Intr + 184093 184265  173  1  2   10   75   121 0.680   2.59
 5.03 Intr + 184450 184575  126  2  0   96   44    52 0.717   2.25
 5.04 Term + 185053 185411  359  2  2  -36   45   279 0.888   5.87
 5.05 PlyA + 185645 185650    6                               1.05

 6.00 Prom + 186310 186349   40                              -0.16
 6.01 Sngl + 204829 205737  909  0  0   86   39   231 0.984  14.30
 6.02 PlyA + 207763 207768    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_1|222_aa
MPKVFLVKRRSLGVSVRSWDELPDEKRADTYIPVGLGRLLHDPPEDCRSDGGSSSGSGSS
SAGEPGGAESSSSPHAPESETPEPGDAEGPDGHLATKQRPVARSKIKVLCRLCPRRHLHH
ALASGSRGRRPSLPRPPAQPGTRAPSAPAAPLLHLPLGGGQGASARPPARRVAAAQGRPV
PGVGSVLPVAPPDAECCLWNCRKGTGITAFRQQPVLCARPVE

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_1|669_bp
atgcccaaagtcttcctggtgaagaggaggagcctgggggtctcggtccgcagctgggat
gagctcccggatgagaaaagggcagacacctacatcccagtgggcctaggccgcctgctc
cacgacccccccgaggactgccgcagcgacggcggcagcagcagcggcagcggcagcagc
agcgcgggggagcctggaggagcagagagcagctcgtccccgcacgcccccgagagcgaa
acccccgagcccggcgacgccgagggccccgatggacacctggcgaccaagcagcgcccg
gtcgccagatcgaaaatcaaggtactgtgtcgactctgcccccgccgccacctgcaccac
gccctggcgtcgggctcccgaggccggcgcccctccctgccgcgcccgcctgcgcaaccc
ggcacccgcgccccgagcgcgcccgccgcgccgctcctgcacctgcccctgggcggcggc
cagggcgcctcggcgcgtcctcccgcccgcagggtagctgctgcgcaggggcggccggtt
cccggagtgggctccgtcctcccggtggccccgcctgatgccgagtgctgcctctggaac
tgccgaaaaggcacaggcatcactgccttccgccagcagcccgttctgtgcgctaggcct
gtagaataa

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_2|503_aa
MSELPFTTASKRIKYLGIQLTMDVKDLFKENYKLLLSEIKEFTNKWNNIPCSWIGRLNIV
RMAILPQVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRTHIATSILSQKSKAGSITLP
DFKLYYKAAVTKTAWYWYQNRDIDQWNRIEPSEIIPHIYNHLIFDKPDKNKKWGKDFVFN
KWCWENWLAVCRKLKLDPFLTPYAKINSRWFKDLNVRLKTIKTLEENLGNTIQAIGMGKD
FMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGLISRI
YKELKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYH
LTPVRMAIIKKSGNNSGERRTRRLGLCARGVEPGQYRRRCALCGGLCASGGREREAAAAS
VGMSRSSKVVLGLSVLLTAATVAGVHVKQQWDQQRLRDGVIRDIERQIRKKENIRLLGEQ
IILTEQLEAEREKMLLAKGSQKS

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_2|1512_bp
atgagtgaactcccatttacaactgcttcaaagagaataaaatacctaggaatccagctt
acaatggatgtgaaggacctcttcaaggagaactacaaattactgctcagtgaaataaaa
gagttcacaaacaaatggaataacattccatgctcatggataggaagactcaatattgtg
agaatggccatactgccccaggtaatttataggttcaatgccatccccatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga
acccacattgccacatcaatcctaagccaaaagagcaaagctggaagcatcacactacct
gacttcaaactatactacaaggctgcagtaaccaaaacagcatggtactggtaccaaaac
agagatatagaccaatggaacagaatagagccctcggaaataataccacacatctacaac
catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggatttcgtatttaat
aaatggtgctgggaaaactggctagccgtatgtagaaagctgaaactggatcccttcctt
acaccttatgcaaaaattaattcaagatggtttaaagacttaaatgttagacttaaaacc
ataaaaactctagaagaaaacctaggcaataccattcaggccataggcatgggcaaggac
ttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct
acagaatgggagaaaatttttacaatctacccatctgacaaagggttaatatccagaatc
tacaaagaacttaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtgggca
aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagatatcat
ctcacaccagttagaatggcaatcattaaaaagtcaggaaataacagcggcgagcggcgc
acgcgacggctggggctctgcgctcgaggggtcgagcctgggcagtacaggcggcggtgc
gcactctgcggcggcctctgcgcctcgggcgggcgggagagagaggccgcggccgccagc
gtggggatgtctaggagctcgaaggtggtgctgggcctctcggtgctgctgacggcggcc
acagtggccggcgtacatgtgaagcagcagtgggaccagcagaggcttcgtgacggagtt
atcagagacattgagaggcaaattcggaaaaaagaaaacattcgtcttttgggagaacag
attattttgactgagcaacttgaagcagaaagagagaagatgttattggcaaaaggatct
caaaaatcatga

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_3|759_aa
MDSSIHLSSLISRHDDEATRTSTSEGLEEGEVEGETLLIVESEDQASVDLSHDQSGDSLN
SDEGDVSWMEEQLSYFCDKCQKWIPASQLREQLSYLKGDNFFRFTCSDCSADGKEQYERL
KLTWQQVVMLAMYNLSLEGSGRQGYFRWKEDICAFIEKHWTFLLGNRKKTSTWWSTVAGC
LSVGSPMYFRSGAQEFGEPGWWKLVHNKPPTMKPEGEKLSASTLKIKASKPTLDPIITVE
GLRKRASRNPVESAMELKEKRSRTQEAKDIRRAQKEAAGFLDRSTSSTPVKFISRGRRPD
VILEKGEVIDFSSLSSSDRTPLTSPSPSPSLDFSAPGTPASHSATPSLLSEADLIPDVMP
PQALFHDDDEMEGDGVIDPGMEYVPPPAGSVASGPVVGVRKKVRGPEQIKQEVESEEEKP
DRMDIDSEDTDSNTSLQTRAREKRKPQLEKDTKPKEPRYTPVSIYEEKLLLKRLEACPGA
VAMTPEARRLKRKLIVRQAKRDRGLPLFDLDQVVNAALLLVDGIYGAKEGGISRLPAGQA
TYRTTCQDFRILDRYQTSLPSRKGFRHQTTKFLYRLVGSEDMAVDQSIVSPYTSRILKPY
IRSDPHWTPEPDAPLDYCYVRPNHIPTINSMCQEFFWPGIDLSECLQYPDFSVVVLYKKV
IIAFGFMVPDVKYNEAYISFLFVHPEWRRAGIATFMIYHLIQTCMGKDVTLHVSASNPAM
LLYQKFGFKTEEYVLDFYDKYYPLESTECKHAFFLRLRR

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_3|2280_bp
atggatagtagcatccacctgagtagtctgatcagtcggcatgatgacgaagccacgaga
acatcgacctcagaaggactggaggaaggtgaagtggagggagagacgctcctgatcgtc
gaatccgaggatcaggcatcagtggacttatcgcacgaccagagtggggattccctcaac
agtgatgaaggagacgtgtcttggatggaggagcagctgtcctacttctgtgacaagtgc
caaaaatggataccagccagtcagctgagggaacagctcagttaccttaagggtgataat
ttttttaggtttacttgttcggattgctcagcagatggcaaggagcagtatgaaaggctg
aagctgacatggcagcaagtcgtcatgttggcaatgtacaacttgtctctggaaggaagt
ggacgtcaaggttatttcaggtggaaagaagatatctgtgcttttattgagaaacattgg
acttttttactagggaataggaaaaagacgtctacctggtggagcaccgtggcaggttgc
ctcagcgtgggaagtcccatgtacttccgttcaggtgctcaggaatttggagagccagga
tggtggaaacttgttcataacaagcccccaacgatgaaacctgaaggagagaagttgtct
gcctctactttgaaaataaaagcctcaaaaccaactttagatcccatcattactgttgag
ggacttagaaaacgagcaagtcggaatcctgtggaatctgccatggaattaaaagagaaa
aggtctcgaactcaggaagcaaaagacattagaagagcccagaaggaggccgctggcttt
cttgacaggagcacatcttctacccctgtaaaattcataagccgaggccgcaggccagat
gtgattctggaaaaaggcgaagtgattgacttttcctccttgagctcctctgaccgcacc
ccgctgacaagcccatctccttctccttctctggatttctctgcccctggtacacctgcc
tctcattctgccacacctagcttgctttcagaagcagatctgattccagatgtgatgccc
ccacaagccttgtttcatgatgacgatgagatggaaggcgatggagtcatagacccaggg
atggagtacgtcccaccccctgctgggtcagtagcttctgggccagtggttggggtcaga
aagaaggtcagaggccctgaacagataaagcaggaggtagagagtgaggaggaaaaaccc
gacaggatggatattgacagtgaagacacagattcaaacacatctttgcaaacaagggct
agagaaaagaggaagcctcagctggagaaggacacaaagccgaaagagcccaggtatact
cccgtgagcatctacgaggaaaagctgctgctcaagaggctggaagcttgtcccggtgct
gttgccatgactccggaagctcggagactgaaacgcaaactgattgtcagacaagcgaaa
agggataggggattaccactttttgacttggatcaagttgttaatgctgctcttttgtta
gttgacgggatttatggagccaaagaaggaggaatttccagacttccagctggacaagcc
acgtacagaaccacctgtcaggacttcagaatccttgaccgataccagacttccttgccg
tccaggaagggatttcgacaccagaccaccaagtttttgtatcgcttggtaggatcagaa
gatatggctgtggaccagagtattgtcagcccttatacctctcggatcttgaaaccttat
atcaggagcgaccctcactggacgccggagcccgacgcacctctcgattactgttatgtg
cggccaaatcacatcccaacgatcaactccatgtgtcaggagtttttttggcctggcatt
gacctgtctgagtgtctgcagtacccagacttcagtgttgttgttctttataaaaaagtc
atcattgcctttggcttcatggttcctgatgtgaaatacaatgaagcttacatttcattt
ctgttcgtccaccctgaatggagaagagcagggattgcaactttcatgatctatcatctg
attcagacctgcatgggcaaggacgtaacccttcacgtctcagcaagcaaccccgctatg
ctactgtaccagaagtttggattcaagactgaagaatatgtattagatttctatgataaa
tattacccattggagagtacagagtgtaaacacgcattctttctgaggctccggcgctga

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_4|171_aa
MILEFGRFLVDVETAQTACSELSIMPCSRLPCRHLVGHGKLRLQGHPGPDEESFLCFPYD
SNLLTERSTSKTPRWCSSTWPACRQEVMEFMLSWKLMFQLKKRNNLSTLSDDKLQECFPS
PQYKKVICIGAKENGLPLEYKKKLKAVGPNDRTGKVSEEIEDIIKKESRTH

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_4|516_bp
atgatacttgaatttggaaggttcctggtagacgtagaaacggcccagactgcctgttca
gagctaagcatcatgccctgctccagactcccgtgcaggcatctagtgggacacggcaaa
ctcaggctgcaaggacatccgggcccagatgaggagagttttctgtgcttcccctacgac
agcaacctgctgactgagagatccacctccaaaacccctcggtggtgttcttctacatgg
cccgcctgcaggcaagaagtgatggaatttatgttgtcatggaagttaatgtttcaactc
aagaagagaaataacctatcgacattatctgatgacaaattacaagagtgtttcccatcc
ccacagtataaaaaggttatttgcataggtgcaaaagaaaatggtttgccactggagtac
aaaaagaagttaaaagctgtgggaccaaatgaccgtacaggaaaggtctcagaagaaatt
gaagacatcatcaaaaaggaatcacgaactcattag

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_5|220_aa
MRGVWRERRELEPGLRTVLAGQLEFRVGVGLAGPALGAASRRCRLRAMRGLEPRPMAAEA
PSPIDHPRAEECRRTARDWQAAPPAALVLDPLGEASWAPESAVTYAVKVCSFTPEASEIA
NPPGGANNSRRAPLRAVTLTAKGCSFTPEPARPRTHQKEETTNTSEHQKEQTLDTPPLRT
VALTTRVRGFILKVSETKNPPIPDTMPSPVRRYINMMFPQ

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_5|663_bp
atgaggggagtgtggagggagagacgcgagctggaaccagggctgcgcacggtgctggcg
ggccagctggagttccgggtgggcgtgggcttggcgggccccgcacttggagcagccagt
cggcgctgccggctccgggcaatgaggggcttagaacccaggccaatggctgcggaggcg
cccagtcccatcgaccacccaagggctgaggagtgcaggcgcacggcgcgggactggcag
gcagctccacctgctgccctggtgctggatccactgggtgaagccagctgggctcctgag
tctgctgtaacatacgccgtgaaggtctgcagcttcacccctgaagccagcgagatcgca
aacccaccggggggagcgaacaactccagacgcgcgcccttaagagctgtaacactcact
gcgaagggctgcagcttcactcctgagccagcgagaccacgaacccaccagaaggaagaa
actacgaacacatccgaacatcagaaggaacaaactctggacacgccgcctttaagaact
gtagcactcaccacgagggtccgcggcttcattcttaaagtcagtgagaccaagaaccca
ccaattccggacacaatgccatcaccagtaagaagatacatcaacatgatgttccctcag
tag

>gi568815578f:18042661_18287456|GENSCAN_predicted_peptide_6|302_aa
MATLPKVIYRCNAIPIKLPMAFFTELENTTLKFIWNQKRAHIAKSILSQKNKAGGIMLPD
FKLYYKATVTKRAWYWYQNRDINQWNRTEPSEIIPHIYNHLIFDKPDKNKKWGKESLFNK
WCWENWLTICRKLKLDPFLTSYTKINSRWIKDLHVRPKTIKTLEENLGNAIQDTGIGKDL
MSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIY
KELKEIYKKQKQPHQKVGEGYEQTLLKRRHLCSQQTHEKMLIITGHQKNANQNHNEIPSH
TS

>gi568815578f:18042661_18287456|GENSCAN_predicted_CDS_6|909_bp
atggccacactgcccaaggtaatttatagatgcaatgccatccccatcaagctaccaatg
gctttcttcacggaattggaaaacactactttaaagttcatatggaaccaaaaaagagcc
cacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac
ttcaaactatactacaaggctacagtaaccaaaagagcatggtactggtaccaaaacaga
gatataaaccaatggaacagaacagagccctcagaaataataccacacatctacaaccat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggagtccctatttaataaa
tggtgctgggaaaactggctaaccatatgtagaaagctgaaactggatcccttccttaca
tcttatacaaaaattaattcaagatggattaaagacttacatgttagaccaaaaaccata
aaaaccctagaagaaaacctaggcaatgccattcaggacacaggcataggcaaggactta
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatttaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca
gaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctac
aaagaactcaaagaaatttacaagaaacaaaaacaaccccatcaaaaagtgggtgaagga
tatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaaaaatg
ctcatcatcactggccatcagaaaaatgcaaatcaaaaccacaatgagataccatctcac
accagttag