GENSCAN 1.0	Date run: 6-Nov-116	Time: 01:44:32

Sequence gi568815594r:44074793_44548523 : 473731 bp : 35.84% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 PlyA -    775    770    6                               1.05
 1.11 Term -  16356  16252  105  0  0   64   55   114 0.343   3.13
 1.10 Intr -  16872  16796   77  1  2   75   57    46 0.278  -1.48
 1.09 Intr -  24403  24231  173  0  2   16   82   121 0.673   2.96
 1.08 Intr -  25906  25808   99  0  0   86   74    97 0.420   6.41
 1.07 Intr -  28386  28246  141  2  0   60   78   105 0.460   5.25
 1.06 Intr -  39194  39077  118  0  1   66   54   100 0.015   3.00
 1.05 Intr -  55628  55548   81  0  0   93  113    24 0.361   4.09
 1.04 Intr -  57012  56954   59  2  2   91   77    65 0.207   3.31
 1.03 Intr -  72195  72050  146  0  2   33   37   112 0.031  -1.24
 1.02 Intr -  72332  72260   73  1  1   44   86   105 0.623   4.39
 1.01 Init -  73515  73469   47  0  2   64   89    24 0.435   0.31
 1.00 Prom -  77291  77252   40                              -8.65

 2.00 Prom +  77920  77959   40                              -5.65
 2.01 Init +  81628  81846  219  0  0   83   72   170 0.901  13.58
 2.02 Intr +  94110  94215  106  2  1   55   95    75 0.199   3.77
 2.03 Term +  97829  97968  140  0  2   87   35    66 0.335  -1.66
 2.04 PlyA +  99134  99139    6                               1.05

 3.06 PlyA -  99504  99499    6                               1.05
 3.05 Term - 100458  99998  461  1  2  132   39   363 0.999  30.07
 3.04 Intr - 109355 109294   62  1  2   66   89    39 0.027  -0.74
 3.03 Intr - 111635 111405  231  1  0   63   62   161 0.015   6.97
 3.02 Intr - 147967 147786  182  1  2   59   86   154 0.102  10.04
 3.01 Init - 153271 153260   12  1  0   68   91     4 0.031  -0.93
 3.00 Prom - 160424 160385   40                              -3.85

 4.00 Prom + 173020 173059   40                              -5.45
 4.01 Init + 190914 191029  116  2  2   49   44   148 0.752   6.23
 4.02 Term + 193113 193623  511  0  1   15   48   188 0.125   0.36
 4.03 PlyA + 193773 193778    6                               1.05

 5.00 Prom + 194630 194669   40                              -5.65
 5.01 Init + 194684 194808  125  1  2   71   91   106 0.770   8.89
 5.02 Term + 210531 210705  175  0  1   73   38    87 0.112  -1.55
 5.03 PlyA + 210826 210831    6                               1.05

 6.03 PlyA - 211372 211367    6                               1.05
 6.02 Term - 227958 226634 1325  1  2   67   38   353 0.211  18.92
 6.01 Init - 274849 274798   52  1  1   85   87    38 0.446   4.77
 6.00 Prom - 283592 283553   40                              -3.25

 7.00 Prom + 312957 312996   40                              -2.05
 7.01 Init + 340832 340851   20  1  2   99   82     5 0.328   0.61
 7.02 Term + 340929 341127  199  0  1   11   54   171 0.267   1.89
 7.03 PlyA + 341991 341996    6                               1.05

 8.02 PlyA - 342504 342499    6                               1.05
 8.01 Sngl - 373677 372613 1065  0  0   76   42  1599 0.482 148.98
 8.00 Prom - 379442 379403   40                              -7.65

 9.00 Prom + 385349 385388   40                              -5.15
 9.01 Init + 399611 399916  306  1  0   70   87   159 0.435  11.34
 9.02 Intr + 410959 411274  316  0  1   67  110   284 0.272  23.21
 9.03 Term + 411971 412320  350  0  2   56   46   171 0.593   3.36
 9.04 PlyA + 412881 412886    6                               1.05

10.04 PlyA - 414428 414423    6                               1.05
10.03 Term - 426908 426449  460  1  1   38   54   309 0.140  16.08
10.02 Intr - 449582 449337  246  1  0   97   43   152 0.373   7.25
10.01 Init - 460787 460726   62  2  2   51   77   109 0.226   6.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_1|372_aa
MTVQRIKAKSVKIALELYPRSPNPPIPQKVTVFEDKAFRKKRKLGQRQRQREEPVKAQGE
DSCLQAKERGRTRNNPADTFILDFQPTELCLRRPPALQGKWKEKQKEGALMCRKAHISMR
KCFLTVPQVALSLAEGSSVQRQRDSVCLGKSKGTEQESLPGDRGEFPGADPRPPREVNIE
IQELQRTPVKYYTRRSSPRHTIIRISKDELMEKNLKAAREKGLSCYTMIRSFLSLPQKPT
GWPASYFLYCEPVELRSNLDHFITQYSKINTRRIKDLNVKPKTIKTLEDNVGNTILDVRP
GKDFMIKMLKRLAKDLEYAILYSGNFLIVTLVLQDQEESFGWELAPPQAPWVHTDGHKQK
AQSQSTLLQSWV
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_1|1119_bp
atgactgtacaaagaataaaagcaaaatctgtcaaaatagctctagaattgtatccacga
agtcctaacccccccatacctcagaaggtgaccgtatttgaagataaggcctttagaaag
aagaggaaattaggacagaggcagagacagagggaagaacctgtgaaggcacagggagaa
gatagctgtctgcaagctaaggagagaggccgtacaagaaacaaccctgcagacaccttc
attttggacttccagcctacagaactatgtctgagaagaccaccagcccttcagggaaaa
tggaaagagaaacaaaaagaaggagcattgatgtgcagaaaggcccacatctccatgagg
aaatgttttctgacagtcccacaggtggccctctcccttgcagaaggcagctcagtacag
agacagagagactctgtttgtttggggaaaagtaagggaacagaacaagagtctttacct
ggtgatcgaggggaatttcctggagctgacccaagaccaccaagagaggtcaacattgaa
attcaggaactgcagagaacccctgtgaaatactacacaagaagatcatccccaagacac
acaatcatcagaatctccaaggacgaactgatggaaaaaaatttaaaggcagctagagag
aagggcttatcctgctacaccatgattcgaagtttcctgagtcttccccagaagccaacc
ggatggccagcatcatacttcctgtactgtgagcctgtggaactaagatcaaatctggac
cacttcattacgcaatattcaaaaatcaacacaagacgtatcaaagacttaaatgtaaaa
cctaaaactataaaaactctggaagataatgtaggaaataccattctggacgtaagacct
ggcaaagatttcatgataaagatgctaaaaaggctggcaaaggatcttgaatatgcaatt
ctctactctggaaattttctcatcgttacactggttttgcaagatcaggaggagtcattt
ggatgggagttagctccacctcaagctccatgggtgcacactgatggacataagcaaaag
gcacaatcccaatctacgctgctacagtcatgggtttga
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_2|154_aa
MAEGKRHILHGSRQENESQAEGENPYKIVRSCETYSLPREEYGGNHCHDSIISIGSLPQH
VGIMKAAIQDEIWEDHFSSSMKDGPEGERLKAETSVVIITVAEVENDEVGLCVSRGTLVQ
ITFPRLLRQRAPTYIGKREGADVKLECGKSGKLE
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_2|465_bp
atggcagaaggcaaaaggcacatcttacatggcagcagacaagagaatgagagccaagca
gaaggggaaaacccttataaaatcgtcagatcctgtgagacttattcactaccacgagaa
gagtatgggggaaatcactgccatgattcaattatctccattgggtccctcccacaacat
gtgggaattatgaaagctgcaattcaagatgagatttgggaagatcacttctctagcagt
atgaaggatggaccggaaggggaaagattgaaggcagagaccagtgtagtcattattact
gtagctgaagtggaaaatgatgaagttggactttgtgtctcaaggggaacccttgtacaa
attacatttcctaggctgcttagacaacgggctcctacttatattgggaagagggaaggt
gctgatgtaaaattagaatgtgggaaaagtggaaaactagagtaa
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_3|315_aa
MTAMASGVSLRLVQKLEFPEFPLCIVPDTALDTGNNLPEITKVDVKLQPHCFYSLKDDAG
QQALRKIGLHVGLEHECKFLLSVGSSSLQMDGEPAGVCSGKVFFAWSKATQFSSDCFHQI
PLSRRHIAPPVDGLLASASACRYICTCHPKPELLDIRDCGCAGPPQKIVSPKQEHEDRKH
DKVTDKGSESGTSCNELSTSSCDSHSEASTPQDNPSSAQQATAHQPNTLTLDRPSKKAPV
QWIPPPDKRRNSELFQTLISKSRETNLSKKKVCEKLSVEEEMKKCIQDFKKIHIPDYFPE
RKRQWQSELLQKYGL
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_3|948_bp
atgactgcaatggcaagtggagtgtctttgcgcttggtccaaaaactagaattcccagaa
tttcctctgtgtatagttccagacacagccttagacacaggaaataatttgcctgagatt
acaaaggtggacgtgaagttgcagccacattgtttttactctctgaaagatgatgcagga
cagcaggcactcagaaaaattgggttacatgtgggcttggagcatgagtgcaagttttta
ctgagtgttggaagtagctctctgcagatggatggggagccagcaggggtatgcagtgga
aaggtgttttttgcctggagtaaggcgacgcaattctcctctgactgcttccaccaaatt
cccctcagccgccgccacatcgctcccccagttgatggcctgctggcgtctgccagtgcc
tgtcgctacatctgcacttgtcaccccaagcctgaactacttgatatcagggactgtggt
tgtgcaggaccacctcagaaaatagtatcacctaaacaagaacatgaagataggaaacat
gacaaagtcactgataaaggaagtgaaagtgggacttcctgtaatgagctctccacttcc
agttgtgacagccattcagaggcaagcactccccaggacaacccatccagtgcccagcag
gcaacagctcaccaacctaacactttaacattggatcgcccctctaaaaaagcacctgta
caatggatacccccaccagacaaacgcagaaacagtgaactctttcagaccctcatcagc
aagtcccgggaaacaaatctgtccaaaaagaaagtctgtgagaagctaagtgtggaagaa
gaaatgaaaaagtgtattcaggattttaaaaaaatccacattccagattattttccagag
cgcaaacgccaatggcaatctgaactgttgcagaagtatgggttatag
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_4|208_aa
MELKAKARELREECRSLRSRRNQLEERVSAMEDEMNEMNLPTKKSPGPDGFTAEFHQRYK
EELVPFLLKLFQSIEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNK
ILANRIQQHIKKLIHHNQVGFIPGMQGWFNICKSINVIQHINRTKDKNHMIISIDAEKAF
DKIQQPFMLKTLKKLGIDGMYLKIIRAI
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_4|627_bp
atggagctgaaagccaaggctcgagaactacgtgaagaatgcagaagcctcaggagccga
cgcaatcaactggaagaaagggtatcagccatggaagatgaaatgaatgaaatgaactta
ccaaccaaaaagagtccaggaccagatggattcacagcagaattccaccagaggtacaag
gaggaactggtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctc
cctaactcattttatgaagccagcatcatcctgattccaaagcctggcagagacacaacc
aaaaaagagaatttcagaccaatttccttgatgaacattgatgcaaaaatcctcaataaa
atactggcaaaccgaatccagcaacacatcaaaaagcttatccaccataatcaagtgggc
ttcatccctgggatgcaaggctggttcaatatatgcaaatcaataaatgtaatccagcat
ataaacagaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggccttt
gacaaaattcaacaacccttcatgctaaaaactctcaagaaattaggtattgatgggatg
tatctcaaaataataagagctatctag
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_5|99_aa
MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRDMDEAGNHHSQQTITSSE
NQTPYVLTHKWESNNKNTWTQGGEHHTLGPVGGRGVGEG
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_5|300_bp
atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacagggacatggatgaagctggaaaccatcattctcagcaaactatcacaagttcagaa
aaccaaacaccatatgttctcactcataagtgggagtcgaacaacaagaacacatggaca
cagggaggggaacatcacacactggggcctgtagggggtcggggtgtaggggagggataa
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_6|458_aa
MEMHLLALPSGKNLLVSLLEVLARAIRQKKEIKGIQLGKEEVKLSLFADDMIVYLENPIV
SAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTMASKRIKYLGIQL
KRDVKDLFKENYKPLLNEIKEVTKKWKNIPCSWVGRINIVKMAILPKVIYRFNAIRIKLP
MTFFTELEKTTLKFIWNQKRARNPKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQN
RDIDQWNKTEPSEITPHIYNYLIFGKAEQNKQWGKDSLFNKWCWENWLAICRKLKLDPFL
TPYTKINSRWIKDLNVRTKTIETLEENPGITIQDIGMGKDFMSKTPKAMATKAKIDKWDL
IKLKSFCTAKETTIRVNRQPMKWEKIFTTYSSDKGLISRIYNELKQIYKKKRNNPIKKWA
KDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTMR
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_6|1377_bp
atggaaatgcacctcttagctctcccttctggaaagaacctgctggtgagcttgttggaa
gttctggccagggcaattaggcagaagaaggaaataaagggtattcaattaggaaaagag
gaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaaccccattgtc
tcagcccaaaatctccttaagctgataagtaacttcagcaaagtctcaggatacaaaatc
aatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatc
atgagtgaactcccattcacaatggcttcaaagagaataaaatacttaggaatccaactt
aaaagggacgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaa
gaggttacaaagaagtggaagaacattccatgctcatgggtaggaagaatcaatattgtg
aaaatggccatcctgcccaaggtaatttatagattcaatgccatccgcatcaagctacca
atgactttcttcacagaattggaaaaaactactttaaagttcatatggaatcaaaaaaga
gcccgcaatcccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacct
gacttcaaactatactacaaggctacagtaacaaaaacagcatggtactggtaccaaaac
agagatatagatcaatggaacaaaacagagccctcagaaataacgccacatatctacaac
tatctgatctttggcaaagctgagcaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaatgttagaactaaaacc
atagaaaccctagaagaaaacccaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccattagagtgaacaggcaacct
atgaaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaagaaacaaccccatcaaaaagtgggca
aaggatatgaacagacacttctcaaaagaagacatttatgcagccaaaagacatatgaaa
aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataa
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_7|72_aa
MAKRTSRQTECKSEGGLAALPWISKDVLESLNAQAEACYRGEAPTKDCTRAMSRENVGGP
TESPPGDCQVEL
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_7|219_bp
atggctaaaaggacatcaaggcaaacagaatgcaaaagtgaaggtggcttggcagcactc
ccctggatttcaaaagacgtgctggaaagtctaaatgcccaggcagaagcctgttacagg
ggtgaagcacccacaaaagactgtaccagggcaatgtcaagggaaaatgtgggtggaccc
acagagtccccaccaggggactgtcaggtggagctgtga
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_8|354_aa
MVSSSSSPGASAAAAPGPCAPSPFPEVVELNVGGQVYVTKHSTLLSVPDSTLASMFSPSS
PRGGARRRGELPRDSRARFFIDRDGFLFRYVLDYLRDKQLALPEHFPEKERLLREAEYFQ
LTDLVKLLSPKVTKQNSLNDEGCQSDLEDNVSQGSSDALLLRGAAAAVPSGPGAHGGGGG
GGAQDKRSGFLTLGYRGSYTTVRDNQADAKFRRVARIMVCGRIALAKEVFGDTLNESRDP
DRQPEKYTSRFYLKFTYLEQAFDRLSEAGFHMVACNSSGTAAFVNQYRDDKIWSSYTEYI
FFRKFAAPAFPSTPRPLLSRRQFEPRWRPRGGVRGRRWLTPYRERLSLHDSLMQ
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_8|1065_bp
atggtttcctcgtccagctcgcccggcgcgtcggccgccgccgccccggggccctgcgca
ccctcgcccttccctgaagtagtggagctgaacgtaggcggccaggtttatgtgaccaag
cactcgacgctgctcagcgtcccggacagtactttggccagcatgttctcgccctctagt
ccccgtggcggcgcccggcgccggggcgagctgcccagggacagccgggcgcgcttcttc
atcgaccgggacggcttccttttcaggtacgtgctggattatctgcgggacaagcaactc
gcgctgccggagcacttccccgagaaggagcggctgctgcgcgaggccgagtatttccag
ctcaccgacttggtcaagctgctgtcgcccaaggtcaccaagcagaactctctcaacgac
gagggctgccagagcgacctggaggacaacgtctcgcagggtagcagcgacgcgctgctg
ctgcgcggggcggcggccgccgtgccctcgggcccgggagcgcacggtggtggcggcggc
ggcggcgcgcaggacaagcgctcgggcttcctcacgctgggctaccggggctcctacacc
accgtgcgcgacaaccaggccgacgccaaattccggcgtgtggcgcgcatcatggtgtgc
gggcgcatcgcgctggccaaggaggtcttcggggacacgctcaacgagagccgcgacccc
gaccggcagccggagaagtacacgtcccgcttctacctcaagttcacctacttggagcag
gcctttgatcgcctgtccgaggccggcttccacatggtggcgtgtaactcctcgggcacc
gccgccttcgtcaaccagtaccgcgacgacaagatctggagcagctacaccgagtacatt
ttcttccgtaagttcgcagccccggcgtttcccagcacccctcgccccctgctgagccgc
cgccagtttgagccccgctggaggccccgcgggggtgtccggggcaggcgctggttaact
ccttatagagagcggctctccctgcacgattcccttatgcagtag
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_9|323_aa
MPPDGETPPSRFDRHLIQESSGWHLAGAPLGRSFQRKEQVVIFAVLQPPLVIPRQTGSGV
DHQQTPADLQQRSLTVRRKTNKQKGITSTSTKRTSTPKSHLKLGLEPDNWCSQQYPEVSE
PLAPNDPKSAMWNPSWVMLHSGSCAARMGPSGNMGGGGQVPRDHGEGAEVLGSTQYQEGE
CLCWQSWKGVFDRATGSACSVPTGCSTVPLASWLLSSPVCGAPHTRSNYCHGGPQGSRRP
LLGLRAPHHEQGEGPLSQGAPSCDKKGPQQVTCTQMWIDLILAGVDQEKIGRQPSEVLLT
LWRQLSLEQQFRKMPEREKDNVA
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_9|972_bp
atgcctcctgacggggagacacctcccagcaggtttgacagacatctcatacaggagagc
tctggctggcatctggcaggtgcccctctgggacgaagtttccagaggaaggaacaagta
gtaatctttgctgttcttcagcctccgctggtaatacccaggcaaacagggtctggagtg
gaccaccagcaaactccagcagacctgcagcagaggagcctgactgttagaaggaaaact
aacaaacagaaaggaataacttcaacatcaacaaaaaggacatccacaccaaaatcccat
ctgaagctcgggttggagcccgataattggtgtagtcagcagtatcctgaggtgagtgaa
cccttggcccccaatgatcccaagtcagccatgtggaacccctcatgggttatgctacac
agtggcagctgtgctgctaggatgggccccagtgggaacatgggcggtggtggacaggtc
cccagagaccatggagaaggtgctgaagtacttggaagcacacagtaccaagaaggagag
tgcctttgctggcagagttggaagggtgtttttgacagggctacaggaagtgcttgctca
gtccctacgggatgcagcacagtgcctttggcttcctggttgctgtcctcacctgtatgt
ggagcaccacatacacgaagtaactactgccatggtggccctcagggaagcagaaggcca
ttgctgggactgagggctccacaccatgaacaaggggaaggacccctttctcagggggct
ccctcatgtgacaaaaaggggccccaacaagtgacatgcacacagatgtggattgatttg
attttggctggggtcgaccaggagaaaattggtaggcaacccagtgaagtactattaact
ttatggaggcaattgtccctggagcagcaattccggaaaatgcctgaaagggagaaggac
aatgttgcttga
>gi568815594r:44074793_44548523|GENSCAN_predicted_peptide_10|255_aa
MLKEGPGEVEKLQAPLLPQTRENYEAYVLHGYQEFTSRMKIQWLTVHKRFYQLPSLPVSL
LYRVTRVFQDHTPNKLLTFGFLSQILLLGKSNKDDKECYNVHRQTGSGVDLQQTPTDLQL
RVLTVRRKTNKQDIHTKTPTVCYHHQRPKVDKTTKMGRNQSRKAENCKNQSTSSPPKECS
SSPAKKQSWKENDFDELREEGFRRLVIINLSELKENVRNHCKEAENLEKRLDKWLTRINS
REKTLNDLMELKTMA
>gi568815594r:44074793_44548523|GENSCAN_predicted_CDS_10|768_bp
atgctgaaggagggtccaggtgaagtggagaagttgcaggcccccctgctgccccagacc
agggaaaactatgaggcatatgttctgcatggttatcaagagttcaccagcaggatgaag
atccagtggctcacagtacacaagaggttttaccagcttccttcccttcctgtctcactt
ctttaccgtgttacccgtgtattccaggatcacactccaaataaactgcttacatttgga
ttcttatctcagattctacttctggggaagtcaaacaaagatgacaaggaatgctataat
gtgcataggcaaacagggtctggagtggacctccagcaaacaccaacagacctgcagctg
agggtcctgactgttagaaggaaaactaacaaacaggacatccacaccaaaaccccaact
gtatgttaccatcatcaaagaccaaaggtagataaaaccacaaagatggggagaaaccag
agcagaaaagctgaaaattgtaaaaatcagagcacctcttctcctccaaaggaatgcagc
tcctcaccagcaaagaaacaaagctggaaggagaatgactttgatgagttgagagaagaa
ggcttccgacgattggtaataataaacttatctgagctaaaggagaatgttcgaaaccat
tgcaaagaagctgaaaaccttgagaaaagattagacaaatggctaactagaataaacagc
agagagaagaccttaaatgacttgatggagctgaaaaccatggcatga