GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:38:11 Sequence gi568815594r:155828333_156053847 : 225515 bp : 36.95% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 906 901 6 1.05 1.10 Term - 2493 2314 180 0 0 25 38 178 0.711 3.13 1.09 Intr - 3583 3514 70 0 1 113 101 27 0.671 4.77 1.08 Intr - 8525 8357 169 0 1 107 91 57 0.965 5.98 1.07 Intr - 12282 12240 43 0 1 24 50 92 0.425 -4.11 1.06 Intr - 15498 15349 150 0 0 38 94 136 0.871 8.64 1.05 Intr - 15966 15814 153 0 0 74 76 137 0.955 10.45 1.04 Intr - 25917 25745 173 1 2 23 92 109 0.719 3.54 1.03 Intr - 26545 26418 128 0 2 77 80 118 0.729 9.30 1.02 Intr - 35422 35116 307 2 1 7 98 222 0.664 9.68 1.01 Init - 37894 37855 40 1 1 50 113 23 0.661 1.40 1.00 Prom - 41628 41589 40 -6.05 2.02 PlyA - 42224 42219 6 1.05 2.01 Sngl - 50689 50354 336 1 0 64 41 242 0.715 12.88 2.00 Prom - 52760 52721 40 -8.15 3.00 Prom + 53185 53224 40 -7.75 3.01 Init + 54510 54526 17 2 2 90 57 8 0.099 -2.22 3.02 Term + 64010 64295 286 2 1 98 43 223 0.706 12.69 3.03 PlyA + 64366 64371 6 1.05 4.00 Prom + 65364 65403 40 -6.25 4.01 Init + 67425 67491 67 2 1 26 86 72 0.004 2.09 4.02 Intr + 75712 75791 80 2 2 21 103 126 0.005 5.75 4.03 Intr + 76735 76825 91 0 1 36 115 44 0.889 0.55 4.04 Intr + 79390 79460 71 2 2 69 103 51 0.855 2.68 4.05 Intr + 80555 80682 128 1 2 115 69 157 0.999 15.06 4.06 Intr + 81693 81879 187 0 1 51 64 145 0.865 7.17 4.07 Intr + 85991 86106 116 1 2 67 68 165 0.792 10.73 4.08 Intr + 89063 89142 80 2 2 119 95 60 0.913 8.08 4.09 Intr + 89817 89907 91 1 1 16 92 71 0.725 -1.57 4.10 Term + 91505 91658 154 2 1 71 38 121 0.766 1.81 4.11 PlyA + 92048 92053 6 1.05 5.11 PlyA - 92337 92332 6 1.05 5.10 Term - 97233 97082 152 1 2 51 44 109 0.473 -0.11 5.09 Intr - 100096 100004 93 2 0 75 103 57 0.749 4.92 5.08 Intr - 101373 101210 164 2 2 63 111 30 0.487 1.50 5.07 Intr - 104335 104183 153 0 0 13 92 102 0.246 1.37 5.06 Intr - 106231 106097 135 0 0 120 -11 80 0.112 0.06 5.05 Intr - 109151 109030 122 2 2 73 103 64 0.983 4.77 5.04 Intr - 111206 111039 168 2 0 77 57 115 0.917 6.52 5.03 Intr - 114124 113985 140 2 2 57 88 44 0.497 0.56 5.02 Intr - 114932 114824 109 2 1 97 75 39 0.542 2.44 5.01 Init - 125515 125381 135 1 0 100 53 273 0.464 23.19 5.00 Prom - 138870 138831 40 -2.35 6.00 Prom + 144178 144217 40 -2.45 6.01 Init + 152832 152893 62 2 2 84 94 45 0.673 5.47 6.02 Intr + 153235 153293 59 1 2 87 115 52 0.544 5.41 6.03 Term + 181684 181838 155 2 2 80 49 112 0.354 3.60 6.04 PlyA + 182025 182030 6 1.05 7.00 Prom + 182668 182707 40 -8.25 7.01 Sngl + 185500 185784 285 0 0 68 48 840 0.969 73.29 7.02 PlyA + 187148 187153 6 1.05 8.03 PlyA - 187309 187304 6 1.05 8.02 Term - 190248 190174 75 0 0 89 43 82 0.973 0.66 8.01 Init - 191407 191279 129 1 0 70 76 132 0.990 10.40 8.00 Prom - 205286 205247 40 -0.45 9.00 Prom + 208850 208889 40 -7.35 9.01 Init + 210658 210748 91 0 1 50 52 134 0.450 6.70 9.02 Intr + 210909 211056 148 1 1 48 96 101 0.902 5.27 9.03 Term + 211077 211287 211 0 1 18 49 122 0.812 -2.92 9.04 PlyA + 211773 211778 6 1.05 10.04 PlyA - 213290 213285 6 1.05 10.03 Term - 216474 216365 110 1 2 68 53 99 0.484 2.09 10.02 Intr - 219485 219472 14 1 2 106 94 -2 0.368 -4.29 10.01 Init - 221191 220122 1070 1 2 48 53 285 0.264 14.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 67231 67695 465 0 0 10 38 247 0.805 7.79 S.002 Intr + 75686 75791 106 2 1 74 103 131 0.990 12.50 S.003 Sngl - 165366 165172 195 0 0 37 49 182 0.855 4.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_1|470_aa MEQTEKSKVYAENGLLEKIKLCLSKKPLPSPTERKKFDHDFAISTSFHGIHNIVQNRSKI RRVLWLVVVLGSVSLVTWQIYIRLLNYFTWPTTTSIEVQYVEKMEFPAVTFCNLNRSHSY LTQYLKKLKEKVTKELEELWSWLLPDTSKLSQQMDCSVVLHLQEITANSTGSREATDFAA SHQNFSIVEFIRNKGFYLNNSTLLDCEFFGKPCSPKRMNLRPPNEKQWKIRDPWRKEFLE FSNRPIGLGYKVPKLVPSTDRLAAAGQEAFTDNPALGFVDAGIIFVIHSPKKVPQFDGLG LLSPVGMHARVTIRQVKLVQRTPKNEQGQLGDHIEFKDLCTVGTHNSSCPVSCEEIEYPA TISYSSFPSQKALKYLSKKLNQSRKYIRENLVKIEINYSDLNYKITQQQKAHCWSNQIQR QKVKEWLSWAEERKDEELEFNAYRVSVREDENILQIDKDYGMAAKQCERT >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_1|1413_bp atggagcagacagaaaaatcaaaagtatatgctgagaacggactcttagaaaagataaag ctttgcctttcaaagaaaccactgccatctcccactgagcgaaagaagtttgaccatgac tttgccatctccacttcctttcatgggatacacaatattgttcagaaccggagcaaaatt cgcagggtgctctggttggtggtggttctgggctcagtctcacttgtgacatggcagatc tacattcgcttgctcaactacttcacatggccaaccacaacgtccattgaggttcaatat gtggaaaagatggagttcccagctgtgacattttgtaatttgaacaggtcgcactcttac ctaacccagtatttgaagaagctgaaggaaaaggtcacaaaggagctggaggagctgtgg tcctggctgctgcccgatactagcaagttgagtcagcagatggattgcagtgtggtcctc catcttcaagaaattactgccaattccactggctctagagaggctactgattttgctgca agtcaccaaaacttcagcattgtggaatttatcaggaacaaaggtttttatctcaacaat agcactttgttggactgtgagttttttggaaagccatgtagcccaaagcgaatgaacctc agacccccaaatgaaaaacaatggaagatcagggatccctggaggaaagagttcctggag ttcagcaatcgtcctattggtttgggctataaggtgcccaagctggtaccaagcactgat aggctagctgctgcaggccaggaggcattcactgataacccagcccttggtttcgttgat gctgggatcatctttgttatccattcaccaaagaaggtgccacagtttgatgggttaggc ttgttgtcacctgtgggaatgcacgcaagggtaaccatccgccaagtgaagttagttcag cgaacgcccaagaatgaacaaggacagcttggagaccacattgaatttaaggatttatgt acagtaggaacacataactctagctgccccgtttcttgtgaagaaatagaatacccggcc actatttcttattcctcttttccaagtcaaaaagctttgaaatatctttccaagaagttg aatcaaagccggaaatacatcagggagaatcttgtaaaaattgaaattaactatagtgac ctaaactataagataacccagcagcaaaaggcgcactgctggagtaatcagattcagaga cagaaagtaaaagagtggttgtcatgggccgaggagaggaaggatgaggagttagagttt aatgcgtacagagtttcagttagggaagatgaaaacatcctgcaaatagataaagattat ggaatggctgcaaaacaatgtgaacgtacttaa >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_2|111_aa MRGSSSHRVPTGVLPSGVVRRGPKFSRPLNGRSTDSLHCVPEKAADTQCQPVKAARKGTA PAKPQGQRCPKTMGTGLLHQYNVTVRHESKEIILELYNLTALLDFGLAWGL >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_2|336_bp atgaggggttccagctcccacagagtccccactggagtattgcctagtggagttgtgaga agagggccaaaattctccagacccctgaatggtagatccactgacagcttgcactgtgta cctgaaaaagctgcagacactcaatgccagcctgtaaaagcagccaggaaggggacagcc cctgcaaagccacaggggcagaggtgccccaagaccatgggaaccggcctcttgcatcag tacaacgtgactgtgagacatgagtcaaaggagatcattttggagctttacaatttgact gccttgctggattttggacttgcatggggcctgtag >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_3|100_aa MEVKQRGRLTPLMARYLSETKLPEEQLGSNICCSAIFAVLQPPLLISRQTGSGVDLQQTP TDLQLRVLTVRRKTNKQKGHPHQDKNYMMNAQASVANSIN >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_3|303_bp atggaggtgaagcagaggggcagattgacacctctcatggccaggtacctctctgagaca aagcttccagaggaacaactgggcagcaacatttgctgttcagcaatattcgctgttctg cagcctccgctgctgatatccaggcaaacagggtctggagtggacctccagcaaactcca acagacctgcagctgagggtcctgactgtcagaaggaaaactaacaaacagaaaggacat ccacaccaagataagaactacatgatgaatgcacaagcttcagtagccaattcaatcaac tag >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_4|354_aa MEEYSMLMDRKTQYRENGHTGQEGSEEDKSQTGVNRASKGGLIYGNYLHLEKVLNAQELQ SETKGNKIHDEHLFIITHQAYELWFKQILWELDSVREIFQNGHVRDERNMLKVVSRMHRV SVILKLLVQQFSILETMTALDFNDFREYLSPASGFQSLQFRLLENKIGVLQNMRVPYNRR HYRDNFKGEENELLLKSEQEKTLLELVEAKEESEEKEEQVAEFQKQKEVLLSLFDEKRHE HLLSKGREEPRFQVPFQLLTSLMDIDSLMTKWRYNHVCMVHRMLGSKAGTGGSSGYHYLR STVSDRYKVFVDLFNLSTYLIPRHWIPKMNPTIHKFLYTAEYCDSSYFSSDESD >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_4|1065_bp atggaagaatattccatgctcatggacaggaagactcaatatcgtgaaaatggccatact ggccaagaaggcagcgaagaagacaaatcacaaactggtgtgaatagagccagcaaagga ggtcttatctatgggaactacctgcatttggaaaaagttttgaatgcacaagaactgcaa agtgaaacaaaaggaaataaaatccatgatgaacatctttttatcataactcatcaagct tatgaactctggtttaagcaaatcctctgggagttggattctgttcgagagatctttcag aatggccatgtcagagatgaaaggaacatgcttaaggttgtttctcggatgcaccgagtg tcagtgatcctgaaactgctggtgcagcagttttccattctggagacgatgacagccttg gacttcaatgacttcagagagtacttatctccagcatcaggcttccagagtttgcaattc cgactattagaaaacaagataggtgttcttcagaacatgagagtcccttataacagaaga cattatcgtgataacttcaaaggagaagaaaatgaactgctacttaaatctgagcaggaa aagacacttctggaattagtggaggctaaagaagagtctgaagaaaaagaggaacaggtg gctgaatttcagaagcaaaaagaggtgctactgtccttatttgatgagaaacgtcatgaa catctccttagtaaaggcagggaagagcctaggttccaggtgccttttcagttgctgact tctcttatggacatagattcactgatgaccaaatggagatataaccatgtgtgcatggtg cacagaatgctgggcagcaaagctggcaccggtggttcctcaggctatcactacctgcga tcaactgtgagtgataggtacaaggtatttgtagatttatttaatctttcaacatacctg attccccgacactggataccgaagatgaacccaaccattcacaaatttctatatacagca gaatactgtgatagctcctacttcagcagtgatgaatcagattaa >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_5|456_aa MDVRALPWLPWLLWLLCRGGGDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPS ENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVHMSIPNVSLPLRFDWRDKQVV TQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALN WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFRVQPPSWLLSRSGIEC LWLFQVHSASCQWIYHSGVWRTVALFSQLLAIPSKLVIVVFPDSLVEGASGDGNGDWACF PGSRRKRAQGKNCRISLLQVGDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHCSS GEANHAVLITGFDKTGSTPYWIVRNSWGSSWGVDGYAHVKMGSNVCVSYISSPLRWRIYR KDFKHQLLYLNKWCLPFEQQSDHVEGRKRAKSLQLN >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_5|1371_bp atggacgtgcgggcgctgccgtggctgccgtggctgctgtggctgctgtgccggggcggc ggcgatgcggactcccgcgcccccttcaccccgacctggccgcggagccgcgagcgtgaa gccgccgccttccgggaaagtcttaatagacatcgatacttgaattctttatttcccagt gaaaactccaccgccttctatggaataaatcagttttcctatttgtttcctgaagagttt aaagccatttatttaagaagcaaaccttccaagtttcccagatactcagcagaagtacat atgtccatccccaatgtgtctttgccgttaagatttgactggagggacaagcaggttgtg acacaagtgagaaaccagcagatgtgtggaggatgctgggccttcagcgtggtgggggca gtggaatctgcttatgcaataaaggggaagcccctggaagacctaagtgtccagcaggtc attgactgttcgtataataattatggctgcaatggaggctctactctcaatgctttgaac tggttaaacaagatgcaagtaaaactggtgaaagattcagaatatccttttaaagcacaa aatggtctgtgccattacttttctggttcacattctggattttcaatcaaaggttattct gcatatgacttcagggtacagcctccctcctggctgctttcacggtctggcattgagtgt ctgtggcttttccaggtgcatagtgcaagctgtcagtggatctaccattctggggtctgg aggacagtggccctcttctcgcagctccttgccatcccatccaagttggtgattgtcgtc ttccctgattcattagttgagggtgcatctggggatggaaatggggactgggcttgcttt cctggctctaggaggaaaagggctcagggaaagaactgcaggatctcattgttgcaagtt ggtgaccaagaagatgaaatggcaaaagcacttcttacctttggccctttggtagtcata gtagatgcagtgagctggcaagattatctgggaggcattatacagcatcactgctctagt ggagaagcaaatcatgcagttctcataactgggtttgataaaacaggaagcactccatat tggattgtgcggaattcctggggaagttcttggggagtagatggttatgcccatgtcaaa atgggaagtaatgtttgtgttagttatattagcagccctctgagatggcgtatctatcgg aaggatttcaaacaccaattgctttacctgaacaaatggtgcttaccctttgaacagcag agtgaccacgtagaaggaaggaaaagggcaaaatcgcttcagttaaactga >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_6|91_aa MRVKNVWGDWKNRHEATAMVSSSAVSVSVCRPRQFFECGPALLLQDIFILRRAALTDQKI MPQTTSATSLLRPMIPLEVKALFPVAQVEIS >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_6|276_bp atgcgtgtcaagaatgtatggggagactggaagaacagacatgaagctactgctatggtg agctcatcagctgtcagtgttagtgtttgtagaccaagacaattctttgaatgtggccca gctttgcttcttcaggatattttcattctgcgaagagctgctctcactgaccagaaaata atgccacaaacaacttcagcaacatccttactacggcccatgatcccactggaagtgaaa gcattgtttcctgttgctcaagtagaaatatcttaa >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_7|94_aa MPLHSSLGNKSETPSQKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEE GEEEEEEEEEEEEEEEEEEEEEEEEGKKWEISRK >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_7|285_bp atgccattgcactccagcctgggcaacaagagcgaaactccatctcaaaaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa ggagaagaagaagaagaagaggaggaggaggaggaggaggaggaggaagaggaagaggaa gaggaagaggaagagggaaaaaagtgggaaataagtagaaaataa >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_8|67_aa MSDSQRWLRIQAQIPSSVKDKREKEVEDSYGKVSRKSTLNTNKKASVMASYSAVVFIPTV IREQSAR >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_8|204_bp atgagtgactctcagaggtggcttagaattcaggctcaaataccatcttcagtcaaagac aaaagagagaaggaagtggaggacagttatggaaaggtgtccaggaaaagtaccctaaac accaataagaaagccagtgtgatggcctcctatagtgctgttgtgtttattccaacagtc atacgagaacaatcagccagataa >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_9|149_aa MPPPPEQVLVSVAEETHKQCTSQDSMQATPEPTQMRRNQKTNSGDMTKQGSIVPHKNHTN SPEMDPNQEIPDLPEKELRREAPVKGKAQCKEIQKIIQEVKGEIFNEIDSINKKQAKLQE TIETLIEMQNALERPSNKIEKNRRKKFRA >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_9|450_bp atgccacctccaccggaacaggtgctggtatccgtggctgaagagacccacaaacagtgc acatcacaggactctatgcaggcaacccctgagccaacccaaatgagaagaaaccagaaa accaactctggtgatatgacaaaacaaggttctatagtaccccacaaaaatcacactaac tcaccagaaatggatccaaaccaggaaatccctgatttacctgaaaaagaactcagaagg gaggcaccagtgaaaggcaaagcccagtgtaaagaaatccaaaaaataatacaagaagtg aagggagaaatattcaatgaaatagatagcataaataaaaaacaagcaaaacttcaggaa acaatagagacacttatagaaatgcaaaatgctctggaaagacccagcaataaaattgaa aaaaatagaagaaagaaattcagagcttga >gi568815594r:155828333_156053847|GENSCAN_predicted_peptide_10|397_aa MAGSNGISSSRSLRNRHTEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKL PMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQ NRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAIWRKLKLDPF LTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWD LIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKW AKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRYRQ KAGYFLPSSIGLQVLQLLDSWAYTSGLPGALGSLATD >gi568815594r:155828333_156053847|GENSCAN_predicted_CDS_10|1194_bp atggctgggtcaaatggtatttctagttctagatccctgaggaatcgccacactgaaata aaagaggacacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatc atgaaaatggccatactgcccaaggtaatctacagattcaatgccatccccatcaagcta ccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaa agagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacacta cctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaa aacagagatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctac aactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctattt aataaatggtgctgggaaaactggctagccatatggagaaagctgaaactggatcccttc cttacaccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaa accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaag gacttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa cctacaacatgggagaaaattttcgcaacgtactcatctgacaaagggctaatatccaga atctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgg gcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatg aagaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatat catctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagatatcgccaa aaagctgggtacttcctgccctcgagcattggacttcaagttcttcagcttttggactct tgggcttacaccagtggtttgccaggggctcttgggtctttggccacagactga