GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:12:08 Sequence gi568815596r:168771434_168989519 : 218086 bp : 39.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11119 11207 89 0 2 103 92 29 0.565 4.96 1.02 Intr + 11612 11699 88 2 1 96 14 53 0.253 -2.25 1.03 Term + 16919 17107 189 1 0 49 49 242 0.752 12.87 1.04 PlyA + 18800 18805 6 1.05 2.03 PlyA - 19569 19564 6 1.05 2.02 Term - 31267 30997 271 0 1 84 50 149 0.363 4.67 2.01 Init - 36612 36458 155 0 2 64 30 149 0.631 6.20 2.00 Prom - 40066 40027 40 -5.35 3.00 Prom + 40387 40426 40 -4.95 3.01 Init + 49020 49030 11 2 2 56 116 0 0.184 -0.48 3.02 Intr + 53201 53284 84 2 0 68 115 48 0.490 3.52 3.03 Intr + 56725 56787 63 1 0 88 70 50 0.332 0.11 3.04 Intr + 56987 57068 82 2 1 90 70 59 0.732 3.02 3.05 Intr + 62794 62892 99 0 0 93 121 97 0.746 12.89 3.06 Intr + 71559 71684 126 2 0 23 111 62 0.460 1.96 3.07 Intr + 72776 72910 135 1 0 34 52 98 0.494 0.64 3.08 Intr + 79651 79749 99 0 0 97 79 9 0.410 0.29 3.09 Intr + 79846 79971 126 0 0 83 59 135 0.878 10.06 3.10 Intr + 83919 84027 109 2 1 83 93 86 0.931 7.54 3.11 Intr + 85257 85345 89 1 2 68 84 66 0.947 2.97 3.12 Intr + 85894 85954 61 0 1 64 70 90 0.760 2.19 3.13 Intr + 90527 90616 90 0 0 45 90 60 0.446 0.95 3.14 Term + 93401 93537 137 0 2 86 41 117 0.719 4.00 3.15 PlyA + 93602 93607 6 1.05 4.00 Prom + 95720 95759 40 -3.65 4.01 Init + 97178 97357 180 1 0 70 41 161 0.670 8.83 4.02 Term + 97646 98269 624 1 0 -23 42 297 0.460 7.40 4.03 PlyA + 98689 98694 6 1.05 5.09 PlyA - 99485 99480 6 1.05 5.08 Term - 100122 99998 125 1 2 117 38 121 0.994 7.57 5.07 Intr - 102250 102152 99 2 0 99 95 19 0.879 2.86 5.06 Intr - 104743 104639 105 2 0 62 115 100 0.994 9.37 5.05 Intr - 105951 105805 147 1 0 65 106 132 0.956 11.99 5.04 Intr - 118100 117954 147 0 0 64 82 128 0.969 9.09 5.03 Intr - 118549 118302 248 0 2 22 88 52 0.720 -5.22 5.02 Intr - 119253 118885 369 2 0 47 116 189 0.294 10.90 5.01 Init - 120803 120661 143 2 2 80 92 29 0.553 2.15 5.00 Prom - 126883 126844 40 -4.35 6.00 Prom + 129825 129864 40 -2.85 6.01 Init + 136137 136355 219 2 0 65 43 176 0.232 9.58 6.02 Term + 141579 141896 318 2 0 20 38 278 0.693 9.90 6.03 PlyA + 142025 142030 6 1.05 7.18 PlyA - 142751 142746 6 1.05 7.17 Term - 152389 152189 201 1 0 113 46 287 0.994 23.41 7.16 Intr - 153370 153224 147 1 0 73 115 171 0.974 17.81 7.15 Intr - 155929 155723 207 1 0 116 105 179 0.991 20.75 7.14 Intr - 159429 159232 198 0 0 96 91 201 0.987 19.83 7.13 Intr - 161100 160944 157 2 1 108 106 133 0.997 16.19 7.12 Intr - 163992 163751 242 0 2 78 94 237 0.669 18.73 7.11 Intr - 165000 164797 204 0 0 98 89 98 0.672 9.37 7.10 Intr - 173333 173172 162 2 0 109 72 140 0.998 13.75 7.09 Intr - 186695 186531 165 2 0 70 98 155 0.975 13.94 7.08 Intr - 192875 192773 103 1 1 117 77 176 0.999 18.66 7.07 Intr - 198118 197913 206 1 2 73 -12 210 0.634 6.58 7.06 Intr - 198782 198612 171 2 0 78 76 214 0.983 18.42 7.05 Intr - 200617 200414 204 1 0 78 80 290 0.996 25.67 7.04 Intr - 202407 202282 126 0 0 120 64 100 0.991 10.76 7.03 Intr - 205254 205144 111 0 0 90 113 126 0.987 14.96 7.02 Intr - 208546 208433 114 1 0 95 75 106 0.997 9.72 7.01 Intr - 214851 214677 175 2 1 101 82 47 0.531 4.42 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 117858 117785 74 1 2 71 48 81 0.874 -0.51 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_1|121_aa MVSRKGSLPTDRKKPETGYQQLPNKISGVGQILSSEKARIEGAAGLYRFADSNRTNKIRD RAEQISLLGEGITVARSGMMGAKLGMEMSAQEDSAAGRIVAQGGGEEESTQGGSEELEEE A >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_1|366_bp atggtgtcaaggaaaggcagtctcccaacagatagaaaaaaacctgaaactggttatcag cagcttcccaataagatctcaggagttggtcaaattctttcttctgagaaggcaagaatt gagggtgctgcaggcctgtacagatttgccgacagtaacagaaccaataaaatacgtgac agagcagagcagatatctctgctaggtgagggaatcactgtggcccggagtggaatgatg ggagccaaattgggcatggagatgtctgcccaggaggacagtgcagctggcagaatcgta gcccaaggtgggggtgaagaggaatccacacaaggtgggagtgaggagttggaggaggaa gcatga >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_2|141_aa MLKVYKPVIENSIAFSGSGEACGKGIHIQPLKVRGCWTKAQSMASIDPWATRHTNHTELL TGQSVSGSLMLKCVWLLSFHLRSSILDTVIKVGCVLDFRDSKGLEYVSPCQTFQGGSPSP GLSSRLCPEKVKNCSGSRRPV >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_2|426_bp atgctgaaagtttataagccagtgattgaaaactcaattgccttcagtggcagtggtgaa gcctgtggcaaaggcattcacattcaacctcttaaagtaaggggctgctggaccaaagca cagtctatggcttccattgacccatgggctaccaggcacacaaaccacactgagctactc accggacaatctgtcagtgggtccctcatgttgaaatgtgtctggcttttgtcctttcac ctacgctccagcattctggacactgtaatcaaggttgggtgtgtcctggacttccgggat tcaaagggcctagagtacgtcagtccttgtcaaacattccaaggaggaagtcccagccct ggactttcatcccgtttgtgtcccgagaaagtaaagaactgctctgggtcaagacggcca gtgtga >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_3|436_aa MSQRANLEISYAKGLQKLASKLSKALQNTRKSCVSSAWAWASEGMKSTADLHQKLGKAIE LEAIKPTYQVLNVQEKKRKSAKKKLMVSTKKHEALFQLVESSKQSMTEKEKRKLLNKLTK STEKLEKEDENYYQKNMAGYSTRLKWENTLENCYQKWSPWWEYNRTARNSHAAYARPLVK QYNPRLYINMTKLNNYQFWRSILELEKERIQLLCNNLNQYSQHISLFGQTLTTCHTQIHC AISKIDIEKDIQAVMEETAILSTENKSEFLLTDYFEEDPNSAMDKERRKSLLKPKLLRLQ RDIEKASKDKEGLERMLKTYSSTSSFSDAKSQKDTAALMDEGYRRITGERKDVVAEGIGQ SAPGAAQLSSRLCKALYSFQARQDDELNLEKGDIVIIHEKKEGGWWFGSLNGKKGHFPAA YVEELPSNAGNTATKA >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_3|1311_bp atgtcacaaagggcaaacctggaaattagctatgccaaaggacttcagaaactggcaagc aagctgagcaaagcattacagaacacgagaaaaagttgtgttagcagtgcctgggcctgg gcctcagagggaatgaaatccacagcggacctgcatcaaaaacttggcaaagcaattgaa ttggaagcaataaaaccgacttatcaagtcctaaatgtacaagagaagaagagaaaatca gccaagaagaaattaatggttagtaccaagaaacatgaagcacttttccagcttgtagaa agctccaagcaatctatgactgagaaggagaagcggaagctcctcaataaactgacaaaa tcaactgaaaagttggaaaaggaagatgaaaattactaccaaaaaaacatggcgggttat tctaccagactgaaatgggaaaacacactagagaactgctaccagaaatggagcccatgg tgggaatacaacagaactgccagaaacagccacgcagcttatgcacggcccctggtgaag cagtacaaccccaggctctacataaatatgaccaaattaaataactatcaattctggaga agcattctggagctggagaaggaaagaattcaacttttatgcaataacttaaaccagtac agccaacatatttctctttttggccaaaccctgaccacatgccacacgcagattcactgt gccatcagcaagattgacattgaaaaagatatccaggctgtaatggaagaaactgcaatt ttatctacagaaaacaaatctgagttcctgttaacggattactttgaagaagatcctaac agtgcaatggataaagagagacgaaagtctttactaaaaccaaaattattgagactgcag agagacattgaaaaagcctcaaaagacaaggaaggcctggaacgaatgcttaaaacgtac tccagcacctcctccttctctgatgcaaagagccagaaagacacagcagcgttaatggat gagggttatcggagaataacaggggaaaggaaagacgtggttgcagagggaatagggcag tctgcccctggtgcagcccagctcagcagcagactttgcaaggccttgtattcttttcaa gccaggcaagatgatgagttgaatttggaaaagggtgacattgtgattatacacgagaaa aaagaaggaggatggtggtttggatctttgaatgggaaaaaaggccattttcctgccgct tatgtggaggagttaccttcaaatgctggcaacacagctacaaaggcataa >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_4|267_aa MDKFLDTYTLPRLNKEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEEL HINRTKDKNHIIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIIWN GQKLEAFPLKTGTRQGCPLLPLLFNIVLEVLARAIRQKKEIKGIQLGKEELKLSLFADDM IVYLENPIVSAQNLLKLISNFSKLSGCKINVQKSQAFLYTNNRQTESQIMSELPFTIASK RIKYLGIQLTRDVKDLFKENYNHCSVK >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_4|804_bp atggataaattcctcgacacatacactctcccaagactaaacaaggaagaagttgaatct ctgaataggccaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggaggagctg catataaacagaaccaaagacaaaaaccacataattatctcaatagatgcggaaaaggcc tttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatggg atgtatctcaaaataataagagctatctatgacaaacccacagccaatatcatatggaat ggacaaaaactggaagcattccctttgaaaaccggcacaagacagggatgccctctccta ccactcctattcaacatagtgttggaagttctggccagggcaatcaggcagaagaaggaa ataaagggcattcaattaggaaaagaggaactcaaattgtccctgtttgcagatgacatg attgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaactctcaggatgcaaaatcaatgtgcaaaaatcacaagcatttttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaac tacaaccactgctcagtgaaataa >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_5|460_aa MWHLSIPHPAETQPWVTDRISQRRQCSAQYLAQNRNSINPGWFEMKGINLYLKPINDLSW ASPSGAINTFGSQRRLWLLFRKELRIPECPVPPFSPRESKGEFNQSCQGEPALSANQKAS LRLEAGGGKCTRFEIGKLAGLRELSLESGCWLEWARIWCGEGGGTQACLRSPYFTVNFYC SEVEERDRIDVWGLILWEGAGHLAILMISVQKGLELDQGFQKSTVLLVRKWTQGHSACEC PVQLHLPTNKLLGALSYIMVEDELALFDKSINEFWNKFKSTDTSCQMAGLRDTYKDSIKA FAEISRQNKLIQEKKDNLLKLIAEVKGKKQELEVLTANIQDLKEEYSRKKETISTANKAN AERLKRLQKSADLYKDRLGLEIRKIYGEKLQFIFTNIDPKNPESPFMFSLHLNEARDYEV SDSAPHLEGLAEFQENVRKTNNFSAFLANVRKAFTATVYN >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_5|1383_bp atgtggcatctttctattccccaccctgcagaaacccaaccctgggtcacagaccgtata tcccagagaaggcagtgctcagctcagtacctggcacaaaatagaaactcaataaaccct ggttggtttgaaatgaaaggtataaacctttatttgaagccgataaatgacttgtcgtgg gcgagcccttcgggtgctattaatactttcgggtcacagcggagactctggctgttgttc agaaaagaactacgaatcccagaatgccctgttcccccattttcccctcgagaatccaag ggtgagttcaaccaatcctgtcaaggagagcctgcgctatcggccaatcagaaggccagc ctgcgcctggaggcgggtggcgggaagtgcactaggtttgaaatcggaaagttggcgggg ctgcgggagctgagcctagagtccggctgttggctagagtgggcgcggatctggtgtggg gaaggcggcgggactcaggcctgcctgcgaagtccctattttactgttaacttttactgt tctgaggtggaggagagagacaggatagatgtgtggggtcttattctatgggagggtgcc ggccacctggctatcctgatgatctctgttcagaaagggttagagttggatcaggggttt cagaagtcaacagttttacttgtgagaaaatggacccagggtcatagtgcgtgtgagtgt cctgtgcagttacacttacctactaacaagctgctaggagcattgtcctacataatggta gaggacgaactggcacttttcgataaaagcataaatgaattttggaataaattcaaaagt acggacacctcctgtcagatggcgggactaagagatacctacaaggattccatcaaagca tttgcagagatcagcaggcaaaataagctcattcaagaaaaaaaggataacttgttaaaa ttgattgctgaagtaaaaggcaaaaagcaggaattggaagtactgactgcaaatatccag gatcttaaggaagaatattctaggaagaaggaaactatttctactgctaataaagcgaat gcagagaggttgaaaaggctgcagaaatctgcagacttgtataaagatcgacttggacta gaaattcgaaaaatttatggtgagaaattgcagtttattttcactaatattgaccctaag aatcctgagagcccatttatgttttccttacatctcaatgaagcaagggactatgaagtg tcagatagtgcccctcatcttgagggcctagcagaatttcaagagaatgtaaggaagacc aacaatttttcagcttttcttgccaatgttcggaaagcttttactgccacggtttataat taa >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_6|178_aa MLVAEAFEHTPGIQTASLGTYLKTNLFLFLFAVGFYLLLRVLNIDLLWSVPIAKKWCANP DWIHIDTTPFAGLRGEERRRWQHSSPFLALAKWKLKRPPWEDLGALQGEQAICVPGKETQ HYRRQRRASLLSHSPTPVRQLSRSHSNSCSKSWLFPGFRLCLMNSMDRIVICAHTSCC >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_6|537_bp atgctggtggcagaggcctttgaacacactccaggcatccaaacggccagtctgggcaca tacctgaagaccaacctctttctcttcctgtttgcagttggcttttacctgcttcttagg gtgctcaacattgacctgctgtggtccgtgcccatagccaaaaagtggtgtgctaacccc gactggatccacattgacaccacgccttttgctggactccgcggggaggaacgtaggcgc tggcagcacagcagcccgtttctggcgctggccaagtggaaactgaagcggcctccctgg gaggatctgggtgcgctgcagggggagcaggccatctgtgtgcctggaaaagaaacccag cactaccgaagacaaagaagagcttctttactctcgcactcacctacccctgtacgtcaa ctgagccgatcacactctaattcctgttcaaagagttggctttttccaggattcaggctt tgtctgatgaacagcatggaccggattgtcatctgtgctcatacatcctgctgctaa >gi568815596r:168771434_168989519|GENSCAN_predicted_peptide_7|964_aa XYEKNLVFAQRWGIRKGIVMGFFTGFVWCLIFLCYALAFWYGSTLVLDEGEYTPGTLVQI FLSVIVGALNLGNASPCLEAFATGRAAATSIFETIDRKPIIDCMSEDGYKLDRIKGEIEF HNVTFHYPSRPEVKILNDLNMVIKPGEMTALVGPSGAGKSTALQLIQRFYDPCEGMVTVD GHDIRSLNIQWLRDQIGIVEQEPVLFSTTIAENIRYGREDATMEDIVQAAKEANAYNFIM DLPQQFDTLVGEGGGQMSGGQKQRVAIARALIRNPKILLLDMATSALDNESEAMVQEVLS KIQHGHTIISVAHRLSTVRAADTIIGFEHGTAVERGTHEELLERKGVYFTLVTLQSQGNQ ALNEEDIKGKASIRQRSKSQLSYLVHEPPLAVVDHKSTYEEDRKDKDIPVQEEVEPAPVR RILKFSAPEWPYMLVGSVGAAVNGTVTPLYAFLFSQILGGYAFAKSGELLTKRLRKFGFR AMLGQDIAWFDDLRNSPGALTTRLATDASQVQGAAGSQIGMIVNSFTNVTVAMIIAFSFS WKLSLVILCFFPFLALSGATQTRMLTGFASRDKQALEMVGQITNEALSNIRTVAGIGKER RFIEALETELEKPFKTAIQKANIYGFCFAFAQCIMFIANSASYRYGGYLISNEGLHFSYV FRVISAVVLSATALGRAFSYTPSYAKAKISAARFFQLLDRQPPISVYNTAGEKWDNFQGK IDFVDCKFTYPSRPDSQVLNGLSVSISPGQTLAFVGSSGCGKSTSIQLLERFYDPDQGKV MIDGHDSKKVNVQFLRSNIGIVSQEPVLFACSIMDNIKYGDNTKEIPMERVIAAAKQAQL HDFVMSLPEKYETNVGSQGSQLSRGEKQRIAIARAIVRDPKILLLDEATSALDTESEKTV QVALDKAREGRTCIVIAHRLSTIQNADIIAVMAQGVVIEKGTHEELMAQKGAYYKLVTTG SPIS >gi568815596r:168771434_168989519|GENSCAN_predicted_CDS_7|2895_bp nngtatgagaaaaatcttgtgttcgcccagcgttggggaattagaaaaggaatagtgatg ggattctttactggattcgtgtggtgtctcatctttttgtgttatgcactggccttctgg tacggctccacacttgtcctggatgaaggagaatatacaccaggaacccttgtccagatt ttcctcagtgtcatagtaggagctttaaatcttggcaatgcctctccttgtttggaagcc tttgcaactggacgtgcagcagccaccagcatttttgagacaatagacaggaaacccatc attgactgcatgtcagaagatggttacaagttggatcgaatcaagggtgaaattgaattc cataatgtgaccttccattatccttccagaccagaggtgaagattctaaatgacctcaac atggtcattaaaccaggggaaatgacagctctggtaggacccagtggagctggaaaaagt acagcactgcaactcattcagcgattctatgacccctgtgaaggaatggtgaccgtggat ggccatgacattcgctctcttaacattcagtggcttagagatcagattgggatagtggag caagagccagttctgttctctaccaccattgcagaaaatattcgctatggcagagaagat gcaacaatggaagacatagtccaagctgccaaggaggccaatgcctacaacttcatcatg gacctgccacagcaatttgacacccttgttggagaaggaggaggccagatgagtggtggc cagaaacaaagggtagctatcgccagagccctcatccgaaatcccaagattctgcttttg gacatggccacctcagctctggacaatgagagtgaagccatggtgcaagaagtgctgagt aagattcagcatgggcacacaatcatttcagttgctcatcgcttgtctacggtcagagct gcagataccatcattggttttgaacatggcactgcagtggaaagagggacccatgaagaa ttactggaaaggaaaggtgtttacttcactctagtgactttgcaaagccagggaaatcaa gctcttaatgaagaggacataaagggcaaggcttccatccggcaacgctccaagtctcag ctttcttacctggtgcacgaacctccattagctgttgtagatcataagtctacctatgaa gaagatagaaaggacaaggacattcctgtgcaggaagaagttgaacctgccccagttagg aggattctgaaattcagtgctccagaatggccctacatgctggtagggtctgtgggtgca gctgtgaacgggacagtcacacccttgtatgcctttttattcagccagattcttggggga tatgcctttgctaaatctggggagctcctaacaaaaaggctacgtaaatttggtttcagg gcaatgctggggcaagatattgcctggtttgatgacctcagaaatagccctggagcattg acaacaagacttgctacagatgcttcccaagttcaaggggctgccggctctcagatcggg atgatagtcaattccttcactaacgtcactgtggccatgatcattgccttctcctttagc tggaagctgagcctggtcatcttgtgcttcttccccttcttggctttatcaggagccaca cagaccaggatgttgacaggatttgcctctcgagataagcaggccctggagatggtggga cagattacaaatgaagccctcagtaacatccgcactgttgctggaattggaaaggagagg cggttcattgaagcacttgagactgagctggagaagcccttcaagacagccattcagaaa gccaatatttacggattctgctttgcctttgcccagtgcatcatgtttattgcgaattct gcttcctacagatatggaggttacttaatctccaatgaggggctccatttcagctatgtg ttcagggtgatctctgcagttgtactgagtgcaacagctcttggaagagccttctcttac accccaagttatgcaaaagctaaaatatcagctgcacgcttttttcaactgctggaccga caacccccaatcagtgtatacaatactgcaggtgaaaaatgggacaacttccaggggaag attgattttgttgattgtaaatttacatatccttctcgacctgactcgcaagttctgaat ggtctctcagtgtcgattagtccagggcagacactggcgtttgttgggagcagtggatgt ggcaaaagcactagcattcagctgttggaacgtttctatgatcctgatcaagggaaggtg atgatagatggtcatgacagcaaaaaagtaaatgtccagttcctccgctcaaacattgga attgtttcccaggaaccagtgttgtttgcctgtagcataatggacaatatcaagtatgga gacaacaccaaagaaattcccatggaaagagtcatagcagctgcaaaacaggctcagctg catgattttgtcatgtcactcccagagaaatatgaaactaacgttgggtcccaggggtct caactctctagaggggagaaacaacgcattgctattgctcgggccattgtacgagatcct aaaatcttgctactagatgaagccacttctgccttagacacagaaagtgaaaagacggtg caggttgctctagacaaagccagagagggtcggacctgcattgtcattgcccatcgcttg tccaccatccagaacgcggatatcattgctgtcatggcacagggggtggtgattgaaaag gggacccatgaagaactgatggcccaaaaaggagcctactacaaactagtcaccactgga tcccccatcagttga