GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:55:38 Sequence gi568815590r:116545009_116855797 : 310789 bp : 39.71% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 774 813 40 -3.85 1.01 Init + 1769 1946 178 1 1 37 43 118 0.398 1.67 1.02 Intr + 3736 3939 204 2 0 44 97 59 0.487 0.65 1.03 Term + 4426 4529 104 2 2 68 43 98 0.777 0.76 1.04 PlyA + 4531 4536 6 1.05 2.03 PlyA - 5301 5296 6 1.05 2.02 Term - 25230 25028 203 1 2 106 48 159 0.832 10.27 2.01 Init - 53651 53510 142 2 1 76 85 116 0.850 10.69 2.00 Prom - 61369 61330 40 -4.65 3.05 PlyA - 61403 61398 6 1.05 3.04 Term - 63559 63402 158 2 2 60 48 162 0.670 6.51 3.03 Intr - 65589 65413 177 1 0 46 13 130 0.196 0.27 3.02 Intr - 66078 66033 46 0 1 98 68 19 0.593 -2.04 3.01 Init - 67031 66930 102 2 0 56 105 90 0.876 7.79 3.00 Prom - 69160 69121 40 -8.15 4.19 PlyA - 69334 69329 6 1.05 4.18 Term - 71329 71059 271 0 1 52 53 208 0.172 7.67 4.17 Intr - 79168 79084 85 2 1 22 103 110 0.069 3.86 4.16 Intr - 81797 81556 242 1 2 114 18 180 0.484 9.87 4.15 Intr - 83628 83580 49 1 1 42 105 66 0.856 0.52 4.14 Intr - 86509 86312 198 2 0 24 55 201 0.553 8.80 4.13 Intr - 89570 89434 137 1 2 60 52 73 0.611 0.19 4.12 Intr - 91839 91667 173 0 2 8 87 142 0.143 3.92 4.11 Intr - 96321 96118 204 0 0 89 52 133 0.344 8.27 4.10 Intr - 99635 99478 158 0 2 60 77 50 0.866 -0.09 4.09 Intr - 101595 101463 133 0 1 55 72 159 0.979 10.30 4.08 Intr - 103918 103798 121 0 1 50 77 91 0.965 3.78 4.07 Intr - 110997 110848 150 2 0 63 111 119 0.883 10.06 4.06 Intr - 112306 112207 100 2 1 16 94 59 0.670 -2.45 4.05 Intr - 113972 113805 168 0 0 49 49 150 0.005 6.20 4.04 Intr - 181164 181008 157 0 1 96 116 177 0.961 19.96 4.03 Intr - 182270 182138 133 1 1 71 23 88 0.533 0.33 4.02 Intr - 193681 193605 77 1 2 57 86 77 0.268 1.79 4.01 Init - 210789 210658 132 0 0 102 70 147 0.592 14.81 4.00 Prom - 212012 211973 40 -6.65 5.00 Prom + 215422 215461 40 -6.45 5.01 Init + 221596 221783 188 0 2 79 86 257 0.093 23.18 5.02 Intr + 225184 225358 175 1 1 87 59 76 0.514 3.62 5.03 Term + 226448 226834 387 1 0 97 41 225 0.603 12.65 5.04 PlyA + 226901 226906 6 1.05 6.06 PlyA - 227479 227474 6 1.05 6.05 Term - 247163 246972 192 2 0 119 49 121 0.510 7.74 6.04 Intr - 251664 251522 143 1 2 41 62 107 0.011 2.55 6.03 Intr - 261738 261550 189 1 0 66 44 120 0.479 4.04 6.02 Intr - 269138 269024 115 2 1 35 75 103 0.911 2.80 6.01 Init - 269985 269911 75 0 0 54 95 128 0.991 11.24 6.00 Prom - 280791 280752 40 -2.95 7.00 Prom + 285911 285950 40 -5.95 7.01 Init + 292672 292732 61 0 1 53 54 104 0.659 4.96 7.02 Term + 293718 293989 272 1 2 89 47 177 0.949 8.36 7.03 PlyA + 294047 294052 6 1.05 8.07 PlyA - 297697 297692 6 1.05 8.06 Term - 302683 302492 192 1 0 76 49 256 0.773 16.94 8.05 Intr - 304021 303938 84 1 0 52 72 76 0.721 1.50 8.04 Intr - 305759 305610 150 2 0 32 82 210 0.713 14.24 8.03 Intr - 307088 306940 149 0 2 49 95 177 0.985 13.53 8.02 Intr - 307700 307541 160 2 1 88 22 195 0.982 11.54 8.01 Intr - 309460 309237 224 2 2 74 85 185 0.809 13.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 169349 169290 60 2 0 66 57 89 0.809 4.90 S.002 Term - 221797 221377 421 0 1 77 49 446 0.845 33.38 S.003 Term - 283552 283433 120 1 0 81 50 158 0.930 8.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_1|161_aa MTGEASGNLQSWQKVKGKQGMSYMAARERDREPAKGKCSNRERQRATKGEVPHFETISSL QSDRISGSTLKTLVMPQFNDAFFRTKKDDSHRLYFPVEDSHHWLHLYCYAYVLSINYAML LVEIHLIKDTATGAILEAETRPSPDTKPFSTFILDFQSPQL >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_1|486_bp atgactggggaagcctcaggaaacttacaatcatggcagaaggtgaaggggaagcaagga atgtcttacatggcggcaagagagagagacagagagccagcaaaggggaagtgcagcaac agagagagacagagagcgaccaaaggggaagtgccacattttgaaaccatcagctctctg caaagtgacagaatatctggcagtaccttaaaaaccttggtgatgccacaattcaatgat gcatttttcagaactaaaaaggatgactcccacagattgtactttccagtggaagatagt catcactggctgcatttatattgctatgcttacgtattaagcataaattatgctatgctg ttggttgaaattcatctgataaaggacacagcaacaggcgctatcttggaagcggagacc aggccttcaccagacaccaaacctttcagcacctttatcttggacttccagtctccacaa ctgtag >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_2|114_aa MVHAGVPCRRPLAEAAGAALPGRASTWRKGHRCVAYTEELHRLMQGSGEGKEGEQISKWP SFPKRSKTTPPAAQLLMSLNAAHEKHQQKLCSVLGFREKLSMWTRAFFYEKEKW >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_2|345_bp atggtccacgcaggtgtgccctgcaggaggccgctggcggaggcagctggtgctgctctt ccaggtcgtgcctctacttggaggaaagggcatcgttgtgtagcttacactgaggagctc catcggctaatgcagggctctggagaagggaaggagggagagcagatttccaaatggccc tcgtttccaaaaaggagcaaaacgactcctcctgcagctcagttgctcatgtctctcaat gctgcacatgaaaaacatcagcaaaaactctgctcagtgctggggtttcgagagaaacta tcaatgtggactagagcatttttctatgagaaggaaaaatggtga >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_3|160_aa MKQVCPQVKERGEESQVEASYVVVQPKEGPLRLLVMKDHLGLSSERILVAIQHSPEPEIM YAEHGHPVEMIKAFAWDWARWEEANPVTAGPAHLSPCVLFPHAGVHIASSVLDTEDSMMK KTDMALLSWGYILLGKRDTKPAGRYTAHIIADCDEVLGRK >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_3|483_bp atgaaacaagtttgcccacaggtgaaggagagaggggaggaaagtcaggtagaagcatcc tacgtagttgtgcagcctaaggaaggcccattacggctgctggtaatgaaggatcacctt ggcctgtcttcagaaagaattctggtagctatccagcattctccagaaccagaaatcatg tacgctgagcatggccacccagtggaaatgataaaggcctttgcttgggactgggccaga tgggaagaagcaaatcctgtaacagcaggacctgcccacctaagtccctgtgttctgttc cctcatgctggtgttcatattgccagcagtgttctagacactgaggatagcatgatgaag aagacagatatggccctgctgtcatggggttacattctactgggcaagagagacactaaa ccagcaggtcgatacacagcccacatcatagcagattgtgatgaagttctgggaaggaaa tga >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_4|895_aa MASRKEGTGSTATSSSSTAGAAGKGKGKGGSGDSAVKQVQIDGLGFSADPYCVCYKDDGK PAFPGEGLLYFVSTLAGAGLADNRIDSLLGCFFTIRVAELTCGLRLHTEVAAHPVVLKII KHYQEEGQGTEVVQGVLLGLVVEDRLEITNCFPFPQHTEDDADFDEVQYQMEMMRSLRHV NIDHLHVGWYQSTYYGSFVTRALLDSQFSYQHAIEESVVLIYDPIKTAQGSLSLKAYRLT PKLMEVCKEKDFSPEALKKANITFEYMFEEVPIVIKNSHLINVLMWELEKKSAVADKHEL LSLASSNHLGKNLQLLMDRVDEMSQDIVKYNTYMRNTSKQQQQKHQYQQRRQQENMQRQS RGEPPLPEEDLSKLFKPPQPPARMDSLLIAAPDPEALASTVNIFFLNFYTSLLPNEHPER SPGFMYLPRNSLSPDPESLGLIPAVKPSRSCHVMSLLKSKFDFYTIEYHSAIKKNEIMTF TTTRMDLEAIILSEAVGTQSIMNFGDSEGKRKCKFSGLNPDLTESRALRMKLNQAPVFTS TPGDYCLVKFALVQMLLDMKCSYIQTNALCLPLVVIDFQPPYQVNILGGGREDDYDKLRT YITPKENRNQWNPDEALEGLEVLLGGNPFFDFDEGEALIRKPDAASGRKNFGSLGAVSQE AASVRDERHSAEEASLFPFVGGCEDDRGSLAHTDEQRHLMLHRRCQAGCSKEHRNQCEFL IQRGMNEGNWSHRSRRAEKPAREVVATQRLAGSKTQLSGHRPKIEDWASVTWQEQEPHKP MGPQSGLQSGLLEYLSAHPLSGFVDSPEAGKSGSRHGKWYIQEYNVPAVIHGLGEGHGSY CLQKSAAPARSTFAEHRPPTMKSATGHWDLCHCYCRKTQGPREPAGQQQHWTDPS >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_4|2688_bp atggcgtcccgcaaggaaggtaccggctctactgccacctcttccagctccaccgccggc gcagcagggaaaggcaaaggcaaaggcggctcgggagattcagccgtgaagcaagtgcag atagatggccttggcttttcagcagatccttattgtgtctgttacaaagatgatgggaaa cctgcttttcctggtgaaggtttattgtacttcgtttccactttggcgggagcaggttta gctgacaaccgcattgattctctcttaggctgtttcttcaccatccgtgtggctgagctg acctgtgggctcaggctgcacacagaagtagctgctcaccctgtggtattaaagataatc aaacattatcaagaagaaggacaaggaactgaagttgttcaaggagtgcttttgggtctg gttgtagaagatcggcttgaaattaccaactgctttcctttccctcagcacacagaggat gatgctgactttgatgaagtccaatatcagatggaaatgatgcggagccttcgccatgta aacattgatcatcttcacgtgggctggtatcagtccacatactatggctcattcgttacc cgggcactcctggactctcagtttagttaccagcatgccattgaagaatctgtcgttctc atttatgatcccataaaaactgcccaaggatctctctcactaaaggcatacagactgact cctaaactgatggaagtttgtaaagaaaaggatttttcccctgaagcattgaaaaaagca aatatcacctttgagtacatgtttgaagaagtgccgattgtaattaaaaattcacatctg atcaatgtcctaatgtgggaacttgaaaagaagtcagctgttgcagataaacatgaattg ctcagccttgccagcagcaatcatttggggaagaatctacagttgctgatggacagagtg gatgaaatgagccaagatatagttaaatacaacacatacatgaggaatactagtaaacaa cagcagcagaaacatcagtatcagcagcgtcgccagcaggagaatatgcagcgccagagc cgaggagaacccccgctccctgaggaggacctgtccaaactcttcaaaccaccacagccg cctgccaggatggactcgctgctcattgcagctcctgaccccgaggccctggcatctact gtgaatattttttttttgaatttttatacttcgctgctcccaaatgagcaccccgagaga agtccaggcttcatgtacttgcccaggaattccttgtccccggacccggaatcacttggc ctaattccggctgttaagccctcgcgtagttgccatgtgatgagtttattaaagtcaaaa tttgatttctacaccatagaataccactctgccataaaaaagaatgaaataatgactttt acaacaactcggatggatctggaggccattattctaagtgaagctgtgggcacacagagt ataatgaactttggagactcagaagggaagagaaaatgcaaattttcaggcctcaaccca gacttaactgaatcaagagctttgaggatgaagcttaaccaagcacctgtcttcaccagc accccaggtgactattgcttggtgaagtttgctctagtacagatgctccttgacatgaag tgcagttacatccagacaaatgcattatgtctgccattggttgtgatagatttccaacct ccctaccaagttaatattttaggagggggtagggaggatgattatgacaaattacggact tacattactcctaaggaaaacagaaatcaatggaaccctgatgaagcgttagagggactc gaggtcctgctgggagggaacccgttttttgactttgacgaaggagaggctctaatcagg aaaccagatgctgcttcagggcgaaagaactttggcagccttggagctgtgagtcaggag gctgcatctgtcagagacgagcggcacagtgcagaagaggcctctctctttcctttcgtg ggtggctgtgaagatgacagaggatcacttgcacacaccgatgaacagaggcaccttatg ctccatagaaggtgtcaggcagggtgcagcaaggaacacaggaatcagtgtgagttcttg atacagaggggaatgaatgaagggaattggtcacataggtcacgaagggctgagaagcca gcaagggaagttgtggccacccagaggttagcaggcagcaagacacagttgtcaggacat cgtccgaagattgaagattgggcctcagtcacctggcaggaacaggagccacataaaccc atgggaccacagagtggcctgcagagtggcctcctggagtatttatctgcccatccgttg tctggatttgtggacagtcccgaagcgggaaaatcaggaagcagacatggcaaatggtac atccaggaatacaacgtccctgctgtgatccatgggttgggagaaggccacggcagctat tgcctccagaagagtgctgcacctgccaggtccactttcgcagaacacagaccccctacc atgaaatcagcaactggacactgggacctctgccactgttactgcaggaaaacccagggc ccccgtgaaccagccggtcaacagcagcattggacagacccaagttga >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_5|249_aa MKITRQKHAKKHLGFFRNNFGVREPYQILLDGTFCQAALRGRIQLREQLPRYLMGETQLC TTRCVLKELETLGKDLYGAKLIAQKCQVRNCPHFKNAVSGSECLLSMVEEGNPHHYFVAT QDQNLSVKVKKKPGVPLMFIIQNTMVLDKPSPKTIAFVKAVESGQLVSVHEKESIKHLKE EQGLVKNTEQSRRKKRKKISGPNPLSCLKKKKKAPDTQSSASEKKRKRKRIRNRSNPKVL SEKQNAEGE >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_5|750_bp atgaagatcacaaggcagaaacatgccaagaagcatcttggcttcttccgcaacaacttc ggagtccgcgagccgtaccagatcctgctggacggcaccttctgtcaggcggcgctgcgg ggccgcatccagctgcgggagcagctgccccgctacctcatgggggagacgcagctgtgc accacaagatgtgtgttaaaagagctagaaacattgggaaaggacttatatggggcaaaa ctgattgcacaaaaatgccaagttcgaaattgtcctcatttcaagaatgcagtgagtgga tcagaatgtctgctttccatggttgaagagggaaatcctcatcattattttgtggcaaca caggatcagaatttgtctgtgaaagtaaaaaagaagcctggagttcctctcatgtttatt attcagaacactatggttttggacaaaccttctcccaaaacaattgcctttgtaaaagca gtggagtcaggtcagcttgtctcagtgcatgagaaagaaagtatcaaacatctcaaagag gaacagggtttagtgaaaaacactgaacagagtagaagaaaaaagcgcaagaaaataagt ggtcccaatcctcttagttgtttgaagaaaaagaaaaaggcaccggacacacaatcatct gcttctgaaaagaaaagaaaaagaaaaagaattcggaacagatctaacccaaaagtactt tctgagaagcagaatgcagaaggagaatga >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_6|237_aa MRPDIYQVPTRHEDNDVEMDELGTQWCDSGKLLSISNLYVFTGNTGIMMPYSGPESSVVT TEGDPYGRVAEWTDLHTSIAPGQTGLKLVCSKGNIAPQDHFTGCILSHPTEKFLLNSVSV NPKNASYAEKSFDHVQYPFMIKALNKLNIEEMYLNTLKAIYDKSKINIIFNGEKASHSEL SLNTPQIRLSSCKGLELHVNFESTQSLVLQLVISKQVNNNRASGLLREKKAEVQGGP >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_6|714_bp atgagacctgatatttaccaagttccaacaagacatgaagataatgatgtggaaatggat gaactgggcacacagtggtgtgattcaggcaagttgcttagcatctccaatctctatgtc tttaccggtaacactgggatcatgatgccctactcaggaccagagagctctgtggtgaca actgaaggagacccttatggcagagtggctgagtggacagatttgcacacaagcattgca cctggacaaacaggactcaaacttgtctgctccaagggcaatattgctccacaggatcat tttactggctgcatcctcagccatccaacagagaaattccttctcaactcagtgtctgta aaccccaagaatgcctcttatgcagaaaaatcatttgaccacgtacaatatcctttcatg ataaaagctctcaacaaattaaacatagaagaaatgtacctcaacacattaaaggcaatc tatgacaagtccaaaattaatatcatattcaatggtgaaaaggcaagtcattcagaatta tcgcttaacacccctcaaattagactctccagttgtaaaggcctagagctacatgtgaat tttgagagcactcagtcccttgtactccaacttgtcattagtaaacaagttaataataat cgcgccagtggattgttacgtgagaagaaggctgaagtccagggtggtccctag >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_7|110_aa MVKEQKSSFVGEDEEVSSGDERPYFSALCGGKQLSKLVHPLNHHPYCPFKPELCLPLSFP NSWASFKAQYLSLRPEMFPDSPIRVPNTCCIAQSGIYKIVSIAFLFICVS >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_7|333_bp atggtgaaggagcagaagagctcatttgtgggtgaagatgaggaggtcagctctggagat gaaaggccctatttttctgcactctgtggagggaaacagctcagcaagcttgttcacccc ctgaaccatcacccatactgtccctttaaaccagaactctgcttgccactctcctttcct aactcttgggcatccttcaaggctcagtatttgtcacttcgtccagaaatgtttcctgac tccccaatccgagtccctaatacctgctgcatagcccaatccggcatctataaaattgtt tcaattgcattcttattcatctgtgtctcctaa >gi568815590r:116545009_116855797|GENSCAN_predicted_peptide_8|319_aa XKETKAKRKRKLIVDSVKELDSKTIRAQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKL FSLPAQPLWNNRLLKLFTRCLTPLVPEDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQ QHQQRDVIDEPIIEEPSRLQESVMEASRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQ QVEQMEIPPVELPPEEPPNICQLIPELELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQE ERRWNKRTQQMLHGLQRALAKTGAESISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELT QEEPYSDIIATPGPRFHII >gi568815590r:116545009_116855797|GENSCAN_predicted_CDS_8|960_bp nttaaagaaacaaaagccaagaggaagaggaagctaattgttgacagtgtcaaagagttg gatagcaagacaattagagcccaacttagtgattattcagatattgttactactttggat ctggcaccgcccaccaagaaattgatgatgtggaaagagacaggaggagtagaaaaactg ttttctttacctgctcagcctttgtggaataacagactactgaagctctttacacgctgt cttacaccgcttgtaccagaagaccttagaaaaaggaggaaaggaggagaggcagataat ttggatgaattcctcaaagaatttgaaaatccagaggttcctagagaggaccagcaacag cagcatcagcagcgtgatgttatcgatgagcccattattgaagagccaagccgcctccag gagtcagtgatggaggccagcagaacaaacatagatgagtcagctatgcctccaccacca cctcagggagttaagcgaaaagctggacaaattgacccagagcctgtgatgcctcctcag caggtagagcagatggaaataccacctgtagagcttcccccagaagaacctccaaatatc tgtcagctaataccagagttagaacttctgccagaaaaagagaaggagaaagagaaggaa aaagaagatgatgaagaggaagaggatgaagatgcatcagggggcgatcaagatcaggaa gaaagaagatggaacaaaaggactcagcagatgcttcatggtcttcagcgtgctcttgct aaaactggagctgaatctatcagtttgcttgagttatgtcgaaatacgaacagaaaacaa gctgccgcaaagttctacagcttcttggttcttaaaaagcagcaagctattgagctgaca caggaagaaccgtacagtgacatcatcgcaacacctggaccaaggttccatattatataa