GENSCAN 1.0 Date run: 4-Nov-116 Time: 18:23:06 Sequence gi568815588f:117443268_117648229 : 204962 bp : 47.12% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3810 3943 134 0 2 87 93 44 0.595 5.09 1.02 Intr + 3991 4123 133 2 1 63 31 79 0.120 -0.60 1.03 Intr + 14782 14899 118 1 1 89 72 47 0.295 3.67 1.04 Intr + 15191 15326 136 1 1 58 94 46 0.267 2.34 1.05 Intr + 24068 24117 50 0 2 113 44 16 0.021 -1.90 1.06 Intr + 28269 28319 51 2 0 104 103 -15 0.029 0.70 1.07 Term + 33919 33987 69 0 0 124 48 36 0.049 1.14 1.08 PlyA + 34183 34188 6 1.05 2.02 PlyA - 34217 34212 6 1.05 2.01 Sngl - 36945 36502 444 0 0 41 42 249 0.812 11.88 2.00 Prom - 44793 44754 40 -2.56 3.00 Prom + 49154 49193 40 -6.06 3.01 Init + 49726 49821 96 0 0 88 73 -16 0.193 -2.79 3.02 Intr + 53395 53601 207 0 0 60 105 99 0.681 8.07 3.03 Intr + 55947 56059 113 2 2 92 93 42 0.568 4.28 3.04 Term + 69635 69743 109 2 1 109 43 44 0.375 -0.02 3.05 PlyA + 72886 72891 6 1.05 4.00 Prom + 84428 84467 40 -2.66 4.01 Init + 91405 91768 364 0 1 69 44 172 0.355 7.84 4.02 Intr + 92038 92175 138 2 0 72 68 80 0.297 4.84 4.03 Intr + 92308 92400 93 2 0 80 101 31 0.071 3.54 4.04 Intr + 99080 99242 163 0 1 -132 68 300 0.032 5.03 4.05 Intr + 100000 100406 407 1 2 22 102 615 0.039 50.29 4.06 Intr + 102048 102209 162 1 0 98 43 47 0.335 1.15 4.07 Intr + 102365 102549 185 0 2 129 62 205 0.992 21.51 4.08 Term + 104798 104965 168 1 0 119 42 108 0.989 7.18 4.09 PlyA + 105507 105512 6 1.05 5.14 PlyA - 105661 105656 6 1.05 5.13 Term - 110475 110330 146 1 2 52 42 139 0.754 3.77 5.12 Intr - 113070 112893 178 0 1 98 28 100 0.681 4.49 5.11 Intr - 118158 118012 147 0 0 28 110 57 0.574 2.33 5.10 Intr - 122877 122749 129 0 0 1 94 133 0.294 6.09 5.09 Intr - 129199 128947 253 0 1 67 39 79 0.009 -1.66 5.08 Intr - 142383 142235 149 0 2 81 77 75 0.573 4.73 5.07 Intr - 143283 143199 85 2 1 141 74 57 0.927 9.42 5.06 Intr - 144734 144640 95 2 2 36 95 19 0.735 -3.84 5.05 Intr - 144985 144770 216 1 0 73 103 87 0.603 7.40 5.04 Intr - 157829 157732 98 0 2 138 53 15 0.439 2.73 5.03 Intr - 164120 164043 78 0 0 50 88 47 0.225 0.42 5.02 Intr - 166741 166661 81 2 0 62 84 65 0.232 3.11 5.01 Init - 191773 191719 55 1 1 64 98 53 0.538 5.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100406 406 1 1 61 102 608 0.924 56.25 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:117443268_117648229|GENSCAN_predicted_peptide_1|230_aa XNHMAKGAACLDDQGPRPLISPCILLGLSVLQILMTAMEEEQLFEDLTHAHRVFNDHLWM TPTVSSCQVPTDDYLVGPLVSQTEYLWCPQSSVFFHQQLHSGSSLTDGGPGHGPFFPGAG QPDMTSSRQQHVGPRPVSTPMELHFPNTWHKHDLVKCSQHLFEIGGVFSFIFQMEAACVR GKRNARARENQPLATSILFSVCGFRLPHHQSEERLGGRGDFEKLLLCQVN >gi568815588f:117443268_117648229|GENSCAN_predicted_CDS_1|693_bp nncaatcacatggccaagggagctgcctgcttggatgaccaggggcccaggcctcttatc tccccttgcattttattgggactgtcagtcctgcaaatactcatgacggcaatggaggag gagcagttgtttgaagacctcacccatgcccatcgtgtctttaacgaccacctgtggatg actcctacggtctccagctgccaggtgcctactgatgactacctggttggcccactggtc tcccaaacagagtatctttggtgtccccaaagttctgtgtttttccatcagcagctgcat tctggttcctccctcaccgatggcggtcctggtcacggccctttcttccctggggcaggg cagcctgatatgacatcgtcgaggcagcagcatgtgggcccccgacccgtgtctaccccc atggagctccattttcctaacacctggcacaagcacgacctagtgaaatgctcacagcat ctcttcgagatcggtggcgttttctcattcatttttcagatggaggctgcctgtgtccgt ggcaagaggaatgcgcgagccagagagaaccagcccctggcaaccagcattctattttcg gtctgtggatttcgcttacctcatcaccagagtgaggagaggctaggaggaagaggagac tttgagaagctgcttctttgccaggtgaactag >gi568815588f:117443268_117648229|GENSCAN_predicted_peptide_2|147_aa MANRLAQVKAITTRWANGLAQVKTLTTRWANGLAQVKTLTTRQANGLAQVKTLTTRQAKR LAQVKTLTTRWANGLAQVKTLTTRLANGLAQVKAFTTRRVNGLAQVKALTTRLVNGQAQV KALTTRWVNWLAQVKALAICMVHPHVV >gi568815588f:117443268_117648229|GENSCAN_predicted_CDS_2|444_bp atggcaaacaggctagctcaggttaaagccatcaccaccagatgggcaaatgggctagct caggttaaaaccctcaccaccagatgggcaaacgggctagctcaggttaaaaccctcacc accagacaggcaaacgggttagctcaggttaaaaccctcaccaccagacaggcaaagagg ctagctcaggttaaaaccctcaccactagatgggcaaacgggctagctcaggttaaaacc ctcaccaccagactggcgaatgggctagctcaggttaaagccttcaccaccagacgggtg aatgggctagctcaggttaaagccctcaccaccagactggtgaatggacaagctcaggtt aaagccctcaccaccagatgggtaaactggctagctcaggttaaagcccttgccatctgc atggtacatccacacgtggtataa >gi568815588f:117443268_117648229|GENSCAN_predicted_peptide_3|174_aa MPNFHSQGRKSLTHSESRELETFGEQSPKVAERASLSAEETTPYQDFPAGGSRRTPETAG RLHFELKVLPIKCKIRSLETHRVWLTPDADLHWRLCGGDEPVSSNTTDRSFPFSANIHVH FVLDVVEVFLTTAQQTEIRCPMDLPALLKVLYGSQGLHSKYSHLLSLYLRSSAI >gi568815588f:117443268_117648229|GENSCAN_predicted_CDS_3|525_bp atgcctaactttcacagtcagggaaggaaaagcctcacacactcagaaagcagagaacta gaaacatttggtgaacagtctcccaaagtggctgagcgagcgagcctgtcggccgaggag accacgccataccaggattttccagcaggtggcagccgacgcacgccagagacagctggg cgcctacactttgaactcaaggttcttccaattaagtgcaagataaggagcttggagact catcgcgtttggcttactcctgacgcagacttgcactggaggttgtgtggaggggatgag ccggtttcctccaacaccacagacaggtctttcccattctctgcaaatattcacgtccat tttgtccttgatgtggtggaagtgttcttaacaactgcacagcagactgagatacgctgc ccgatggatcttccagccctgctcaaagtcctctatggctcccaggggctacacagcaaa tatagccacctccttagtctttatttaagatcttctgcaatttga >gi568815588f:117443268_117648229|GENSCAN_predicted_peptide_4|559_aa MPPFVAHAAPGKLLAAGAKRRAAAVGPLRVHARGGRTLRAQRRGGLRGPSGLRLLRVPRG SLGPHSRCLLQTPGAGHEAEGSAWPWEKRGARSLPAPRRRGDANRGAGPGMANLQDRGAP ECGAEPGKKTLDEGSWGRLAGCGWKHPFSRPPAQALLCGHQQRFPGRFLNSGLETAGAKP GRRKRAGPEAGKSGPGVNEEEEEEEEEEEEEEEEKKKLEEEEEAKEEEKPQRAPQERVSW ERGAKARRSPAARMFQPAPKRCFTIESLVAKDSPLPASRSEDPIRPAALSYANSSPINPF LNGFHSAAAAAAGRGVYSNPDLVFAEAVSHPPNPAVPVHPVPPPHALAAHPLPSSHSPHP LFASQQRDPSTFYPWLIHRYRYLGHRFQAPGEEVARPAAPGRLRSRPRSGAWGLAPLRQR LVGCRTGGDPAFRAGQLSRRSLRNDTSPESFLLHNALARKPKRIRTAFSPSQLLRLEHAF EKNHYVVGAERKQLAHSLSLTETQVKVWFQNRRTKFKRQKLEEEGSDSQQKKKGTHHINR WRIATKQASPEEIDVTSDD >gi568815588f:117443268_117648229|GENSCAN_predicted_CDS_4|1680_bp atgccaccctttgttgcgcacgccgcccccggcaagctgctggcggccggggctaagcgg agggcggccgcggtcgggcctctccgggtccacgccaggggtgggaggacacttcgggcc cagcgtagaggaggcctccgaggcccaagcgggcttcggctgctgcgagtccccagaggc tccctcgggcctcactcgcggtgtctcctccagacgccaggggcaggccacgaagccgag ggctccgcgtggccgtgggagaagcggggtgcacggagccttcccgcgcccagacgccga ggggatgcaaacaggggagccgggcctggtatggccaacctccaggatcgaggggcccca gaatgcggggcagagccggggaagaagacgctggacgaggggtcttggggccgcctcgct ggctgcggttggaagcacccgttttcccgcccgcccgcgcaggcgctgctctgtggccac cagcagaggtttcccggccgcttcctaaactcggggctggagaccgcaggagctaaacca ggaagaagaaagagggcagggcctgaggccgggaagagcggcccgggggtcaacgaggag gaggaagaagaggaggaggaggaagaagaggaggaggaggagaagaagaaactggaggaa gaggaggaggcgaaggaggaggagaagccgcagcgggcgccgcaggagcgagtgagctgg gagcgaggggcgaaggcgcggagaagcccggccgcccgcatgttccagccggcgcccaag cgctgcttcaccatcgagtcgctggtggccaaggacagtcccctgcccgcctcgcgctcc gaggaccccatccgtcccgcggcactcagctacgctaactccagccccataaatccgttc ctcaacggcttccactcggccgccgccgccgccgccggtaggggcgtctactccaacccg gacttggtgttcgccgaggcggtctcgcacccgcccaaccccgccgtgccagtgcacccg gtgccgccgccgcacgccctggccgcccaccccctaccctcctcgcactcgccacacccc ctattcgcctcgcagcagcgggatccgtccaccttctacccctggctcatccaccgctac cgatatctgggtcatcgcttccaagcgcccggcgaggaggtggcgaggcccgcggcgcca gggcggctgcggagccggccgaggtcaggagcctggggactggctcctctgcgacagcgg ctggttgggtgccgcaccggtggagacccggcgttccgggccgggcagctgagcaggcgt tcccttcggaacgacactagccccgagagtttccttttgcacaacgcgctggcccgaaag cccaagcggatccgaaccgccttctccccgtcccagcttctaaggctggaacacgccttt gagaagaatcactacgtggtgggcgccgaaaggaagcagctggcacacagcctcagcctc acggaaactcaggtaaaagtatggtttcagaaccgaagaacaaagttcaaaaggcagaag ctggaggaagaaggctcagattcgcaacaaaagaaaaaagggacgcaccatattaaccgg tggagaatcgccaccaagcaggcgagtccggaggaaatagacgtgacctcagatgattaa >gi568815588f:117443268_117648229|GENSCAN_predicted_peptide_5|569_aa MRYQPFKELGRKLLQAENTVRICMGIEAFDEYPYVFLRSKENSTGDPANQWQPPYPLTVA ANTNPMELHLKGPVPYPVWFTGIMILPPSALKPMSSGTMLCVPPGACREENGKAQTTPCL RKSKDALQWNSTYQCILGQKEKGREEKADGGHNINNGHSLRACDELAMVLGPRGTVLILK VDTDNQPSKHIGNKLKNDICCSEHETGWSCIVQIFQDYWGTNAEHLVLKDFHCSGQPLIL KEAKPGFSTKQQMSFQQQKNTSPMHMNLRQHRTFTICWLRQVTQPSLCTQRLDEPVARKC QVETSQQTVHERWWNSVLHHPHASQLSPSPELSFTWRKVPSSRLPPSLGQLASKDWSKQE YKDPVPSLIQLRIHPAKPPSDAEEKPLYEMFKEASSRTYPAEKFHWSSSFTTQLSWLLWA ASSCWGLVPMVNLCCDIAGEIVAKAAGTQGGQDHDHRHPRPQALHPTDKVSEVPDPSGTR SSDRAGPPTTDLPALSYLTQDEATARVAPVEDKGWTWWALLEECDAPLPYPDPSSGRWPA VRFWHPDSQLRGPPETPSRATDTNKQKAN >gi568815588f:117443268_117648229|GENSCAN_predicted_CDS_5|1710_bp atgaggtaccagcccttcaaagaactagggagaaagctgctccaagcagagaacactgta cgcatctgcatggggatagaagcttttgatgaatatccatatgtgtttctcaggagcaag gaaaatagtacaggggaccctgccaatcaatggcagcctccttatccactgactgtggcg gccaacaccaatccaatggagttgcatttgaaaggacccgtgccataccccgtctggttt acgggcatcatgatccttcccccatcagccttgaagcccatgagttcagggaccatgctt tgtgtccctccaggggcctgcagagaggaaaatggcaaagcgcagactacaccttgtctc agaaagagcaaggatgccctgcagtggaatagcacttaccagtgcattctgggacaaaag gaaaaaggcagggaggagaaagcagatggtggccataatatcaataatggccattccctg agggcctgcgatgagctagccatggttctaggccccagaggtactgtgctgattctaaag gtagatacagacaaccaaccaagtaaacacataggaaataaactcaagaatgatatatgc tgttcagagcatgaaacaggatggtcctgcattgtccagatcttccaggactattggggg accaatgctgagcacttggtcctgaaagattttcactgcagtggacagcctctcatcctc aaggaggctaagcctggtttttccacaaaacagcagatgagcttccagcagcaaaagaac acaagcccaatgcacatgaacttaagacaacatcgcacatttactatctgctggctaagg caagtcacacagccaagcctctgtacccaaagattggatgaaccagtggcaagaaaatgc caggtggaaacatcccaacagacagtccatgagcgatggtggaactctgtgctgcaccac ccacatgcctcgcagctgagtccttctccagaactttccttcacctggaggaaggtgcct agttcaaggttacccccctccctggggcagctagcatccaaggactggtccaagcaggag tacaaagacccagtccccagcctcattcagcttaggattcatcctgccaagcctccctcg gatgctgaggagaagccactctatgagatgttcaaggaggcctcatcacggacctatccg gctgagaagttccattggtccagctccttcactacccagctatcctggttactctgggct gccagctcctgctggggcctggttcccatggttaatctgtgctgcgacatagcaggggag atagtggctaaagctgctggcacccaagggggacaggaccatgaccacaggcacccaagg ccccaggccctgcaccccacagacaaggtctcagaagtgcctgatccctcaggaacaaga tccagtgaccgggcaggtcccccaacaactgacttacctgccctctcttacctgacccag gatgaggccactgccagagtggcccctgttgaggacaaaggctggacatggtgggctctg ctagaggagtgcgacgcgcctctgccctacccggaccccagctctgggcgctggcccgcc gtccgcttctggcaccccgactcgcagctccggggccctccggagactcccagccgggca acagacactaataaacaaaaggccaattag