GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:26:29

Sequence gi568815590r:116747503_116966729 : 219227 bp : 38.98% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8084   8325  242  1  2   47   92   194 0.136  12.88
 1.02 Intr +  18813  18945  133  0  1   63   34    55 0.000  -2.67
 1.03 Intr +  18962  19090  129  1  0  -17  119   105 0.006   3.07
 1.04 Intr +  19096  19289  194  0  2   -5   86   264 0.007  14.27
 1.05 Intr +  22690  22864  175  1  1   87   59    76 0.514   3.62
 1.06 Term +  23954  24340  387  1  0   97   41   225 0.604  12.65
 1.07 PlyA +  24407  24412    6                               1.05

 2.06 PlyA -  24985  24980    6                               1.05
 2.05 Term -  44669  44478  192  2  0  119   49   121 0.510   7.74
 2.04 Intr -  49170  49028  143  1  2   41   62   107 0.011   2.55
 2.03 Intr -  59244  59056  189  1  0   66   44   120 0.479   4.04
 2.02 Intr -  66644  66530  115  2  1   35   75   103 0.911   2.80
 2.01 Init -  67491  67417   75  0  0   54   95   128 0.992  11.24
 2.00 Prom -  78297  78258   40                              -2.95

 3.00 Prom +  83417  83456   40                              -5.95
 3.01 Init +  90178  90238   61  0  1   53   54   104 0.659   4.96
 3.02 Term +  91224  91495  272  1  2   89   47   177 0.949   8.36
 3.03 PlyA +  91553  91558    6                               1.05

 4.13 PlyA -  95203  95198    6                               1.05
 4.12 Term - 100189  99998  192  1  0   76   49   256 0.773  16.94
 4.11 Intr - 101527 101444   84  1  0   52   72    76 0.721   1.50
 4.10 Intr - 103265 103116  150  2  0   32   82   210 0.713  14.24
 4.09 Intr - 104594 104446  149  0  2   49   95   177 0.985  13.53
 4.08 Intr - 105206 105047  160  2  1   88   22   195 0.982  11.54
 4.07 Intr - 106966 106743  224  2  2   74   85   185 0.987  13.72
 4.06 Intr - 108786 108664  123  1  0  120  107   170 0.999  21.74
 4.05 Intr - 109269 109144  126  1  0   90   82   102 0.906   9.53
 4.04 Intr - 109971 109765  207  1  0   39   77   194 0.232  11.53
 4.03 Intr - 115757 115628  130  2  1   51   72   104 0.396   4.45
 4.02 Intr - 119239 119084  156  1  0   75   95    77 0.475   6.39
 4.01 Init - 126697 126617   81  1  0   83   49   158 0.509  10.42
 4.00 Prom - 128248 128209   40                              -9.35

 5.03 PlyA - 129138 129133    6                               1.05
 5.02 Term - 130167 129913  255  0  0   87   48   144 0.849   4.90
 5.01 Init - 137896 137831   66  1  0   90   72    64 0.593   6.12
 5.00 Prom - 142042 142003   40                              -7.35

 6.00 Prom + 149263 149302   40                              -4.15
 6.01 Sngl + 150947 152125 1179  1  0   60   45   702 0.922  58.82
 6.02 PlyA + 153184 153189    6                               1.05

 7.03 PlyA - 153733 153728    6                               1.05
 7.02 Term - 165522 165104  419  1  2    9   42   265 0.175   8.45
 7.01 Init - 176575 176518   58  1  1   68   64    17 0.099  -3.17
 7.00 Prom - 176908 176869   40                              -1.85

 8.00 Prom + 189876 189915   40                              -6.25
 8.01 Init + 190742 190951  210  1  0   88    3   222 0.140  12.58
 8.02 Term + 195056 195199  144  1  0  111   43   157 0.702  10.43
 8.03 PlyA + 196475 196480    6                               1.05

 9.04 PlyA - 197248 197243    6                               1.05
 9.03 Term - 198797 198598  200  0  2    4   32   215 0.971   4.08
 9.02 Intr - 199062 198883  180  1  0   44   80   100 0.906   3.72
 9.01 Init - 202783 202624  160  1  1   78   48   245 0.802  17.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  19303  18883  421  0  1   77   49   446 0.844  33.38
S.002 Term -  81058  80939  120  1  0   81   50   158 0.930   8.79
S.003 Term - 218948 218833  116  0  2   97   36   104 0.839   3.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_1|419_aa
MLESTSGGTGLGTALPTWARGKRTALTRPSICTCFTAESPEPPLPLPFPAAPAVELEEVA
VEPVPSLRDAIFPSRQEERETLRMCAPVRNRAPQRFSSVRTSSALRDRPFKRGHCGGSAR
WSGHDARKTEAFASVAKGASFPGETGIEGTGACVRRLLMLPGPVASVPPGMKITRQKHAK
KHLGFFRNNFGVREPYQILLDGTFCQAALRGRIQLREQLPRYLMGETQLCTTRCVLKELE
TLGKDLYGAKLIAQKCQVRNCPHFKNAVSGSECLLSMVEEGNPHHYFVATQDQNLSVKVK
KKPGVPLMFIIQNTMVLDKPSPKTIAFVKAVESGQLVSVHEKESIKHLKEEQGLVKNTEQ
SRRKKRKKISGPNPLSCLKKKKKAPDTQSSASEKKRKRKRIRNRSNPKVLSEKQNAEGE

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_1|1260_bp
atgctagaaagtacgtctggtgggacggggctgggaactgcgctccccacgtgggccaga
ggaaaaagaacagcactcacaaggccatctatctgcacttgcttcacggctgaatctccc
gagccgcctttgcctttgcctttccctgctgcgccggcggtggagctggaagaggtggca
gtagagccggtaccttccttgcgggacgccatctttccaagcagacaggaagaaagagaa
acactgcgcatgtgcgcccccgtgcggaatcgcgcaccccagcggttttccagcgtacgc
acaagttccgccctaagggaccgcccctttaagcgcgggcactgcggtggaagcgcgcgc
tggtctggacatgacgcccggaagacagaggcgtttgcttccgtggccaaaggcgcttca
tttccgggtgaaactggcattgagggtactggggcgtgcgtgaggcgtttactgatgctt
cctggtccggtggcctcggtcccgccaggcatgaagatcacaaggcagaaacatgccaag
aagcatcttggcttcttccgcaacaacttcggagtccgcgagccgtaccagatcctgctg
gacggcaccttctgtcaggcggcgctgcggggccgcatccagctgcgggagcagctgccc
cgctacctcatgggggagacgcagctgtgcaccacaagatgtgtgttaaaagagctagaa
acattgggaaaggacttatatggggcaaaactgattgcacaaaaatgccaagttcgaaat
tgtcctcatttcaagaatgcagtgagtggatcagaatgtctgctttccatggttgaagag
ggaaatcctcatcattattttgtggcaacacaggatcagaatttgtctgtgaaagtaaaa
aagaagcctggagttcctctcatgtttattattcagaacactatggttttggacaaacct
tctcccaaaacaattgcctttgtaaaagcagtggagtcaggtcagcttgtctcagtgcat
gagaaagaaagtatcaaacatctcaaagaggaacagggtttagtgaaaaacactgaacag
agtagaagaaaaaagcgcaagaaaataagtggtcccaatcctcttagttgtttgaagaaa
aagaaaaaggcaccggacacacaatcatctgcttctgaaaagaaaagaaaaagaaaaaga
attcggaacagatctaacccaaaagtactttctgagaagcagaatgcagaaggagaatga

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_2|237_aa
MRPDIYQVPTRHEDNDVEMDELGTQWCDSGKLLSISNLYVFTGNTGIMMPYSGPESSVVT
TEGDPYGRVAEWTDLHTSIAPGQTGLKLVCSKGNIAPQDHFTGCILSHPTEKFLLNSVSV
NPKNASYAEKSFDHVQYPFMIKALNKLNIEEMYLNTLKAIYDKSKINIIFNGEKASHSEL
SLNTPQIRLSSCKGLELHVNFESTQSLVLQLVISKQVNNNRASGLLREKKAEVQGGP

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_2|714_bp
atgagacctgatatttaccaagttccaacaagacatgaagataatgatgtggaaatggat
gaactgggcacacagtggtgtgattcaggcaagttgcttagcatctccaatctctatgtc
tttaccggtaacactgggatcatgatgccctactcaggaccagagagctctgtggtgaca
actgaaggagacccttatggcagagtggctgagtggacagatttgcacacaagcattgca
cctggacaaacaggactcaaacttgtctgctccaagggcaatattgctccacaggatcat
tttactggctgcatcctcagccatccaacagagaaattccttctcaactcagtgtctgta
aaccccaagaatgcctcttatgcagaaaaatcatttgaccacgtacaatatcctttcatg
ataaaagctctcaacaaattaaacatagaagaaatgtacctcaacacattaaaggcaatc
tatgacaagtccaaaattaatatcatattcaatggtgaaaaggcaagtcattcagaatta
tcgcttaacacccctcaaattagactctccagttgtaaaggcctagagctacatgtgaat
tttgagagcactcagtcccttgtactccaacttgtcattagtaaacaagttaataataat
cgcgccagtggattgttacgtgagaagaaggctgaagtccagggtggtccctag

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_3|110_aa
MVKEQKSSFVGEDEEVSSGDERPYFSALCGGKQLSKLVHPLNHHPYCPFKPELCLPLSFP
NSWASFKAQYLSLRPEMFPDSPIRVPNTCCIAQSGIYKIVSIAFLFICVS

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_3|333_bp
atggtgaaggagcagaagagctcatttgtgggtgaagatgaggaggtcagctctggagat
gaaaggccctatttttctgcactctgtggagggaaacagctcagcaagcttgttcacccc
ctgaaccatcacccatactgtccctttaaaccagaactctgcttgccactctcctttcct
aactcttgggcatccttcaaggctcagtatttgtcacttcgtccagaaatgtttcctgac
tccccaatccgagtccctaatacctgctgcatagcccaatccggcatctataaaattgtt
tcaattgcattcttattcatctgtgtctcctaa

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_4|593_aa
MAVWEEAVAGPAPAVPAPGAATFRRTRPARTMFYAHFVLSKRGPLAKIWLAAHWDKKLTK
AHVFECNLESSVESIISPKVKMALRTSGHLLLGVVRIYHRKAKYLLADCNEAFIKIKMAF
RPGDFGMDDREIMREGSAFEDDDMLVSTTTSNLLLESEQSTSNLNEKINHLEYEDQYKDD
NFGEGNDGGILDDKLISNNDGGIFDDPPALSEAGVMLPEQPAHDDMDEDDNVSMGGPDSP
DSVDPVEPMPTMTDQTTLVPNEEEAFALEPIDITVKETKAKRKRKLIVDSVKELDSKTIR
AQLSDYSDIVTTLDLAPPTKKLMMWKETGGVEKLFSLPAQPLWNNRLLKLFTRCLTPLVP
EDLRKRRKGGEADNLDEFLKEFENPEVPREDQQQQHQQRDVIDEPIIEEPSRLQESVMEA
SRTNIDESAMPPPPPQGVKRKAGQIDPEPVMPPQQVEQMEIPPVELPPEEPPNICQLIPE
LELLPEKEKEKEKEKEDDEEEEDEDASGGDQDQEERRWNKRTQQMLHGLQRALAKTGAES
ISLLELCRNTNRKQAAAKFYSFLVLKKQQAIELTQEEPYSDIIATPGPRFHII

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_4|1782_bp
atggctgtctgggaggaggcggtggcgggcccggcgcccgcggtgccagccccgggagca
gcaaccttccgccgaactcggccagccagaacaatgttctacgcacattttgttctcagt
aaaagagggcctctggccaaaatttggctagcggcccattgggataagaagctaaccaaa
gcccatgtgttcgagtgtaatttagagagcagcgtggagagtatcatctcaccaaaggtg
aaaatggcattacggacatcaggacatctcttactgggagtagttcgaatctatcacagg
aaagccaaataccttcttgcagactgtaatgaagcattcattaagataaagatggctttt
cggccaggtgattttggaatggatgatcgtgagataatgagagaaggcagtgcttttgag
gatgacgacatgttagtaagcactactacttctaacctcctattagagtctgaacagagc
accagcaatctgaatgagaaaattaaccatttagaatatgaagatcaatataaggatgat
aattttggagaaggaaatgatggtggaatattagatgacaaacttattagtaataatgat
ggcggtatctttgatgatccccctgccctctctgaggcaggggtgatgttgccagagcag
cctgcacatgacgatatggatgaggatgataatgtatcaatgggtgggcctgatagtcct
gattcagtggatcccgttgaaccaatgccaaccatgactgatcaaacaacacttgttcca
aatgaggaagaagcatttgcattggagcctattgatataactgttaaagaaacaaaagcc
aagaggaagaggaagctaattgttgacagtgtcaaagagttggatagcaagacaattaga
gcccaacttagtgattattcagatattgttactactttggatctggcaccgcccaccaag
aaattgatgatgtggaaagagacaggaggagtagaaaaactgttttctttacctgctcag
cctttgtggaataacagactactgaagctctttacacgctgtcttacaccgcttgtacca
gaagaccttagaaaaaggaggaaaggaggagaggcagataatttggatgaattcctcaaa
gaatttgaaaatccagaggttcctagagaggaccagcaacagcagcatcagcagcgtgat
gttatcgatgagcccattattgaagagccaagccgcctccaggagtcagtgatggaggcc
agcagaacaaacatagatgagtcagctatgcctccaccaccacctcagggagttaagcga
aaagctggacaaattgacccagagcctgtgatgcctcctcagcaggtagagcagatggaa
ataccacctgtagagcttcccccagaagaacctccaaatatctgtcagctaataccagag
ttagaacttctgccagaaaaagagaaggagaaagagaaggaaaaagaagatgatgaagag
gaagaggatgaagatgcatcagggggcgatcaagatcaggaagaaagaagatggaacaaa
aggactcagcagatgcttcatggtcttcagcgtgctcttgctaaaactggagctgaatct
atcagtttgcttgagttatgtcgaaatacgaacagaaaacaagctgccgcaaagttctac
agcttcttggttcttaaaaagcagcaagctattgagctgacacaggaagaaccgtacagt
gacatcatcgcaacacctggaccaaggttccatattatataa

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_5|106_aa
MAEGDGEAGTSYVAGAGGREQRLGILTTILNANFLKSPLKTNQKNIKYLANYNRNSLDKA
LSCGQYGHLANGLQPVRRTLPQPKNSGPTVSGAQDCPLSYPSPISA

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_5|321_bp
atggcggaaggtgatggggaagcaggcacatcttacgtggctggagcaggaggaagagag
caaaggcttgggattttaacaactatccttaatgccaatttcttaaagtctccattaaaa
acaaaccaaaagaacataaaatatttggctaactacaacagaaattcacttgataaagct
ttgagctgtggccagtatggccacttagctaatggactccagccagtgagacgaaccttg
ccccaacccaaaaacagtggacccacagtgagtggagcacaagactgtccactttcttat
ccatctcctatatcagcataa

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_6|392_aa
MWKRLRNWVTSRGWNSLEGSEKDRKMWESLELPRDLLNGFDENADSDTKNKVQAEVVSDG
DEELVGNWSQGDSCYVLAKRLAGFCPCPKDLWNFELEKDDLGYLAEKMSKQQSIQDVTWA
LLKAFSFERETEHRSLENFQPDNVIEKKNPFSGEKFKPATEICISSKEPNANPQDHGENV
SRSCHRPSWQPLPSQAQKPRRKKCFCGLGPGLPSCVQPRDLVPCVSAALVMAERGQCTAW
AVGSEGGSPKPWQPPHDVEPVGAEKSRIEVWEPLPRFQKMYGNVWMPRQKFAAGAGPSWR
TSARSVWKGNVGLGSSHRVPTGALTSGAVGRGPPSSRSQNGRSTDSMHHAPGKATDTRRH
PMKAARREAVPCKATGVELPKTTGTHFLHQLD

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_6|1179_bp
atgtggaagcgacttaggaactgggtaacaagcagaggttggaacagtttggagggctca
gaaaaagacaggaaaatgtgggaaagtttggaacttcctagagacttgctgaatggtttt
gatgaaaatgctgatagtgacacgaaaaataaggtccaggctgaggttgtctcagatgga
gatgaggaacttgttgggaactggagccaaggtgactcctgttatgttttagcaaagaga
ctggcagggttttgcccctgccctaaagatttgtggaactttgaacttgagaaagatgat
ttagggtatctggcggaaaaaatgtctaagcagcaaagcattcaagatgtgacttgggcg
ctgttaaaagcattcagttttgaaagggaaacagagcatagaagtttggaaaatttccag
cctgacaatgtgatagaaaagaaaaacccattttctggagagaaattcaagccagctaca
gaaatttgcataagtagcaaggagcctaatgctaatccccaagaccatggggaaaacgtc
tccaggtcatgtcacagaccttcatggcagcctctcccatcacaggcccagaagcctagg
agaaagaagtgtttttgtgggctgggcccagggctgccaagctgtgtgcagcctagagac
ttggtgccctgtgtctccgctgctctagtcatggctgaaagaggccaatgtacagcttgg
gctgtgggttcagagggtggaagtcccaagccttggcagcctccacatgatgttgagcct
gtgggtgcagagaagtcaagaattgaggtttgggaacctctgcctagatttcagaagatg
tatggaaatgtctggatgcccaggcaaaagtttgctgcaggggcagggccctcatggaga
acctctgctaggtcagtgtggaagggaaatgtggggttggggtcctcacacagagtccct
actggggcactgactagtggagctgtgggaagagggccaccatcctccaggtcccagaat
ggtagatccactgacagcatgcaccatgcacctggaaaagccacagacactcgacgccat
cccatgaaagcagccagaagggaggctgtaccctgcaaagccacaggggtggagctgccc
aagaccacgggaacccacttcttgcatcagcttgactag

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_7|158_aa
MTRMGVVAHACNHSTLGGREPLAEGVAIVSAGSADSVFPPAGSEESGQSGRVGFPPAQHT
RLSKGQSECFLKRILDPMPPDWMRSPSRGRQTLYRGAFPLAQVGASLGGSSQRKEQAAIS
AVLQPPLISPGKGGTQVNRAWSGPPANHSSPTEEGPDC

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_7|477_bp
atgaccaggatgggcgtggtggctcatgcctgtaatcacagcactttgggaggccgagag
ccccttgcagaaggagtggccatagtctccgcaggatcagctgactcagtctttccccct
gctggctctgaggaatccgggcagtccggaagagtgggattcccaccagcacagcacacc
cgcttgtccaaggggcagtcagagtgcttccttaaacggatcctggatcccatgcctcct
gactggatgagatcccccagtagaggtcgccagacactttataggggagcgttcccactg
gctcaggtaggtgcttctctgggtgggagctcccagaggaaggagcaggcagccatctct
gctgttctgcagcctccactgatatctccagggaagggaggcacccaggtgaatagggcc
tggagtggacccccagcaaaccacagcagccctacggaagaggggcctgactgttaa

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_8|117_aa
MGPGDFRRCRERISQGLQGLPGRAELWFPPRPACDFFGDGRSTDIQEEALAASPLLEDLR
RRLTRAFQWAVEMHFQNHQLARTLLDLNMKVQQLKKEYELEITSDSQSPKDDAANPE

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_8|354_bp
atgggccccggggacttccgccgctgcagagagagaatttcccaggggctccagggactc
ccaggtagagcggagctttggttcccacctcgtcccgcgtgcgacttcttcggggacggc
aggagcacggacatccaggaggaggccctcgccgccagcccgctgctggaggacctcaga
cgacggctgacgcgcgccttccagtgggcggtggaaatgcatttccaaaaccaccagctg
gctagaactttactggacctaaacatgaaagtgcagcaattgaaaaaggagtatgaactg
gaaattacatcagactcccaaagcccaaaagatgatgctgcgaatccggaataa

>gi568815590r:116747503_116966729|GENSCAN_predicted_peptide_9|179_aa
MLAALAALARSQRLLGLGVRSGHAREALQPDTALWGPLSGLANAGASSLCLWGGRGEALS
PPICINVRIRKQMRGTQPPEKQCLTQATVIGVHPSESYCPALSSDGTRGCQTRGNRGCGN
GSSSNSESNQERRQLALEGIVPGSRHRGILAVITAVNRADGCAGTLGPARKGAGEMRFL

>gi568815590r:116747503_116966729|GENSCAN_predicted_CDS_9|540_bp
atgctggcagccctcgcagcccttgctcgctctcagcgcctcctcggcctcggcgtccgc
tctgggcatgctcgagaagcccttcagcccgacactgcgctctgggggcccctctctggg
ctggccaatgcgggagccagctccctctgcttgtggggaggaagaggtgaagccctctca
cccccaatatgcatcaatgtcagaatcaggaagcagatgagaggtacacagccccctgag
aagcagtgcttgacacaggctactgtaataggtgtccacccatctgaatcctactgccca
gcactcagctcagatggaacaagaggctgccagacgagaggaaacagaggctgcggcaat
ggcagcagcagcaacagtgaaagcaaccaagagagaaggcagctggcactggagggcatc
gtgccaggttccaggcacagagggattctggcagtgattacagcggtaaatagagctgat
ggttgtgccgggaccctaggccctgccaggaagggagcaggggagatgagattcttataa