GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:44:07 Sequence gi568815575f:16619725_16860356 : 240632 bp : 43.77% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1442 1500 59 1 2 70 68 97 0.565 6.68 1.02 Intr + 1758 1883 126 0 0 -9 86 110 0.421 0.89 1.03 Intr + 13144 13174 31 1 1 79 50 11 0.001 -5.47 1.04 Intr + 26606 26765 160 1 1 68 105 79 0.188 7.16 1.05 Intr + 30457 30479 23 2 2 98 92 0 0.174 -1.34 1.06 Intr + 30545 30576 32 1 2 94 113 22 0.199 2.23 1.07 Intr + 31179 31245 67 0 1 35 111 69 0.199 2.81 1.08 Intr + 32034 32162 129 2 0 26 91 51 0.507 0.09 1.09 Term + 34681 34785 105 0 0 85 49 71 0.915 1.31 1.10 PlyA + 34924 34929 6 1.05 2.17 PlyA - 35953 35948 6 1.05 2.16 Term - 46524 46426 99 0 0 83 39 66 0.662 -0.67 2.15 Intr - 47833 47790 44 2 2 94 119 8 0.741 2.46 2.14 Intr - 51724 51648 77 0 2 78 96 61 0.665 5.06 2.13 Intr - 58726 58638 89 1 2 60 61 83 0.612 1.57 2.12 Intr - 63502 63370 133 0 1 115 101 164 0.985 21.05 2.11 Intr - 69877 69726 152 1 2 47 115 20 0.826 -0.44 2.10 Intr - 71896 71816 81 1 0 34 77 95 0.851 2.83 2.09 Intr - 73500 73417 84 0 0 77 105 12 0.770 1.82 2.08 Intr - 73763 73647 117 2 0 82 80 74 0.900 6.66 2.07 Intr - 76695 76565 131 1 2 73 87 10 0.944 -0.19 2.06 Intr - 79369 79199 171 2 0 82 62 117 0.977 8.41 2.05 Intr - 83192 83013 180 0 0 20 80 217 0.259 13.94 2.04 Intr - 91869 91789 81 1 0 76 111 -1 0.337 0.61 2.03 Intr - 92887 92611 277 1 1 23 93 272 0.916 18.29 2.02 Intr - 100109 99805 305 0 2 87 26 312 0.480 21.21 2.01 Init - 108773 108725 49 2 1 94 58 47 0.653 1.41 2.00 Prom - 111582 111543 40 -7.46 3.00 Prom + 117493 117532 40 -7.06 3.01 Init + 118709 118753 45 1 0 69 119 -10 0.325 0.98 3.02 Intr + 123977 124116 140 1 2 71 67 143 0.956 9.76 3.03 Intr + 135221 135369 149 2 2 58 76 174 0.896 13.08 3.04 Intr + 136939 136997 59 2 2 48 74 16 0.484 -5.20 3.05 Intr + 137438 137585 148 1 1 53 93 140 0.577 10.81 3.06 Term + 140508 140635 128 1 2 86 37 169 0.999 10.04 3.07 PlyA + 141157 141162 6 1.05 4.00 Prom + 160307 160346 40 -3.76 4.01 Init + 166001 166016 16 1 1 76 108 6 0.473 1.79 4.02 Intr + 167224 167310 87 2 0 32 85 106 0.464 4.64 4.03 Intr + 168303 168352 50 1 2 60 67 51 0.378 -1.40 4.04 Term + 171023 171115 93 1 0 36 49 127 0.595 1.43 4.05 PlyA + 172898 172903 6 1.05 5.00 Prom + 175251 175290 40 -4.16 5.01 Init + 181260 181411 152 2 2 36 58 175 0.727 6.82 5.02 Intr + 198224 198368 145 2 1 84 50 86 0.849 4.68 5.03 Intr + 198850 199153 304 0 1 114 58 132 0.897 8.96 5.04 Intr + 200440 200531 92 2 2 74 70 76 0.729 4.11 5.05 Intr + 208370 208540 171 1 0 11 96 127 0.398 5.94 5.06 Intr + 209852 210046 195 1 0 -15 80 289 0.689 17.41 5.07 Intr + 212899 213018 120 0 0 50 78 161 0.996 11.99 5.08 Intr + 217869 217961 93 2 0 68 63 57 0.700 1.36 5.09 Term + 221704 222042 339 0 0 81 44 360 0.999 25.44 5.10 PlyA + 222511 222516 6 1.05 6.09 PlyA - 223924 223919 6 1.05 6.08 Term - 225379 225311 69 1 0 83 40 83 0.950 0.94 6.07 Intr - 226214 226104 111 2 0 61 101 27 0.742 1.88 6.06 Intr - 229577 229520 58 1 1 61 80 87 0.652 4.09 6.05 Intr - 232904 232687 218 2 2 80 81 48 0.255 0.60 6.04 Intr - 233151 233025 127 2 1 97 8 96 0.943 3.18 6.03 Intr - 234118 233958 161 1 2 113 67 185 0.981 17.69 6.02 Intr - 237985 237870 116 2 2 89 47 20 0.895 -1.83 6.01 Intr - 239125 238952 174 2 0 57 91 161 0.902 13.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_1|243_aa MTSVAMGTAINELPSDFIVRMPLLEHVEKQDGPGTKWAPALEQSERTRDSSYSTSPQNAD PSPPFPQLKNKETGGTNSIHFIKLQQGFKEAVGVGCLGQHTAHSKDTHSRSTGYGQETSG NSTAEAVSLLGNQNGVKDNGLERRKWPNGTKILQTTLSTKLYFSFLSCQNQNAFKATEYR EELIVLTPIYSTVFPFAKEKAESGHLPRMGPNTLDDLFQELDKNGDGEVSFEEFQVLVKK ISQ >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_1|732_bp atgaccagtgtggcgatgggcacggcgatcaacgagctgcctagtgacttcattgtcaga atgcccctactggagcatgtggagaagcaggatggtccagggaccaagtgggctccagca ctggagcagtctgagaggaccagagactcttcctacagcactagccctcaaaacgcagat cccagccctccatttcctcagctgaaaaataaggagactggggggaccaacagcatccac ttcataaagttgcagcaaggatttaaggaggccgtgggagttgggtgcctaggtcagcac actgcccatagtaaagatacgcactcaagaagcacaggctatggtcaggagacctctggg aactcaactgctgaagctgtttcactattgggcaaccagaatggtgtgaaagataatggt ctggagagaaggaaatggcccaatggcactaaaatcctgcagacaaccctcagcacaaaa ctgtacttcagctttttgagttgtcaaaatcaaaatgcattcaaggcaacagagtacaga gaagagctgatcgtcctcactccaatatacagtactgtctttccttttgctaaagaaaaa gctgagtcaggacacttgccaaggatgggtccaaacaccctagatgatctctttcaagaa ctggacaagaatggagatggagaagttagttttgaagaattccaagtattagtaaaaaag atatcccagtga >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_2|689_aa MAFHHVGQAGLELLASDSATVSDGCSGGASPLGCPPPATGCCKPNQLLKPRNIPLAGRRR GPDPQRPQRALVAFDSTLRDSQGDVRKSEEHRSRRSPASMTTPPRPRKRHFRPKAPPLAL VLDHCTNPGANPKLTSVSELGQQQGGEGKVGEEGAAHPEARVGGGKIPGAALDSPGSDPF PSPTGERQASRGVEPAVPTGHPPRCYHHRAGPALNPGSDCYHSLALTVLELHRTRSSATI LPMKYILVTGGVISGIGKGIIASSIGTILKSCGLRVTAIKIDPYINIDAGTFSPYEHGEV FVLNDGGEVDLDLGNYERFLDINLYKDNNITTGKIYQHVINKERRGDYLGKTVQGAHWLD FKFHGNRETDKQNPFSVITLMFIFAYWILVVNHPLWLILGGTIGDIEGMPFVEAFRQFQF KAKRENFCNIHVSLVPQLSATGEQKTKPTQNSVRALRGLGLSPDLIVCRSSTPIEMAVKE KISMFCHVNPEQVICIHDVSSTYRVPVLLEEQSIVKYFKERLHLPIGDSASNLLFKWRNM ADRYERLQKICSIALVGKYTKLRDCYASVFKALEHSALAINHKLNLMYIDSIDLEKITET EDPVKFHEAWQKLCKAEKMLPETGPNPDPKRGFLDLAQERIQDADSTEFRPNAPVPLSEF CKMMVGLQSACLPATHAHPVVSLLLGTYF >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_2|2070_bp atggcgtttcaccatgttggccaggctggtcttgaactcctggcctcagactcagccacc gtctcggacggctgctcgggtggagcatctccattgggctgcccaccgcctgccaccggc tgctgcaagcccaaccaactgctcaagccccggaacatcccgctcgcgggccgccgccgc ggtcccgatccccagagaccgcagcgagcactggttgcctttgactccactctgcgcgac tcccagggagatgtccggaagtcggaggaacaccgcagccgccgcagccccgcctctatg acgactccgccgcggccgcgcaagcgtcacttccgtccaaaggctccgcccctggcgctt gttttggaccattgcacaaacccgggtgcaaaccccaagctcaccagcgtgagtgagctg ggccagcagcagggaggagaggggaaggtgggcgaggagggcgccgcgcaccccgaggcc cgtgtgggcggtgggaagatcccgggggcggctttggacagccccggcagcgaccccttc cccagcccgacaggtgagcgccaggccagccgcggggtggagcccgccgtgcccaccggc caccctccccggtgctaccaccaccgcgcaggccccgcactcaaccccggttctgattgt tatcatagcttagctttgactgttctagaacttcatagaacaagatcatcagccactatt ctgccaatgaagtacatcctggtcacgggtggggtcatctcaggcattggtaaagggatc attgccagcagcattggaacgattctaaaatcatgtggactccgagttactgccataaaa atcgacccctatattaacatcgatgctggcactttttcaccttatgaacacggtgaagtc ttcgtcttaaatgatggtggagaagttgatttagaccttggaaattatgaaagatttttg gatattaatctttataaagacaacaatatcaccacggggaagatatatcagcatgtgatc aataaagagaggcgtggtgattacctggggaaaacagtgcaaggagctcattggctggat tttaaatttcatggaaacagggaaactgacaaacagaaccccttctccgttataaccctg atgtttatatttgcttattggatactagtggtcaaccatcctctttggttgattctggga ggcaccattggagacatcgaaggaatgccgtttgtggaggcgtttagacaattccagttt aaggcgaaaagagagaatttctgtaatatccacgttagccttgtcccacagctcagtgct accggagaacaaaaaaccaaacccacccaaaacagcgtccgcgcactgaggggtttaggc ctgtctccagatctgattgtctgccgaagttcaacgcccattgagatggccgtgaaggag aagatttctatgttttgtcacgtgaaccctgaacaggtcatatgtatccatgatgtttct tccacataccgagttcctgtgcttttagaggaacaaagcattgtgaaatattttaaggag agattgcacctgcccatcggtgattctgcaagtaatttgctttttaagtggagaaatatg gctgacaggtatgaaaggttacagaaaatatgctccatagccctggttggcaaatacacc aagctcagagactgctacgcctctgtgttcaaagccctggaacactcagccctggccatc aaccacaagttgaatctgatgtacatagactccattgatctggagaagatcactgaaacc gaggaccctgtgaaatttcatgaagcttggcagaagctatgcaaagctgaaaagatgtta ccagaaacgggtcccaatccagaccccaagagagggttcttggatcttgctcaagaaaga attcaggatgctgattccacagagtttaggccaaatgccccagttcctctgagcgagttc tgcaagatgatggttggccttcagtcagcctgtttgccagccacccatgcacacccagtc gtgtccctgctgctgggcacctacttctaa >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_3|222_aa MQSARVSNLRKVTQLDKRNFLRDPPAGVQFNFDFDQMYPVALVMLQEDELLSKMRFALVP KLVKEEVFWRNYFYRVSLIKQSAQLTALAAQQQAAGKEEKSNGREQDLPLAEAVRPKTPP VVIKSQLKTQEDEEEISTSPGVSEFVSDAFDACNLNQEDLRKEMEQLVLDKKQEETAVLE EDSADWEKELQQELQEYEVVTESEKRDENWDKEIEKMLQEEN >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_3|669_bp atgcagtcagcaagggtaagtaatttgcggaaggtcactcagctggacaagaggaatttc cttcgtgaccctccggctggcgtgcaatttaatttcgactttgatcagatgtaccccgtg gccctggtcatgctccaggaggatgagctgctaagcaagatgagatttgccctcgttcct aaacttgtgaaggaagaagtgttctggaggaactacttttaccgcgtctccctgattaag cagtcagcccagctcacggccctggctgcccaacagcaggccgcagggaaggaggagaag agcaatggcagagagcaagatttgccgctggcagaggcagtacggcccaaaacgccaccc gttgtaatcaaatctcagcttaaaactcaagaggatgaggaagaaatttctactagccca ggtgtttctgagtttgtcagtgatgccttcgatgcctgtaacctaaatcaggaagatcta aggaaagaaatggagcaactagtgcttgacaaaaagcaagaggagacagccgtactggaa gaggattctgcagattgggaaaaagaactgcagcaggaacttcaagaatatgaagtggtg acagaatctgaaaaacgagatgaaaactgggataaggaaatagagaaaatgcttcaagag gaaaattag >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_4|81_aa MNEVPITAVTVLPALSAAVRGLAGVVGSPSWRRAERSSACRTEKEAVDQCPHQSDAVEKA GQLSLPSFADDKLLNDVNSLA >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_4|246_bp atgaatgaagtgccaatcaccgccgtcaccgtgctcccggcgctgagcgccgccgtccgg gggctcgctggggtcgtgggctcgccctcctggcgtcgggcagaaagatccagtgcttgc aggactgaaaaagaagctgtggaccaatgtcctcaccagtcagatgctgtggagaaggct gggcagctttcactgccctcatttgctgatgacaagctgctcaacgatgttaactcactt gcctaa >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_5|536_aa MCDAPDPLALLLLLSLCGTPAPVLPSVMNKSSLGLARSDASAMLPVQPTEPVLEMSAFCQ SRGEMVEKELVLLRIKVLDSRVLCPHSVGGNFTWKSPLGFEIGTMEEAGICGLGVKADML CNSQSNDILQHQGSNCGGTSNKHSLEEDEGSDFITENRNLVSPAYCTQESREEIPGGEAR TDPPDGQQDSECNRNKEKTLGKEVLLLMQALNTLSTPEEKLAALCKKYADLLEESRSVQK QMKILQKKQAQIVKEKVHLQSEHSKAILARSKLESLCRELQRHNKTLKEENMQQAREEEE RRKEATAHFQITLNEIQAQLEQHDIHNAKLRQENIELGEKLKKLIEQYALREEHIDKVFK HKELQQQLVDAKLQQTTQLIKEADEKHQREREFLSLYMDKFEEFQTTMAKSNELFTTFRQ EMEKKTVRDKEYKALQIKLERLEKLCRALQTERNELNEKVEVLKEQVSIKAAIKAANRDL ATPVMQPCTALDSHKELNTSSKRALGAHLEAEPKSQRSAVQKPPSTGSAPAIESVD >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_5|1611_bp atgtgtgatgcccctgaccccttagctctcttgcttctgctctcgctgtgtggcacgcct gctcctgttttgccttccgtcatgaataagagctccctgggcctcgccagaagcgatgcc agcgccatgcttcctgtacagccaacagaaccggtcttggagatgtctgcattttgccag agtcggggagagatggttgagaaagaacttgtgcttctaagaataaaagtcttggacagt cgtgtcctatgcccacattcagttggtgggaacttcacgtggaagtcgccccttggtttt gaaattggcacaatggaagaagctggaatttgtgggctaggggtgaaagcagatatgttg tgtaactctcaatcaaatgatattcttcaacatcaaggctcaaattgtggtggcacaagt aacaagcattcattggaagaggatgaaggcagtgactttataacagagaacaggaatttg gtgagcccagcatactgcacgcaagaatcaagagaggaaatccctgggggagaagctcga acagatccccctgatggtcagcaagattcagagtgcaacaggaacaaagaaaaaacttta ggaaaagaagttttattactgatgcaagccctaaacaccctttcaaccccagaggagaag ctggcagctctctgtaagaaatatgctgatcttctggaggagagcaggagtgttcagaag caaatgaagatcctgcagaagaagcaagcccagattgtgaaagagaaagttcacttgcag agtgaacatagcaaggctatcttggcaagaagcaagctagaatctctttgcagagaactt cagcgtcacaataagacgttaaaggaggaaaatatgcagcaggcacgagaggaagaagaa cgacgtaaagaagcaactgcacatttccagattaccttaaatgaaattcaagcccagctg gagcagcatgacatccacaacgccaaactccgacaggaaaacattgagctgggggagaag ctaaagaagctcatcgaacagtacgcactgagggaagagcacattgataaggtgttcaaa cataaggaactgcaacagcagctcgtggatgccaaactgcagcaaacgacacaactgata aaagaagctgatgaaaaacatcagagagagagagagtttctttctctttatatggataag tttgaagaattccagactaccatggcaaaaagcaatgaactgtttacaaccttcagacag gaaatggaaaagaaaacagtccgtgataaagagtacaaggcccttcaaataaaactggaa cggttagagaagctgtgcagggctcttcagacagaaaggaatgagctcaatgagaaggtg gaagtcctgaaagagcaggtatccatcaaagcggccatcaaagcggcgaacagggattta gcaacacctgtgatgcagccctgtactgccctggattctcacaaggagctgaacacttcc tcgaaaagagccctgggagcgcacctggaggctgagcccaagagtcagagaagcgctgtg caaaagcccccgtccacaggctctgctccggccatcgagtcggttgactaa >gi568815575f:16619725_16860356|GENSCAN_predicted_peptide_6|344_aa XFGGFGSVTGKIECEIKINHEGEVNRARYMPQNPHIIATKTPSSDVLVFDYTKHPAKPDP SGECNPDLRLRGHQKEGYGLSWNSNLSGHLLSASDDHTVCLWDINAGPKEGKIVDAKAIF TGHSAVVEDVAWHLLHESLFGSVADDQKLMIWDTRSNTTSKPSHLVDAHTAEVNCLSFNP YSEFILATGSADKTVALWDLRNLKLKLHTFESHKDEIFQVCDSFLVSVCQEMKSRDSSVK LYQTRGSTIEKNVLYVVLHGDAHRVSKIGEEQSAEDAEDGPPELLFIHGGHTAKISDFSW NPNEPWVICSVSEDNIMQIWQMAENIYNDEESDVTTSELEGQGS >gi568815575f:16619725_16860356|GENSCAN_predicted_CDS_6|1035_bp naatttggtggctttggttctgtaacaggaaaaattgaatgtgaaattaaaatcaatcac gaaggagaagtaaaccgtgctcgttacatgccgcagaatcctcacatcattgctacaaaa acaccatcttctgatgtgttggtttttgactatacaaaacaccctgctaaaccagaccca agtggagaatgtaatcctgatctcagattaagaggtcaccagaaggaaggctatggtctc tcctggaattcaaatttgagtggacatctcctaagtgcatctgatgaccatactgtttgt ctgtgggatataaacgcaggaccaaaagaaggcaaaattgtggatgctaaagccatcttt actggccactcagctgttgtagaggatgtggcctggcacctgctgcacgagtcattgttt ggatctgttgctgatgatcagaaacttatgatatgggacaccaggtccaataccacctcc aagccgagtcacttggtggatgcgcacactgccgaagtcaactgcctctcattcaatccc tacagcgaatttattctagccaccggctctgcggataagaccgtagctttatgggatctg cgtaacttaaaattaaaactccataccttcgaatctcataaagatgaaattttccaggta tgtgacagttttctggtgtctgtatgtcaggaaatgaaatccagggattcatcagttaag ctataccagactcgtgggtccactattgaaaaaaatgtcttgtacgtggtgttacatgga gatgctcatagggtaagtaaaattggggaagaacaatcagcagaagatgcagaagatggg cctccagaactcctgtttattcatggaggacacactgctaagatttcagattttagctgg aaccccaatgagccttgggtcatttgctcagtgtctgaggataacatcatgcagatatgg caaatggctgaaaatatttacaatgatgaagagtcagatgtcacgacatccgaactggag ggacaaggatcttaa