GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:29:12 Sequence gi568815594f:102769085_102987427 : 218343 bp : 37.88% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4610 4742 133 2 1 59 59 124 0.117 6.33 1.02 Intr + 10157 10261 105 1 0 35 87 62 0.007 0.29 1.03 Intr + 57419 57963 545 1 2 71 86 157 0.144 4.56 1.04 Intr + 58398 58563 166 0 1 63 48 108 0.164 3.34 1.05 Intr + 91927 92124 198 0 0 122 45 30 0.274 0.73 1.06 Intr + 95291 95410 120 1 0 52 47 117 0.419 3.77 1.07 Intr + 99711 99815 105 2 0 54 56 193 0.759 12.19 1.08 Intr + 99896 100103 208 1 1 78 89 217 0.399 18.53 1.09 Intr + 116138 116346 209 0 2 46 107 94 0.605 4.97 1.10 Term + 128963 129157 195 1 0 37 41 198 0.270 6.43 1.11 PlyA + 129257 129262 6 1.05 2.00 Prom + 142317 142356 40 -6.15 2.01 Init + 144329 144385 57 1 0 56 103 38 0.581 3.46 2.02 Term + 151427 151759 333 1 0 66 54 313 0.266 19.33 2.03 PlyA + 152248 152253 6 1.05 3.00 Prom + 152782 152821 40 -6.15 3.01 Sngl + 154701 155609 909 2 0 60 32 283 0.760 15.91 3.02 PlyA + 155628 155633 6 -0.45 4.00 Prom + 155999 156038 40 -3.65 4.01 Sngl + 156738 157670 933 2 0 43 39 336 0.955 20.40 4.02 PlyA + 157931 157936 6 1.05 5.06 PlyA - 158702 158697 6 1.05 5.05 Term - 159448 159310 139 0 1 35 50 164 0.896 3.75 5.04 Intr - 160175 160073 103 0 1 73 97 39 0.920 1.61 5.03 Intr - 163204 163040 165 2 0 71 97 101 0.597 8.31 5.02 Intr - 167027 166852 176 1 2 97 85 96 0.586 8.86 5.01 Init - 169069 168990 80 1 2 45 65 21 0.643 -3.74 5.00 Prom - 170476 170437 40 -8.25 6.00 Prom + 171923 171962 40 -0.95 6.01 Sngl + 172025 172357 333 1 0 45 50 155 0.843 3.17 6.02 PlyA + 172745 172750 6 1.05 7.02 PlyA - 174304 174299 6 1.05 7.01 Sngl - 185325 183577 1749 0 0 37 39 655 0.952 49.98 7.00 Prom - 189187 189148 40 -7.75 8.00 Prom + 190216 190255 40 -4.95 8.01 Sngl + 193387 193968 582 0 0 25 47 307 0.755 16.23 8.02 PlyA + 194659 194664 6 1.05 9.00 Prom + 196134 196173 40 -6.25 9.01 Sngl + 200439 200741 303 2 0 88 54 330 0.985 25.18 9.02 PlyA + 201325 201330 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 36923 36756 168 2 0 56 53 166 0.885 4.41 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_1|661_aa XKLTIKQSQAGPSGGIPEEGIVITGDDSSRCVIALEDLTVEQDVESPTFSCFSARCSATC VALPAEVFYGHRIGAWQAKKRHSVCLSSGSSLSRCMLKGPAKTLDYPGGGAGLSRLTPAL PDTGAFCKNGADQHSHRPRRAPDESRSLKTAAIRGGWGGVAKPPTASVPCFRPEVGARVT GRAEVVATSLTSRPLERKRPGWERKRRRRKGWVRGRVRSPAEEEPGPSRRYTGVTRAKKG KGGGGSIRLRTDAGLLSVSVQSWCLPGPTGLTRTTQPQDVRSDGTTASCHAPPLPLLLPL HRRGSVRQRRRPEHKTDIFLDYSSVKSPSVFWIGKNRSEPFGRQHARLGKLGTHLTLTFL PGRTHGPKGPLLALNSAALEEKDEWADRAEKDRVSKLNQRMFSGSKNVPECEIVVTSKIA EFKHLQTLKAQEEGHLHTRFRGGPRYPPVPAPRQSAASAEAASARRPLPLPAQARQLGQS GGGSGEEWTPLARMVLESVARIVKVQLPAYLKRLPVPESITGFARLTEWLRLLPFLGVLA LLGYLAVRPFLPKKKQQKDSLINLKIQKENPKVVNEINIEDLCLTKAAYCRCWRSKTPCP SRAGKIIGMLLEIGYLELLHMLESPEPRCTKVDKGIAVPQDHQAKEAAQKAVNGATGVPI I >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_1|1986_bp nnaaagttaactataaaacagtctcaggcaggtccttcaggaggtattccagaagaaggc attgttatcacaggagatgacagctccaggtgtgttattgcacttgaagaccttacagtg gaacaagatgtggagtctccaacgttcagttgcttctctgctcgctgctcagccacttgt gttgctctgccagctgaagtcttctatgggcacaggataggagcatggcaggccaaaaag cgccatagtgtgtgcttgtcgtctggctcctcactctctcggtgtatgctcaaaggtccg gccaaaactcttgattatcccggcggcggggcaggattgtctcgtctcacaccagctctg ccagacacaggcgccttttgcaaaaacggagcagatcagcactcgcacaggccccgaaga gcccctgatgaatccaggtcccttaagacagccgcgatccggggtgggtggggtggcgtg gcaaagccaccaacagcctctgtaccgtgctttcggcccgaagtgggggcgagggtgacg ggaagagcggaggtggtggctacaagtttaacttcccggcctctagaaaggaagagaccc gggtgggagaggaagaggcgtaggagaaaggggtgggtgcggggcagggtccggagccca gcagaggaggagccggggcctagtcggcgctataccggcgtcactagggcaaagaaaggc aaggggggagggggaagcataagactccggacggacgccgggctgctttcggtttctgtc caaagctggtgcctccccggccctacggggctcacgcgcacgacacagccacaagatgtc cgctctgacggaactactgccagctgccacgctccgcccctccccctcctcctgcctctt caccgccgcggatcagtccgccaacgccgccgtcccgagcacaagaccgatatattccta gactattcatctgtgaagagtccctcagtattctggatagggaaaaacagaagtgagcct tttgggcggcaacacgcaaggctagggaagctgggcactcacttgactcttactttcctt cctgggagaactcatgggccaaaagggcctctcttggcactgaactctgctgccttggag gagaaggatgaatgggctgacagagcagaaaaagacagggtttcaaaactaaatcaaagg atgtttagtggcagtaaaaatgtaccagaatgtgagattgtggtcaccagtaaaattgct gaatttaagcatctacagacgttaaaggcccaagaggagggccaccttcacacgagattc cgaggagggcctcgctacccgccagttcccgcgcctcggcagtccgccgcgagcgccgag gccgccagtgcccgccggccgcttccgctcccggcgcaggcgcggcagcttggccagagc ggagggggctcgggagaggagtggacgccgctggccaggatggtgctggagagcgtggcc cgtatcgtgaaggtgcagctccctgcatatctgaagcggctcccagtccctgaaagcatt accgggttcgctaggctcacagaatggcttcggttattgcctttccttggtgtactcgca cttcttggctaccttgcagttcgtccattcctcccgaagaagaaacaacagaaggatagc ttgattaatcttaaaatacaaaaggaaaatccgaaagtagtgaatgaaataaacattgaa gatttgtgtcttactaaagcagcttattgtaggtgttggcgttctaaaacgccatgccct agtcgtgctggtaaaatcattggcatgttgttggagattggttatttagaacttcttcat atgcttgaatctccagagcctcgctgtactaaggttgacaaaggtatagctgtaccacaa gaccaccaagctaaagaggctgcccagaaagcagttaatggtgccactggtgttccaatt atttaa >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_2|129_aa MHSQKRFERLLEPIAMLIQDRSSLSAMEQSWMENDFDELTEVGFRRLVITNFSELKEDVG THRKEGKNLEKRLDEWLTRINSVEKTLSDLMELKTLARELCDTCTSFNSRFNQVEERVSV IEDQINEIK >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_2|390_bp atgcattcccagaaaaggtttgagaggctgctagaacctatagccatgctgattcaggat cgcagctccttgtcagcaatggaacaaagctggatggagaatgactttgacgagttgaca gaagtaggcttcagaaggttggtaataacaaacttctccgagctaaaggaggatgttgga acccatcgcaaggaaggtaaaaaccttgaaaaaagattagatgaatggcttacaagaata aacagtgtagagaagaccttaagtgacctgatggagctgaaaaccttggcacgagaactt tgtgacacatgcacaagcttcaatagccgattcaatcaagtggaagaaagggtatcagtg attgaagatcaaataaatgaaataaagtga >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_3|302_aa MSELPFTIASKRIKYLGIQLTREMKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTFKFIWNQKRTCIAKTVLSQKNKAGGITLP DFKLHYKATVTKTAWYWCQSRDIDQWNRIEPSEIIRHIYNHLIFDKPDINEKWGKDSLFN KWCWENGLAIRRKLKLDPFLTLYIKINSRWIKDLHVRPKTIKTLEEYLGNTIQAIGMGKE LMTKTPIAMATKAKIDKWDLIKLESFCTAKETTIRVNRQPTEWEKIFTIYPSDKGLISRI YK >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_3|909_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggaaatgaaggacctcttcaaggagaactacaaaccacttctcaacgaaataaaa gaggacacaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtg aaaatggccatattgccaaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttcaagttcatatggaaccaaaaaaga acctgcattgccaagacagtcttaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactacactacaaggctacagtaaccaaaacagcatggtactggtgccaaagc agagatatagaccaatggaacagaatagagccctcagaaataatacgacacatctacaac catctgatctttgacaaacctgacataaacgagaaatggggaaaggattccctatttaat aaatggtgctgggaaaacgggctagcaatacgtagaaaactgaaactggatcccttcctt acactttatataaaaattaattcaagatggattaaagacttacatgttagacctaaaacc ataaaaaccctagaagaatacctaggcaataccattcaggccataggcatgggcaaagaa ttaatgactaaaacaccaatagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactagagagcttctgcacagcaaaagaaactaccatcagagtgaacagacaacct acagaatgggagaaaatttttacaatctacccatctgacaaagggctaatatccagaatc tacaaataa >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_4|310_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFISGMQGWFNICNSINIIHYINRTKDKNHM IISIDAEKAFDKIQHSFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFLLKT GTRQGCPLSPLLFNMVLEVLTSAIRQEKEINGIQLGKEEVKLSLFADDMIVYLENPIISA QNLLKLISNFSKVSGYKINVQKSQAFLYTINRQTESQIMSELPFTIATKRIKYLGIQLTR VVKDLFKENYKPLLNEIKEDTNKLKNIPCSWIGRINIMEMAILPEVIYRFNTIPIKLLMT FFTELEKKLL >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_4|933_bp atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccacgatcaagtcggcttcatctctgggatgcaaggctggttcaac atatgtaactcaataaacataatccattacataaacagaaccaaagacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacactccttcatgctaaaa actctcaataaactaggtattgatgggacgtatctcaaaataataagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccttttgaaaacc ggcacaagacaaggatgccctctctcaccactcctattcaacatggtgttggaagttctg accagtgcaatcaggcaggagaaagaaataaatggtattcagttaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcctatacaccattaatagacaaacagagagccaaatcatgagt gaactcccattcacaattgctacaaagagaataaaatacctaggaatccaacttacaagg gttgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggac acaaacaaattgaagaatattccatgctcatggataggaagaatcaatatcatggaaatg gccatactgcccgaagtaatttatagattcaatactatccccatcaagctactaatgact ttcttcacagaattggaaaaaaaactgctttaa >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_5|220_aa MGIVAYELGLLTADTVGLCFFIQLAILAGTIPTDLLSGPSVLSTEVFPVHWQQHSDGVCQ PECFIVVIVARSLLTHMCQQQQHTAGAVSPAVVVPYMMVLQENGYGVEEGIPTLLMAASS MDDILAITGFNTCLSIVFSSENRETRSNGACSQGACILKKNYKWDEYYIVKINIKGWNSW EDSEEDQKMWESLELPRDLLNGFDRTADSDLDSEVQAEEV >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_5|663_bp atgggtatcgttgcatatgagctgggtctcttgacagcagatacagttgggctttgcttc tttatccaacttgccattctggctggtaccatacccacagatttgttgtctggcccttca gtgttaagcactgaggtgttcccagtccactggcaacaacactctgatggggtgtgccag ccagagtgcttcattgtagtgattgtagcaaggtccctactcacacatatgtgccagcag cagcagcacacagcaggtgctgtctctcctgctgttgttgtcccttacatgatggtgctg caagaaaatggatatggtgttgaggaaggcattccaaccttattaatggctgctagcagt atggatgacattctggctatcactggattcaatacatgcttgagcatagtcttttcctca gaaaacagagagacaagatcaaatggtgcatgttctcaaggagcttgtatattaaagaaa aattacaaatgggatgaatattacattgtgaagattaatattaaagggtggaacagttgg gaggactcagaagaagaccagaagatgtgggaaagtttggaactgcctagagacttgttg aatggctttgaccgaactgctgatagtgacttggacagtgaagtccaggctgaggaggtc tga >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_6|110_aa MKTPKTIATKTKIDKWDLIKLKGFCTAKEIINKPPMEWEKIFANHAFDKSLISRIQKELR KINKQKPNNPIKWAKERTRHFSKEDKHVVNKYMEKYSTSLTLEKCKTKLQ >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_6|333_bp atgaagacgccaaaaacaattgcaacaaaaacaaaaattgacaaatgggatctaattaaa ctaaagggcttctgcacagcaaaggaaatcatcaacaaaccacctatggaatgggagaaa atatttgcaaaccatgcattcgacaaaagtctaatatccagaatccaaaaagaacttaga aaaatcaacaagcaaaaacctaacaaccccattaaatgggcaaaggagaggaccagacac ttctcaaaagaagacaaacacgtagtcaacaagtatatggaaaaatattcaacatcacta acattagagaaatgcaaaacaaaactgcagtga >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_7|582_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEAFPLKT STRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSA QNLLKLISNFSKVSGYEINVQKSQAFLYTNNRQTESQIMGELPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMT FFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRD IDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTP YTKINSRWIKDLNVKPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKDKIDKWDLIK LKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKD MNRHFSKEDIYAAKKTHEEMLIITGHQRNANQNHYEISSHTS >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_7|1749_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaat atacgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatg attatctcaatagatgcagaaaaagcctttgacaaaattcaacaacccttcatgctaaaa actctcaataaattaggtattgatgggacgtatttcaaaataataagagctatctatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaacc agcacaagacagggatgccctctctcaccgctcctattcaacatagtgttggaagttctg gccagggcaatcaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagacgacatgattgtttatctagaaaaccccatcgtctcagcc caaaatctccttaagctgataagcaacttcagcaaagtctcaggatacgaaatcaatgta caaaaatcacaagcattcttatacaccaacaacagacaaacagagagccaaatcatgggt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagaggac acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatg gccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgc attgccaagtcaatcctaagccaaaagaataaagctggaggcatcacactacctgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactatctg atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tatacaaaaatcaattcaagatggattaaagatttaaacgttaaacctaaaaccataaaa accctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatg tccaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatctaattaaa ctaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacaaca tgggagaaaattttcgcaacctactcatctgacaaaggactaatatccagaatctacaat gaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggac atgaacagacacttctcaaaagaagacatttatgcagccaaaaaaacacatgaagaaatg ctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcatctcac accagttag >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_8|193_aa MFELFNVPGFYIAVQEVLALAVSWTSQQVGECMLMSIVIDKGDGVTHVLPVVEGYVIGSC INHILIVGDTVYFTQQLLREREVGIPLEQSLETTKAIKEKYCYICPDTVKEFAKHDVDSW KWIKQYTGINVINQEKFIIDVGYKRFLQPEIFFYPEFANPDFMESILNVVDEYKTVPLMC MVHCIRMLFFQGV >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_8|582_bp atgtttgaattatttaatgtaccaggattctacattgcagttcaggaggtactggccttg gcagtatcttggacatctcaacaagtgggtgaatgtatgttaatgagtatagtcattgac aaaggagatggagtcacccatgttctcccagttgtagaaggttatgtaattgggagctgc atcaatcacatcctgattgtaggtgatactgtgtatttcactcaacagctgctaagggag agggaggtaggaatccctcttgagcagtcactggagaccacaaaagccattaaggagaaa tactgttacatttgccctgatacagtcaaggaatttgctaagcatgatgtggattcctgg aagtggatcaaacaatacacaggtatcaatgtgatcaaccaggagaagttcataatagat gttggttacaaaaggttcctgcaacctgaaatatttttttacccagagtttgccaaccca gactttatggaatccatcttgaatgttgttgatgaatacaaaactgtcccattgatgtgc atggtccactgtataagaatgttgttctttcaaggggtttga >gi568815594f:102769085_102987427|GENSCAN_predicted_peptide_9|100_aa MGGNKSRQAENSKNQSASSPPKDRSSSPAMEQSWMENDFDELTEVGFRRSVITNFSEPKE DVRTHCKEAKNLEQRLDEWLTRINNIEKTFNDLMELKTMA >gi568815594f:102769085_102987427|GENSCAN_predicted_CDS_9|303_bp atggggggaaacaagagcagacaagctgaaaattcaaaaaaccagagtgcctcttctcct ccaaaggatcggagctcctcgccagcaatggaacaaagctggatggagaatgactttgac gagttgacagaagtaggctttagaaggtcagtaataacaaacttctccgagccaaaggag gatgttcgaacccattgcaaggaagctaaaaaccttgaacaacgattagatgaatggcta actagaataaacaacatagagaagacctttaatgacctgatggagctgaaaaccatggca tga