GENSCAN 1.0    Date run: 5-Nov-116    Time: 01:05:01

Sequence gi568815592f:36030583_36239370 : 208788 bp : 43.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  22117  22246  130  1  1  106  101    61 0.936   9.90
 1.02 Intr +  28707  28765   59  2  2  116   56    41 0.195   1.38
 1.03 Intr +  42291  42402  112  0  1   88   87    76 0.971   7.88
 1.04 Intr +  42464  42523   60  1  0  122  103   -19 0.788   1.93
 1.05 Intr +  43467  43514   48  2  0  100   93    38 0.954   4.38
 1.06 Intr +  45266  45380  115  1  1  101   67   157 0.998  14.92
 1.07 Term +  53330  53538  209  0  2   54   42    72 0.246  -3.30
 1.08 PlyA +  54509  54514    6                               1.05

 2.03 PlyA -  59567  59562    6                               1.05
 2.02 Term -  61861  61407  455  2  2   62   48   322 0.983  20.92
 2.01 Init -  62232  62205   28  0  1   82   40    32 0.676  -3.39
 2.00 Prom -  63589  63550   40                              -7.46

 3.00 Prom +  63854  63893   40                              -3.96
 3.01 Init +  64736  64843  108  1  0   57   49   117 0.443   2.95
 3.02 Intr +  71989  72067   79  0  1   69  106    37 0.494   2.72
 3.03 Intr +  76873  77046  174  2  0   40  106   222 0.839  19.11
 3.04 Term +  77798  77865   68  0  2   89   53    62 0.912   0.90
 3.05 PlyA +  79264  79269    6                               1.05

 4.00 Prom +  84902  84941   40                              -3.66
 4.01 Init +  88709  88787   79  1  1   90   61    58 0.568   4.52
 4.02 Term +  89181  89290  110  1  2   31   48   137 0.588   2.67
 4.03 PlyA +  89371  89376    6                               1.05

 5.00 Prom +  97887  97926   40                              -6.46
 5.01 Init + 100001 100119  119  1  2   71   69   275 0.949  23.57
 5.02 Intr + 100689 100818  130  0  1   65   61   347 0.836  30.40
 5.03 Intr + 102039 102097   59  2  2   86   70    92 0.977   4.88
 5.04 Intr + 105171 105279  109  0  1   66   98   197 0.979  18.79
 5.05 Intr + 105437 105466   30  1  0   96   96     4 0.556   0.33
 5.06 Intr + 105902 105949   48  1  0  114  113    71 0.997  11.08
 5.07 Intr + 106074 106272  199  2  1   73   61   293 0.865  24.02
 5.08 Intr + 107783 107862   80  0  2  -31   64   153 0.890   0.27
 5.09 Intr + 108120 108198   79  2  1  123   96    40 0.973   7.42
 5.10 Intr + 108297 108473  177  1  0   92   57   335 0.983  30.69
 5.11 Term + 108712 108791   80  2  2   71   50   132 0.974   5.63
 5.12 PlyA + 109448 109453    6                               1.05

 6.00 Prom + 117713 117752   40                              -3.26
 6.01 Init + 166392 166561  170  2  2   67   88   116 0.576   6.51
 6.02 Intr + 168630 168685   56  0  2   68   56    50 0.452  -1.58
 6.03 Intr + 169715 171188 1474  0  1  107  115  1575 0.652 150.89
 6.04 Intr + 174076 174232  157  1  1   61   96    24 0.928   0.51
 6.05 Intr + 176731 176862  132  0  0   76  110   245 0.999  26.34
 6.06 Intr + 179205 179384  180  2  0  136    1   181 0.751  14.26
 6.07 Intr + 179634 179946  313  2  1   36   93   383 0.754  29.36
 6.08 Intr + 180676 180978  303  2  0  104   78   546 0.998  51.76
 6.09 Intr + 183298 183804  507  2  0   84   60   302 0.598  19.95
 6.10 Intr + 187335 187428   94  1  1   86  110    67 0.657   7.72
 6.11 Intr + 191586 191683   98  0  2   98   94   159 0.999  17.15
 6.12 Intr + 194685 194782   98  1  2  108   97   159 0.999  18.53
 6.13 Intr + 198320 198474  155  1  2  117   53   197 0.993  17.97
 6.14 Intr + 199844 200023  180  2  0   87   67   250 0.597  21.78
 6.15 Intr + 200074 200134   61  1  1  105   66    25 0.533   0.74
 6.16 Term + 202751 202876  126  1  0   82   37    70 0.132  -0.42
 6.17 PlyA + 203737 203742    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_1|244_aa
XAAFDTKTGLRVAVKKLSRPFQSIIHAKRTYRELRLLKHMKHENVIGLLDVFTPARSLEE
FNDVYLVTHLMGADLNNIVKCQKLTDDHVQFLIYQILRGLKGLNKQVNNFSNGKFKHIKS
QDLKPSNLAVNEDCELKILDFGLARHTDDEMTGYVATRWYRAPEIMLNWMHYNQTGVARH
LMQEHSHWHQVGAPLGRSSQRKEQAAIFAVLQSPLVTPPGTGGTQAKSLEWTPSKLQQPY
GRGA

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_1|735_bp
nntgctgcttttgacacaaaaacggggttacgtgtggcagtgaagaagctctccagacca
tttcagtccatcattcatgcgaaaagaacctacagagaactgcggttacttaaacatatg
aaacatgaaaatgtgattggtctgttggacgtttttacacctgcaaggtctctggaggaa
ttcaatgatgtgtatctggtgacccatctcatgggggcagatctgaacaacattgtgaaa
tgtcagaagcttacagatgaccatgttcagttccttatctaccaaattctccgaggtcta
aagggcttaaacaaacaagtaaacaacttttctaatggaaaattcaaacacataaaatca
caggacctaaaacctagtaatctagctgtgaatgaagactgtgagctgaagattctggat
tttggactggctcggcacacagatgatgaaatgacaggctacgtggccactaggtggtac
agggctcctgagatcatgctgaactggatgcattacaaccagacaggggtcgccagacac
cttatgcaagagcattcccactggcatcaggttggtgcccctctgggacggagctcccag
aggaaggaacaggcagccatctttgctgttctgcagtctccactggtgacacctccaggt
acaggagggacccaggcaaagagtctggagtggacccccagcaaactgcagcagccctat
ggaagaggggcctga

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_2|160_aa
MAHPFITSGENASLRVYRSGETVLTTNGWLLHRVPCRELIEDGETPGAAALWELEEETGY
KGDAAECSPVVYMDPGLSNCTTHIVTVIINGDDAENVRPKPKPGDGEFVEVISLPKNDLL
QGLDALVAEKHLTVDAKVYSHALALKHANVKPSEVPVLKF

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_2|483_bp
atggcacatcctttcatcacctcgggagagaacgcttcactacgagtgtatcgttctggt
gaaacagtcctgaccaccaatgggtggctactgcatagagttccctgcagggaactcata
gaagatggtgaaaccccaggagcagctgctctttgggagcttgaggaagaaactggctac
aaaggggacgctgctgaatgttctccagtggtctatatggacccaggtttgtcaaactgt
accacacacatcgtgacagtaatcattaacggagatgacgcagaaaatgtaaggcccaag
ccaaagccaggggatggagaatttgtggaagtcatttcgttacccaagaatgacctgctg
cagggactcgatgctctcgtagctgaaaaacatcttacagtggatgccaaagtctattcc
catgctctagcactgaaacatgcaaatgtgaagccatctgaagtgcccgtcctgaaattt
taa

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_3|142_aa
MALLKEHAMGERPLLATLPHSACYSQCERRGFPFQRARNYIQSLTQMPKMNFANVFIGAN
PLAVDLLEKMLVLDSDKRITAAQALAHAYFAQYHDPDDEPVADPYDQSFESRDLLIDEWK
SLTYDEVISFVPPPLDQEEMES

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_3|429_bp
atggcgctcctaaaggaacatgcgatgggagagcgccctctgctggcgactctccctcat
agtgcctgctatagccagtgtgagaggcgaggcttccctttccagagggcaagaaactat
attcagtctttgactcagatgccgaagatgaactttgcgaatgtatttattggtgccaat
cccctggctgtcgacttgctggagaagatgcttgtattggactcagataagagaattaca
gcggcccaagcccttgcacatgcctactttgctcagtaccacgatcctgatgatgaacca
gtggccgatccttatgatcagtcctttgaaagcagggacctccttatagatgagtggaaa
agcctgacctatgatgaagtcatcagctttgtgccaccaccccttgaccaagaagagatg
gagtcctga

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_4|62_aa
MQEKANEFARAKTVELSYIGVHLAGFRRPPQRGITQQKHVSSAEAEKQRRLNILADLTNN
LK

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_4|189_bp
atgcaggaaaaggcaaatgaatttgcccgtgcaaaaactgtagagctttcatacattggc
gtccatttggcaggattcagacggcccccacaacgaggaattacccagcagaaacatgtc
agcagtgccgaggctgagaaacagagacgactcaacatcttggcagatctcaccaacaac
cttaagtga

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_5|369_aa
MSLIRKKGFYKQDVNKTAWELPKTYVSPTHVGSGAYGSVCSAIDKRSGEKVAIKKLSRPF
QSEIFAKRAYRELLLLKHMQHENVIGLLDVFTPASSLRNFYDFYLVMPFMQTDLQKIMGM
EFSEEKIQYLVYQMLKGLKYIHSAGVVHRDLKPGNLAVNEDCELKILDFGLARHADAEMT
GYVVTRWYRAPEVILSWMHYNQTGQWSMPERAVLGPSGHSVLTDCWAPDSPDLDQLTQIL
KVTGVPGTEFVQKLNDKAAKSYIQSLPQTPRKDFTQLFPRASPQAADLLEKMLELDVDKR
LTAAQALTHPFFEPFRDPEEETEAQQPFDDSLEHEKLTVDEWKQHIYKEIVNFSPIARKD
SRRRSGMKL

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_5|1110_bp
atgagcctcatccggaaaaagggcttctacaagcaggacgtcaacaagacagcctgggag
ctgcccaagacctacgtgtccccgacgcacgtcggcagcggggcctatggctccgtgtgc
tcggccatcgacaagcggtcaggggagaaggtggccatcaagaagctgagccgacccttt
cagtccgagatcttcgccaagcgcgcctaccgggagctgctgctgctgaagcacatgcag
catgagaacgtcattgggctcctggatgtcttcaccccagcctcctccctgcgcaacttc
tatgacttctacctggtgatgcccttcatgcagacggatctgcagaagatcatggggatg
gagttcagtgaggagaagatccagtacctggtgtatcagatgctcaaaggccttaagtac
atccactctgctggggtcgtgcacagggacctgaagccaggcaacctggctgtgaatgag
gactgtgaactgaagattctggattttgggctggcgcgacatgcagacgccgagatgact
ggctacgtggtgacccgctggtaccgagcccccgaggtgatcctcagctggatgcactac
aaccagacaggtcagtggtcaatgcctgagagggcggtcctggggccatctggtcactcg
gtgctgactgactgctgggccccagacagtccagacctggaccagctgacccagatcctg
aaagtgaccggggtgcctggcacggagtttgtgcagaagctgaacgacaaagcggccaaa
tcctacatccagtccctgccacagacccccaggaaggatttcactcagctgttcccacgg
gccagcccccaggctgcggacctgctggagaagatgctggagctagacgtggacaagcgc
ctgacggccgcgcaggccctcacccatcccttctttgaacccttccgggaccctgaggaa
gagacggaggcccagcagccgtttgatgattccttagaacacgagaaactcacagtggat
gaatggaagcagcacatctacaaggagattgtgaacttcagccccattgcccggaaggac
tcacggcgccggagtggcatgaagctgtag

>gi568815592f:36030583_36239370|GENSCAN_predicted_peptide_6|1367_aa
MGAAAGPDRAGAGASGPSGTRDCTGGNRRSGGLSEQGGGGAKALSWSEGGGREGEGGSAA
ALDSHRSANPTVNCACLSPQFPGAMRKPRRKSRQNAEGRRSPSPYSLKCSPTRETLTYAQ
AQRIVEVDIDGRLHRISIYDPLKIITEDELTAQDITECNSNKENSEQPQFPGKSKKPSSK
GKKKESCSKHASGTSFHLPQPSFRMVDSGIQPEAPPLPAAYYRYIEKPPEDLDAEVEYDM
DEEDLAWLDMVNEKRRVDGHSLVSADTFELLVDRLEKESYLESRSSGAQQSLIDEDAFCC
VCLDDECHNSNVILFCDICNLAVHQECYGVPYIPEGQWLCRCCLQSPSRPVDCILCPNKG
GAFKQTSDGHWAHVVCAIWIPEVCFANTVFLEPIEGIDNIPPARWKLTCYICKQKGLGAA
IQCHKVNCYTAFHVTCAQRAGLFMKIEPMRETSLNGTIFTVRKTAYCEAHSPPGAATARR
KGDSPRSISETGDEEGLKEGDGEEEEEEEVEEEEQEAQGGVSGSLKGVPKKSKMSLKQKI
KKEPEEAGQDTPSTLPMLAVPQIPSYRLNKICSGLSFQRKNQFMQRLHNYWLLKRQARNG
VPLIRRLHSHLQSQRNAEQREQDEKTSAVKEELKYWQKLRHDLERARLLIELIRKREKLK
REQVKVQQAAMELELMPFNVLLRTTLDLLQEKDPAHIFAEPVNLSEASSPYFPSSKFFLN
WNRVPDYLEFISKPMDFSTMRRKLESHLYRTLEEFEEDFNLIVTNCMKYNAKDTIFHRAA
VRLRDLGGAILRHARRQAENIGYDPERGTHLPESPKLEDFYRFSWEDVDNILIPENRAHL
SPEVQLKELLEKLDLVSAMRSSGARTRRVRLLRREINALRQKLAQPPPPQPPSLNKTVSN
GELPAGPQGDAAVLEQALQEEPEDDGDRDDSKLPPPPTLEPTGPAPSLSEQESPPEPPTL
KPINDSKPPSRFLKPRKVEEDELLEKSPLQLGNEPLQRLLSDNGINRLSLMAPDTPAGTP
LSGVGRRTSVLFKKAKNGVKLQRSPDRVLENGEDHGVAGSPASPASIEEERHSRKRPRSR
SCSESEGERSPQQEEETGMTNGFGKHTESGSDSECSLGLSGGLAFEACSGLTPPKRSRGK
PALSRVPFLEGVNGDSDYNGSGRSLLLPFEDRGDLEPLELVWAKCRGYPSYPALIIDPKM
PREGLLHNGVPIPVPPLDVLKLGEQKQAEAGEKLFLVLFFDNKRTWQWLPRDKVLPLGVE
DTVDKLKMLEGRKTSIRKSVQVAYDRAMIHLSRVRGPHSFVTSSYLAQRSLFCPCQMYGR
QLPPLMEPLLLIQKESDPLDTPDLPSLLNLRLKDAHTSYVTAAVLIL

>gi568815592f:36030583_36239370|GENSCAN_predicted_CDS_6|4104_bp
atgggggcggcagcggggccggacagagcgggggcgggggcatcggggccttcggggacc
agggactgcacggggggaaaccggagaagcggggggctttccgagcagggcgggggcggt
gcgaaagctctctcctggagcgagggcggcggcagggaaggagaaggcggatcagcggct
gctttggattctcataggagcgcgaaccctactgtgaactgcgcatgcctgtcccctcag
ttcccaggtgccatgaggaagcctcgtcggaagtcccggcagaatgccgagggccggcgt
tccccgtccccctacagtctcaagtgctcacccacccgggagaccctgacatatgcccag
gcccagcggattgtcgaggtagacattgatggacgcctgcatcgtatcagcatctatgac
ccactcaaaatcattactgaagatgagctaactgcccaggatatcaccgaatgcaatagt
aacaaggaaaacagtgaacagcctcagttccctggcaagtccaagaaaccctcatccaag
ggcaaaaagaaggaatcctgctccaagcatgcatctggtacttccttccacctcccacag
cccagcttccgtatggtggactcaggcatccagccagaagcacccccgctgcctgctgcc
tactaccgctacattgagaagccacctgaagacctggatgcagaggtagagtatgacatg
gatgaggaggaccttgcctggctggacatggtgaatgaaaaacggcgagtagatgggcac
agtttggtgtctgcagatacctttgagctgctggtagaccggcttgagaaagagtcatac
ttggagagtcgcagcagtggggcccaacagtcactcatcgatgaagacgctttctgctgt
gtgtgcctggatgatgaatgtcacaatagcaatgttattctcttctgtgacatctgcaac
ctggctgtacaccaggagtgctatggcgtcccatacatccctgagggccagtggctatgc
cgctgctgcctgcagtctccctcccggcctgtggattgcatcctttgccccaataagggt
ggcgccttcaaacagaccagtgatgggcactgggcccatgtggtgtgtgccatctggatc
cctgaagtctgctttgctaacaccgtgttcttggaacctattgagggcattgacaatatc
ccgcctgcccgctggaaactaacctgctatatctgcaagcagaaagggctaggtgcagcc
atccagtgccataaggtgaactgctacacagcattccatgtgacatgtgcacagcgggct
gggctcttcatgaagattgagcccatgcgcgaaaccagcctcaatggcaccatctttaca
gtgcgcaagactgcctactgtgaggcccactcgccaccaggtgcggccactgctaggagg
aagggcgactcccctagaagcatcagtgagactggcgatgaggaagggctgaaggagggt
gatggagaggaggaagaagaggaagaggtggaggaagaagagcaggaagctcaaggcggg
gtgagtggctccctcaagggagtgcccaagaaaagcaagatgagtttgaagcagaagatc
aagaaggagccagaggaagcaggccaagacacaccctccactctccccatgcttgctgtc
ccacagataccctcttacaggttgaacaagatctgtagtggtctctcctttcagaggaaa
aaccagtttatgcagcggcttcacaattattggctgttgaagcggcaggcacggaatggt
gtccctcttatccggcgcttgcactcccatctgcagtcccaaagaaacgctgagcagcga
gagcaggatgagaagacaagtgcagtgaaggaggagctgaagtattggcagaagctccgg
catgacttggagcgggcgcggctgctgattgagctgattcggaagagagagaagctcaaa
cgagagcaggtcaaagtccagcaggctgccatggagctggagctgatgccattcaatgtt
ctgttgaggacaacactggacctgctgcaggagaaggatcctgcacacatcttcgcagaa
ccagtcaacttgagtgaggcaagttccccctactttccaagtagtaaattcttcttgaac
tggaacagggttccagattacctggaattcatatccaagccaatggatttttctactatg
aggcggaagctggagtcccacctgtaccgcaccttggaggagtttgaggaggactttaac
cttatagttaccaactgcatgaagtataatgctaaagacacaattttccaccgagcagct
gtccgcctgcgggacctgggaggggccatcctacggcacgcccggcggcaggcagagaac
atcggctatgaccccgagaggggcactcacctgcccgagtcacccaaattggaagacttt
taccgcttctcctgggaagacgtggacaacatcctcatcccagagaaccgggcccatttg
tccccagaggtgcagctgaaggagctgctggagaaactggacctggtgagcgccatgcgg
tccagtggggcccgcacccgtcgtgtccgcctgctacgccgggagatcaatgcccttcgg
cagaagctggcacagccaccaccaccacagccaccatcactcaacaagacagtatccaat
ggggagctgccagcagggccccagggggatgcagctgtgctggagcaggccttgcaggag
gagccagaagacgatggggacagagatgactccaaactgcctcctccgccaaccctggag
cccactgggcctgcaccttccttgtctgagcaagaatcccccccggagccccctactctg
aaacccattaatgatagcaaacctccaagcaggttcctaaagcccagaaaggtggaagaa
gatgagctcttggaaaaatcaccactgcagctagggaatgagcctttgcaacgcttgctc
agtgacaatggcatcaacagactatccctcatggcccctgacaccccggccggtacccca
cttagtggtgtgggtcgccgcacatcagtcctcttcaagaaggccaagaatggggttaag
ctacagagaagcccagacagggtcctggagaatggcgaggaccatggtgtggcaggctct
cctgcctctccagccagcatcgaggaagagcgccactcccggaagcggccaaggagcagg
agctgtagtgagagcgaaggggagaggtccccccagcaggaggaagagacaggcatgacc
aacggctttggaaaacacaccgaaagcgggtctgactctgaatgtagtttgggtctcagt
ggtggactggcatttgaagcttgcagtggtctgacgccccccaaacgcagccgtgggaag
ccagccctgtctcgagtgcccttcctggaaggtgtgaacggagactctgactacaatggc
tcaggcagaagcctcctgctgccctttgaagaccgcggagacctggagcccttggagctg
gtgtgggccaagtgccgaggctacccctcctaccctgccttgatcatcgatcccaagatg
ccccgggagggcctcctgcacaatggcgttcccatccctgtccccccgctggacgtgctg
aagctgggagagcagaaacaggcagaggctggagagaagctcttccttgtcctcttcttt
gacaacaagcgcacctggcagtggcttccaagggacaaagtcctgcccttgggtgtggaa
gacaccgtggacaagctcaagatgctggaaggccgcaagaccagcatccgcaagtcagtg
caggtggcctatgaccgtgcgatgatccacctgagcagagtccgggggccccactccttc
gtcacttccagctacctggcacagagaagcctcttctgcccctgccagatgtatggccgg
cagcttccccctctcatggagcctctgctcctcattcagaaggagagtgaccctttggac
acacctgacttaccaagcttgctgaacttgagactcaaggatgctcacacctcttatgtc
actgctgctgttttaattctgtag