GENSCAN 1.0	Date run: 3-Nov-116	Time: 19:03:44

Sequence gi568815592f:34426306_34632527 : 206222 bp : 50.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7352   7421   70  1  1   64   70   101 0.943   7.11
 1.02 Intr +   9716   9827  112  0  1   74   31    35 0.275  -4.16
 1.03 Intr +  12299  12370   72  2  0  107   67    72 0.354   5.52
 1.04 Intr +  12816  12911   96  0  0   69   69   118 0.295   7.12
 1.05 Intr +  22155  22193   39  0  0  125  103    31 0.773   5.64
 1.06 Term +  22911  23025  115  0  1   53   42    99 0.719  -0.16
 1.07 PlyA +  23684  23689    6                               1.05

 2.00 Prom +  24343  24382   40                              -6.56
 2.01 Init +  29441  29447    7  1  1  114  100     0 0.372   4.99
 2.02 Intr +  39819  39965  147  1  0  133   91    80 0.849  13.01
 2.03 Intr +  46410  46452   43  1  1  138  116    -5 0.498   4.60
 2.04 Intr +  59964  60099  136  0  1   78  111    40 0.072   5.87
 2.05 Intr +  61893  62016  124  2  1   91   37    25 0.071  -2.14
 2.06 Term +  63439  63506   68  2  2  104   44    77 0.250   3.00
 2.07 PlyA +  66622  66627    6                               1.05

 3.04 PlyA -  67850  67845    6                               1.05
 3.03 Term -  69826  69656  171  1  0   87   44    63 0.171  -0.37
 3.02 Intr -  71305  71222   84  1  0   47  105    27 0.102   0.32
 3.01 Init -  88920  88831   90  0  0   62   62   152 0.347   8.49
 3.00 Prom -  91083  91044   40                              -7.96

 4.05 PlyA -  92505  92500    6                               1.05
 4.04 Term -  94400  94288  113  0  2  108   50   107 0.914   7.62
 4.03 Intr -  94909  94757  153  2  0   90   92    94 0.817  10.04
 4.02 Intr -  95546  95444  103  2  1  109   43    17 0.573  -0.95
 4.01 Init -  96645  96580   66  0  0   56  113     6 0.337   0.98
 4.00 Prom -  97209  97170   40                              -7.16

 5.00 Prom +  98066  98105   40                             -11.14
 5.01 Init + 100001 100063   63  1  0   72   78   106 0.955   9.15
 5.02 Intr + 101027 101183  157  1  1   35   42   446 0.855  34.28
 5.03 Intr + 102337 102572  236  2  2   90   32   428 0.684  34.81
 5.04 Intr + 103092 103247  156  2  0  108   48   236 0.919  21.81
 5.05 Intr + 103361 103584  224  1  2   90   46   414 0.679  34.33
 5.06 Intr + 103938 104058  121  0  1   38   78   160 0.999  10.50
 5.07 Intr + 104155 104282  128  0  2   79  109   113 0.999  11.98
 5.08 Intr + 105295 105482  188  1  2   69   74   428 0.992  38.83
 5.09 Term + 106116 106225  110  1  2  122   44   213 0.999  18.97
 5.10 PlyA + 109927 109932    6                               1.05

 6.08 PlyA - 111518 111513    6                               1.05
 6.07 Term - 112147 111969  179  2  2   71   48   457 0.998  37.85
 6.06 Intr - 113091 112945  147  1  0   63   84   181 0.991  15.41
 6.05 Intr - 113257 113210   48  2  0  112   77    17 0.758   1.65
 6.04 Intr - 114876 114679  198  1  0   33   91   324 0.967  26.52
 6.03 Intr - 117303 117223   81  1  0   96   58    40 0.721   1.41
 6.02 Intr - 118351 117715  637  1  1   83   99   640 0.360  56.27
 6.01 Init - 124735 124640   96  1  0   65   61    92 0.160   2.52
 6.00 Prom - 126770 126731   40                              -7.86

 7.00 Prom + 127319 127358   40                              -8.86
 7.01 Init + 128630 128705   76  1  1   62   66   130 0.627   7.54
 7.02 Intr + 128929 128963   35  2  2  128   69    24 0.595   2.44
 7.03 Intr + 131056 131256  201  0  0   69   45   104 0.511   3.68
 7.04 Intr + 132966 133036   71  2  2   73   76    58 0.324   1.08
 7.05 Intr + 141132 141249  118  0  1   42   72    49 0.069  -0.83
 7.06 Intr + 143799 144013  215  2  2   39   94   157 0.388   8.91
 7.07 Intr + 146376 146533  158  0  2   64   89   115 0.852   8.85
 7.08 Intr + 149960 150196  237  0  0   60    9   200 0.229   6.89
 7.09 Term + 152641 152705   65  2  2   71   55    77 0.437   0.75
 7.10 PlyA + 153195 153200    6                               1.05

 8.06 PlyA - 153406 153401    6                               1.05
 8.05 Term - 157865 157714  152  0  2  117   29    89 0.599   3.97
 8.04 Intr - 161971 161860  112  1  1  113   89    -9 0.636   1.65
 8.03 Intr - 167754 167647  108  0  0   63   81    86 0.697   5.88
 8.02 Intr - 180599 180250  350  0  2   77  100   116 0.709   6.78
 8.01 Init - 180901 180865   37  1  1   59  107    30 0.830   2.18
 8.00 Prom - 187571 187532   40                              -5.26

 9.00 Prom + 188233 188272   40                              -8.16
 9.01 Init + 190233 190346  114  2  0  110   44   148 0.921  12.81
 9.02 Intr + 204863 204937   75  1  0   72   91    24 0.002   0.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_1|167_aa
MIHIAVEVVYTGGHHVAVNVYAQGSIPLNEYTKIVYAFFLDGHLSCYQFLAITNKAARSI
RSLDKALILSTNCQSEKSLNPPNERAVTLTTKIRGFIFEVSETKNPPEGANSGHTDLLSQ
GWQGIRPYRWSTGKADAIWRFHFLSHGTGSLQIGEQALSLTPAVFCA

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_1|504_bp
atgatccacattgctgtggaggtggtctatactggaggccatcacgttgctgtcaatgtc
tatgctcaaggaagtattccactgaatgaatataccaaaatagtttatgcattcttcctc
gatggacatttgagttgttatcagtttttggctatcaccaataaagctgctagaagcatc
aggtctttagacaaagctttaattctttcaaccaactgccaatcagaaaaatctttgaat
ccacctaatgaaagagctgtaacactcaccaccaagatccgtggcttcatttttgaagtc
agcgagaccaagaacccaccagaaggagccaattctggacacactgacctgctctcccag
ggctggcagggcatacggccctatcggtggagcactggcaaggctgatgccatatggcgg
ttccacttcctcagccatggaactggatctttgcaaattggagagcaggcgctgagcctc
actcctgctgtcttctgtgcctga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_2|174_aa
MPTPARSQSDAVAAAAPAPSSPGKSAGLEAGPQTSRFAPSRRRWSPELLPPGLWCLKPGG
MGKGVRAQLSAEREGGGGVVKYLSPMWFSQPWGEVGAIIPALKMRKQEGGQTLTIALSHF
PSIPGKGAPRDVPGNYLAAKAIILPCPLPYKQWGHDASLGPQTNVHVDPMSDSL

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_2|525_bp
atgcccaccccggcgagatcccagagcgacgcggtggcggcggcagcgccagccccctcc
tcccccgggaagtcggccgggcttgaggccgggccccagacgtcccgcttcgccccgagt
cgccgccgatggtccccggagctcctgcccccaggcctgtggtgcctgaaacctggcggg
atgggaaagggggtaagagcacagctgtcagctgagagggagggaggaggaggtgttgta
aaatatttgtctccaatgtggttctcacagccatggggtgaagtaggtgccatcatcccc
gctttgaagatgagaaaacaggaagggggccagaccctgaccattgccctttcccatttc
ccatcaataccaggcaagggagcaccaagagatgtcccagggaattacctggctgccaag
gctattattcttccctgccccttgccatacaaacagtggggccacgatgcctctttgggt
ccccagaccaatgtccatgtggaccctatgtctgacagtctttga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_3|114_aa
MLAAPHTPLPSLVPGPAPPAGLFGSYQVPLNQRGVAAWGQGWGTHTLGGDSKKEEELKAA
FSPMAQSPASAQLAASLLSSYNHTGEQQAAGFIPVVGMGKLRLSFTKDKGPVNS

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_3|345_bp
atgctggctgcgccccacaccccgctccccagcctggtccctggcccggcccctccagct
ggcctgtttggctcctatcaggtgccgctgaatcagagaggtgtggctgcctgggggcag
ggctggggcacgcacacccttggaggggattccaagaaggaggaagagctcaaggctgcc
tttagtcctatggcacaatcacctgcatctgcacaacttgctgcctctctgctctccagc
tacaaccacacaggggagcagcaggcagctggcttcatccctgttgtagggatggggaaa
ctgaggctcagcttcaccaaggataaggggcctgtaaacagttga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_4|144_aa
MQCGRAPPNRLRAQREGKGRGKASASASVKWAVARIKRSTRPRATGPRLTHGELSTALVT
TGHGIHSTAFITLRVLLPTGMSLSPVSGVYLSGYLPYFQHPVQCRQHRVLQHAGGINNTT
IFERLLCAIVSASSHALSPNLTAH

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_4|435_bp
atgcaatgtgggcgggcaccacccaatcggctgagggcccagagagaaggaaaaggcaga
ggaaaggcctcggcgtccgcatctgtaaaatgggctgttgcaaggattaaacggagcact
agacctcgggccactggaccgcgcctgacacacggcgagctctccacagccctcgtcacc
accggacacggcatccacagcacggcattcatcacgttgcgtgtcctccttcccacagga
atgtcacttagtccagtcagcggtgtttatctgtctggttatctgccgtatttccagcat
ccagtgcagtgccggcaacatagggtactgcagcacgcaggcggcatcaacaacacaacc
atttttgagcgcctactgtgtgccattgtgagcgcctcctcacatgctttatctccaaac
ctcacagcccactga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_5|460_aa
MSSSYDEASLAPEETTDSFWEVGNYKRTVKRIDDGHRLCNDLMNCVQERAKIEKAYGQQL
TDWAKRWRQLIEKGPQYGSLERAWGAIMTEADKVSELHQEVKNNLLNEDLEKVKNWQKDA
YHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKKAYHLACKEEKLAMTREMNSKTE
QSVTPEQQKKLQDKVDKCKQDVQKTQEKYEKVLEDVGKTTPQYMENMEQVFEQCQQFEEK
RLVFLKEVLLDIKRHLNLAENSRYLGMAGTEGTGTASRCYIHVYRELEQAIRGADAQEDL
RWFRSTSGPGMPMNWPQFEEWNPDLPHTTTKKEKQPKKAEGVALTNATGAVESTSQAGDR
GSVSSYDRGQPYATEWSDDESGNPFGGSETNGGANPFEDDSKGVRVRALYDYDGQEQDEL
SFKAGDELTKLGEEDEQGWCRGRLDSGQLGLYPANYVEAI

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_5|1383_bp
atgtccagctcctacgatgaggcctcactggcgccagaggagaccaccgacagcttctgg
gaggtggggaactacaagcggaccgtgaagcgcatcgatgacggccaccgtctatgcaac
gacctgatgaactgcgtgcaggagcgcgccaagatcgagaaggcgtacgggcagcagctc
accgactgggccaagcgttggcgccagctcatcgagaaaggcccacagtatggcagcctg
gagcgggcctggggtgccataatgacagaggcagacaaggtgagcgagctgcaccaggag
gtgaagaacaatctgctgaatgaggacctggagaaggtgaagaactggcagaaggacgcc
tatcacaagcagatcatgggtggcttcaaggagacgaaggaggctgaagatggcttccgc
aaggcccagaagccttgggccaagaagatgaaggagctggaggcagccaagaaggcctac
catttggcttgcaaagaggaaaagctggccatgacacgggagatgaacagcaagacggag
caatcggtcacacctgagcagcaaaagaagctgcaggacaaagtggacaagtgcaagcag
gatgtgcagaagacacaggagaagtatgagaaagtgctggaagatgtgggcaagaccaca
ccccagtacatggagaacatggagcaggtgtttgagcaatgccagcaatttgaggaaaag
cggctggtcttcctcaaggaggtgctgctggacatcaaacggcacctcaacctggctgag
aacagcaggtacctgggcatggcaggcaccgagggcacaggcacagccagcagatgctac
atccatgtgtaccgtgagctggagcaggccatccggggggctgatgcccaggaagacctc
agatggttccgcagcaccagtggccccggcatgcccatgaactggccccagtttgaggag
tggaacccagaccttcctcacaccaccaccaagaaggagaaacagcctaagaaggcagag
ggagtggcgctgaccaatgccactggggcggtagagtccacatcccaggctggggaccgc
ggcagtgttagcagctacgacagaggccagccctacgccaccgagtggtcagacgacgag
agtgggaacccctttgggggcagtgagaccaacgggggcgccaacccctttgaggacgac
tccaagggagtgcgcgtgcgggcactctacgactatgacggccaggagcaggacgagctc
agctttaaggccggagacgaactcaccaagctgggcgaggaggatgagcagggctggtgc
cgtgggcggctggacagcgggcagctgggcctctaccctgccaactacgtggaggctatc
tag

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_6|461_aa
MLAAPAGRARQSSLFSLTFLLSQKALEVVPVRPEGLSAQPCRGCPMTRKPPTLSQAQPTV
HRAPHPHNQLSGPHRDGEAVKPESSFISVDTAASPNSSGMGSASPGLSSVSPSHLLLPPD
TVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYLSYFDMLYPEDSSWAAKAPGAS
SREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLEQVQSMVVGEVLKDIETACKLL
NITAGLGSDGTIRMRPSLTPPLKLLPMSTGHDPMDWSPSNVQKWLLWTEHQYRLPPMGKA
FQELAGKELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEES
WTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIR
KNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_6|1386_bp
atgctggctgccccagctgggagggcccggcaaagcagtttattcagtttgaccttcctc
ctgtcccagaaagcgctggaagtagtgccagtgaggccagaaggcctgtctgcccaacca
tgtcgtggctgcccaatgacccggaagcccccaactctgtcccaggcccagcccactgtc
cacagggcccctcatccccataaccagcttagcggcccccacagggatggagaagcagtc
aaacctgagtcctctttcatttccgtagacacagccgccagcccaaacagcagcggcatg
ggcagcgccagcccgggtctgagcagcgtatcccccagccacctcctgctgccccccgac
acggtgtcgcggacaggcttggagaaggcggcagcgggggcagtgggtctcgagagacgg
gactggagtcccagtccacccgccacgcccgagcagggcctgtccgccttctacctctcc
tactttgacatgctgtaccctgaggacagcagctgggcagccaaggcccctggggccagc
agtcgggaggagccacctgaggagcctgagcagtgcccggtcattgacagccaagcccca
gcgggcagcctggacttggtgcccggcgggctgaccttggaggagcactcgctggagcag
gtgcagtccatggtggtgggcgaagtgctcaaggacatcgagacggcctgcaagctgctc
aacatcaccgcaggtcttggctcagatggcaccatccgcatgaggccttccctgacccct
ccattaaaattgctgcccatgagcacaggacatgatcccatggactggagccccagcaat
gtgcagaagtggctcctgtggacagagcaccaataccggctgccccccatgggcaaggcc
ttccaggagctggcgggcaaggagctgtgcgccatgtcggaggagcagttccgccagcgc
tcgcccctgggtggggatgtgctgcacgcccacctggacatctggaagtcagcggcctgg
atgaaagagcggacttcacctggggcgattcactactgtgcctcgaccagtgaggagagc
tggaccgacagcgaggtggactcatcatgctccgggcagcccatccacctgtggcagttc
ctcaaggagttgctactcaagccccacagctatggccgcttcattaggtggctcaacaag
gagaagggcatcttcaaaattgaggactcagcccaggtggcccggctgtggggcatccgc
aagaaccgtcccgccatgaactacgacaagctgagccgctccatccgccagtattacaag
aagggcatcatccggaagccagacatctcccagcgcctcgtctaccagttcgtgcacccc
atctga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_7|391_aa
MAIIKDTGYAGLMAWVWAAGRPQLIEIWQILEMSTFEGALYLAVIVTGTLVEDQLSVIPQ
RRLREDGAEEGEVTQGPEMVHSTTPNPQWSSPPGSIKKDTLRFQGLADHQPGHVFDKCHL
TPHSNTHRCLLCAYFNKSTDTDLLDEGKDNWEDHALDQVTVRGREWRQSCVTTEAAPTSP
CRLVTLKLFCPLDTAFASCQRSQDPRAQSGTQLAAAAGSMTRPASRVSLTNLPGVATGRS
YRTGGRHCTQPTSNPLTNPVVPIAQMGKLRPKLHPAGPSEQKGEVYLAEGTSLEDQHPDD
DEQNGHEDVHNQGSDVQALGGGGIRLGPSQVTNHLPVPRLHGISKCYEAQAAGVHEEGVE
QGSDDTVGHGDIHDGRYPQLLHTPPQTSPSQ

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_7|1176_bp
atggccatcataaaggacacaggctacgcggggctcatggcctgggtgtgggctgctggg
cggccccagctgatagaaatctggcaaatcttagaaatgagcaccttcgagggagccctg
tacttggctgttatcgtcactggcaccttggttgaggatcagttatcagtcattccccag
agaaggttgagggaggatggagcggaggaaggggaggtaacacaaggtcctgaaatggta
cactcgaccactcctaatccccaatggtccagccctccagggtcaataaagaaagacacg
ctgcgtttccagggccttgctgaccaccaacctggccacgtgtttgacaaatgccatctt
actcctcacagcaacactcacaggtgtttactatgtgcctatttcaacaaatcaacagat
accgacttgttggatgaaggaaaggacaactgggaggatcatgcgctggatcaagtcaca
gtcaggggaagggaatggaggcagtcctgcgtgaccacagaagctgcccccaccagcccc
tgccggctggtgacccttaaactcttctgccctcttgacacggcttttgcgtcatgtcag
agaagccaggatccccgcgcccagtctgggacgcagttggcagcagctgcaggaagcatg
acccgcccagcatctcgggtgtcactcacgaacctcccaggtgtcgctactggcaggtcc
taccgcacagggggcaggcactgtacccagcccacctccaaccccctcaccaaccctgtt
gttcccattgcacagatgggaaaactgaggcccaagctccatcctgcagggccctcggaa
cagaagggtgaggtgtatctggctgaagggacaagcttggaagatcaacacccggatgat
gatgagcagaatggtcatgaggatgtccacaatcagggctcagatgttcaggcacttggc
ggtggaggcatacgcctgggccccagtcaggtcaccaaccatcttcctgtccctagactt
cacggaataagcaaatgctatgaagcccaggcagcaggggttcacgaagagggtgttgaa
cagggatcagacgacacggtcgggcacggagatatccacgatggtcgctaccctcagctg
cttcacacaccacctcaaacatcaccttctcagtga

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_8|252_aa
MISKDDGALASTDVIWVILSVEVGGLLGVTQQLSSFETEFNTQPHRKVEGNFNPFASPQK
NRQSDENNLKDPGGSEFDSISKNTWAPAPDTWAPAPDQTEQDQNRLSQNSVNLSPSSHAN
NLSVVTYSKAGIVVGYQNLLHQVLELQWNLKCGWASRFESTSIQKATTLGKLNVPSTLSP
FPRTNHFCAHPWRRDELLAEGQNSFAPTHSLLVTADGYGVDEGMDGVQARGDTTQEPGCK
CMEEWSQSPSRA

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_8|759_bp
atgatttccaaagatgacggtgcattggcatcaactgatgtcatctgggtgattctcagt
gtggaggtgggtggacttttaggagtaacgcagcagctgtcatcttttgaaacggagttc
aacacacagccgcatcgtaaggtagaaggaaacttcaacccttttgcctctccccaaaag
aaccgacaatcagatgaaaacaacttaaaagaccctgggggctccgagttcgactcgatc
agcaaaaacacatgggctcctgctcctgacacatgggctcctgctcctgaccaaactgag
caagaccagaatagactgtcacagaactctgtaaatctgtctcccagcagtcacgcaaac
aacttatcagtagtgacttacagtaaggctgggattgtcgttggctaccagaacctgtta
caccaggtcttagaactccagtggaatctgaaatgtggatgggcctcccggtttgaaagt
acctccatccagaaggcaaccacacttggcaagttaaatgtcccaagcaccttgtctccc
ttcccaagaacaaaccatttctgtgcacatccttggaggcgagatgagctccttgcagag
ggccagaactcttttgcccccacacactccctgctggtcacagcagacggttatggagta
gatgagggaatggatggtgtccaggccaggggtgacaccacccaggaaccaggctgcaag
tgcatggaggagtggagccagagccctagtagggcctag

>gi568815592f:34426306_34632527|GENSCAN_predicted_peptide_9|63_aa
MEGVEEKKVPAVPETLKKKRRNFAELKINRLRKKFAQKYTLLKGRQDVLAKRNCSKEASS
NVM

>gi568815592f:34426306_34632527|GENSCAN_predicted_CDS_9|189_bp
atggagggtgtcgaagagaagaaggttcctgctgtgccagaaacccttaagaaaaagcga
aggaattttgcagagctgaagatcaatcgcctgagaaagaagtttgcccaaaagtatacc
ctgctgaaaggtagacaggatgtactggctaaaagaaactgtagtaaagaagccagtagt
aatgtaatg