GENSCAN 1.0    Date run: 5-Nov-116    Time: 09:54:46

Sequence gi568815592r:34438277_34644455 : 206179 bp : 49.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    328    399   72  1  0  107   67    72 0.971   5.52
 1.02 Intr +    845    940   96  2  0   69   69   118 0.800   7.12
 1.03 Intr +  10184  10222   39  2  0  125  103    31 0.858   5.64
 1.04 Term +  10940  11054  115  2  1   53   42    99 0.795  -0.16
 1.05 PlyA +  11713  11718    6                               1.05

 2.00 Prom +  12372  12411   40                              -6.56
 2.01 Init +  17470  17476    7  0  1  114  100     0 0.372   4.99
 2.02 Intr +  27848  27994  147  0  0  133   91    80 0.849  13.01
 2.03 Intr +  34439  34481   43  0  1  138  116    -5 0.498   4.60
 2.04 Intr +  47993  48128  136  2  1   78  111    40 0.072   5.87
 2.05 Intr +  49922  50045  124  1  1   91   37    25 0.071  -2.14
 2.06 Term +  51468  51535   68  1  2  104   44    77 0.250   3.00
 2.07 PlyA +  54651  54656    6                               1.05

 3.04 PlyA -  55879  55874    6                               1.05
 3.03 Term -  57855  57685  171  0  0   87   44    63 0.171  -0.37
 3.02 Intr -  59334  59251   84  0  0   47  105    27 0.102   0.32
 3.01 Init -  76949  76860   90  2  0   62   62   152 0.347   8.49
 3.00 Prom -  79112  79073   40                              -7.96

 4.05 PlyA -  80534  80529    6                               1.05
 4.04 Term -  82429  82317  113  2  2  108   50   107 0.914   7.62
 4.03 Intr -  82938  82786  153  1  0   90   92    94 0.817  10.04
 4.02 Intr -  83575  83473  103  1  1  109   43    17 0.573  -0.95
 4.01 Init -  84674  84609   66  2  0   56  113     6 0.337   0.98
 4.00 Prom -  85238  85199   40                              -7.16

 5.00 Prom +  86095  86134   40                             -11.14
 5.01 Init +  88030  88092   63  0  0   72   78   106 0.955   9.15
 5.02 Intr +  89056  89212  157  0  1   35   42   446 0.855  34.28
 5.03 Intr +  90366  90601  236  1  2   90   32   428 0.684  34.81
 5.04 Intr +  91121  91276  156  1  0  108   48   236 0.919  21.81
 5.05 Intr +  91390  91613  224  0  2   90   46   414 0.679  34.33
 5.06 Intr +  91967  92087  121  2  1   38   78   160 0.999  10.50
 5.07 Intr +  92184  92311  128  2  2   79  109   113 0.999  11.98
 5.08 Intr +  93324  93511  188  0  2   69   74   428 0.992  38.83
 5.09 Term +  94145  94254  110  0  2  122   44   213 0.999  18.97
 5.10 PlyA +  97956  97961    6                               1.05

 6.08 PlyA -  99547  99542    6                               1.05
 6.07 Term - 100176  99998  179  1  2   71   48   457 0.998  37.85
 6.06 Intr - 101120 100974  147  0  0   63   84   181 0.991  15.41
 6.05 Intr - 101286 101239   48  1  0  112   77    17 0.758   1.65
 6.04 Intr - 102905 102708  198  0  0   33   91   324 0.967  26.52
 6.03 Intr - 105332 105252   81  0  0   96   58    40 0.721   1.41
 6.02 Intr - 106380 105744  637  0  1   83   99   640 0.360  56.27
 6.01 Init - 112764 112669   96  0  0   65   61    92 0.160   2.52
 6.00 Prom - 114799 114760   40                              -7.86

 7.00 Prom + 115348 115387   40                              -8.86
 7.01 Init + 116659 116734   76  0  1   62   66   130 0.627   7.54
 7.02 Intr + 116958 116992   35  1  2  128   69    24 0.595   2.44
 7.03 Intr + 119085 119285  201  2  0   69   45   104 0.511   3.68
 7.04 Intr + 120995 121065   71  1  2   73   76    58 0.324   1.08
 7.05 Intr + 129161 129278  118  2  1   42   72    49 0.069  -0.83
 7.06 Intr + 131828 132042  215  1  2   39   94   157 0.388   8.91
 7.07 Intr + 134405 134562  158  2  2   64   89   115 0.852   8.85
 7.08 Intr + 137989 138225  237  2  0   60    9   200 0.229   6.89
 7.09 Term + 140670 140734   65  1  2   71   55    77 0.437   0.75
 7.10 PlyA + 141224 141229    6                               1.05

 8.06 PlyA - 141435 141430    6                               1.05
 8.05 Term - 145894 145743  152  2  2  117   29    89 0.599   3.97
 8.04 Intr - 150000 149889  112  0  1  113   89    -9 0.636   1.65
 8.03 Intr - 155783 155676  108  2  0   63   81    86 0.697   5.88
 8.02 Intr - 168628 168279  350  2  2   77  100   116 0.709   6.78
 8.01 Init - 168930 168894   37  0  1   59  107    30 0.830   2.18
 8.00 Prom - 175600 175561   40                              -5.26

 9.00 Prom + 176262 176301   40                              -8.16
 9.01 Init + 178262 178375  114  1  0  110   44   148 0.926  12.81
 9.02 Intr + 192892 192966   75  0  0   72   91    24 0.047   0.81
 9.03 Term + 196602 196709  108  2  0   44   40    67 0.033  -3.99
 9.04 PlyA + 196749 196754    6                               1.05

10.02 PlyA - 198715 198710    6                               1.05
10.01 Term - 199644 199545  100  2  1   26   42   231 0.992   9.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_1|107_aa XSLDKALILSTNCQSEKSLNPPNERAVTLTTKIRGFIFEVSETKNPPEGANSGHTDLLSQ GWQGIRPYRWSTGKADAIWRFHFLSHGTGSLQIGEQALSLTPAVFCA >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_1|324_bp nngtctttagacaaagctttaattctttcaaccaactgccaatcagaaaaatctttgaat ccacctaatgaaagagctgtaacactcaccaccaagatccgtggcttcatttttgaagtc agcgagaccaagaacccaccagaaggagccaattctggacacactgacctgctctcccag ggctggcagggcatacggccctatcggtggagcactggcaaggctgatgccatatggcgg ttccacttcctcagccatggaactggatctttgcaaattggagagcaggcgctgagcctc actcctgctgtcttctgtgcctga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_2|174_aa MPTPARSQSDAVAAAAPAPSSPGKSAGLEAGPQTSRFAPSRRRWSPELLPPGLWCLKPGG MGKGVRAQLSAEREGGGGVVKYLSPMWFSQPWGEVGAIIPALKMRKQEGGQTLTIALSHF PSIPGKGAPRDVPGNYLAAKAIILPCPLPYKQWGHDASLGPQTNVHVDPMSDSL >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_2|525_bp atgcccaccccggcgagatcccagagcgacgcggtggcggcggcagcgccagccccctcc tcccccgggaagtcggccgggcttgaggccgggccccagacgtcccgcttcgccccgagt cgccgccgatggtccccggagctcctgcccccaggcctgtggtgcctgaaacctggcggg atgggaaagggggtaagagcacagctgtcagctgagagggagggaggaggaggtgttgta aaatatttgtctccaatgtggttctcacagccatggggtgaagtaggtgccatcatcccc gctttgaagatgagaaaacaggaagggggccagaccctgaccattgccctttcccatttc ccatcaataccaggcaagggagcaccaagagatgtcccagggaattacctggctgccaag gctattattcttccctgccccttgccatacaaacagtggggccacgatgcctctttgggt ccccagaccaatgtccatgtggaccctatgtctgacagtctttga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_3|114_aa MLAAPHTPLPSLVPGPAPPAGLFGSYQVPLNQRGVAAWGQGWGTHTLGGDSKKEEELKAA FSPMAQSPASAQLAASLLSSYNHTGEQQAAGFIPVVGMGKLRLSFTKDKGPVNS >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_3|345_bp atgctggctgcgccccacaccccgctccccagcctggtccctggcccggcccctccagct ggcctgtttggctcctatcaggtgccgctgaatcagagaggtgtggctgcctgggggcag ggctggggcacgcacacccttggaggggattccaagaaggaggaagagctcaaggctgcc 
tttagtcctatggcacaatcacctgcatctgcacaacttgctgcctctctgctctccagc tacaaccacacaggggagcagcaggcagctggcttcatccctgttgtagggatggggaaa ctgaggctcagcttcaccaaggataaggggcctgtaaacagttga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_4|144_aa MQCGRAPPNRLRAQREGKGRGKASASASVKWAVARIKRSTRPRATGPRLTHGELSTALVT TGHGIHSTAFITLRVLLPTGMSLSPVSGVYLSGYLPYFQHPVQCRQHRVLQHAGGINNTT IFERLLCAIVSASSHALSPNLTAH >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_4|435_bp atgcaatgtgggcgggcaccacccaatcggctgagggcccagagagaaggaaaaggcaga ggaaaggcctcggcgtccgcatctgtaaaatgggctgttgcaaggattaaacggagcact agacctcgggccactggaccgcgcctgacacacggcgagctctccacagccctcgtcacc accggacacggcatccacagcacggcattcatcacgttgcgtgtcctccttcccacagga atgtcacttagtccagtcagcggtgtttatctgtctggttatctgccgtatttccagcat ccagtgcagtgccggcaacatagggtactgcagcacgcaggcggcatcaacaacacaacc atttttgagcgcctactgtgtgccattgtgagcgcctcctcacatgctttatctccaaac ctcacagcccactga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_5|460_aa MSSSYDEASLAPEETTDSFWEVGNYKRTVKRIDDGHRLCNDLMNCVQERAKIEKAYGQQL TDWAKRWRQLIEKGPQYGSLERAWGAIMTEADKVSELHQEVKNNLLNEDLEKVKNWQKDA YHKQIMGGFKETKEAEDGFRKAQKPWAKKMKELEAAKKAYHLACKEEKLAMTREMNSKTE QSVTPEQQKKLQDKVDKCKQDVQKTQEKYEKVLEDVGKTTPQYMENMEQVFEQCQQFEEK RLVFLKEVLLDIKRHLNLAENSRYLGMAGTEGTGTASRCYIHVYRELEQAIRGADAQEDL RWFRSTSGPGMPMNWPQFEEWNPDLPHTTTKKEKQPKKAEGVALTNATGAVESTSQAGDR GSVSSYDRGQPYATEWSDDESGNPFGGSETNGGANPFEDDSKGVRVRALYDYDGQEQDEL SFKAGDELTKLGEEDEQGWCRGRLDSGQLGLYPANYVEAI >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_5|1383_bp atgtccagctcctacgatgaggcctcactggcgccagaggagaccaccgacagcttctgg gaggtggggaactacaagcggaccgtgaagcgcatcgatgacggccaccgtctatgcaac gacctgatgaactgcgtgcaggagcgcgccaagatcgagaaggcgtacgggcagcagctc accgactgggccaagcgttggcgccagctcatcgagaaaggcccacagtatggcagcctg gagcgggcctggggtgccataatgacagaggcagacaaggtgagcgagctgcaccaggag gtgaagaacaatctgctgaatgaggacctggagaaggtgaagaactggcagaaggacgcc tatcacaagcagatcatgggtggcttcaaggagacgaaggaggctgaagatggcttccgc aaggcccagaagccttgggccaagaagatgaaggagctggaggcagccaagaaggcctac 
catttggcttgcaaagaggaaaagctggccatgacacgggagatgaacagcaagacggag caatcggtcacacctgagcagcaaaagaagctgcaggacaaagtggacaagtgcaagcag gatgtgcagaagacacaggagaagtatgagaaagtgctggaagatgtgggcaagaccaca ccccagtacatggagaacatggagcaggtgtttgagcaatgccagcaatttgaggaaaag cggctggtcttcctcaaggaggtgctgctggacatcaaacggcacctcaacctggctgag aacagcaggtacctgggcatggcaggcaccgagggcacaggcacagccagcagatgctac atccatgtgtaccgtgagctggagcaggccatccggggggctgatgcccaggaagacctc agatggttccgcagcaccagtggccccggcatgcccatgaactggccccagtttgaggag tggaacccagaccttcctcacaccaccaccaagaaggagaaacagcctaagaaggcagag ggagtggcgctgaccaatgccactggggcggtagagtccacatcccaggctggggaccgc ggcagtgttagcagctacgacagaggccagccctacgccaccgagtggtcagacgacgag agtgggaacccctttgggggcagtgagaccaacgggggcgccaacccctttgaggacgac tccaagggagtgcgcgtgcgggcactctacgactatgacggccaggagcaggacgagctc agctttaaggccggagacgaactcaccaagctgggcgaggaggatgagcagggctggtgc cgtgggcggctggacagcgggcagctgggcctctaccctgccaactacgtggaggctatc tag >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_6|461_aa MLAAPAGRARQSSLFSLTFLLSQKALEVVPVRPEGLSAQPCRGCPMTRKPPTLSQAQPTV HRAPHPHNQLSGPHRDGEAVKPESSFISVDTAASPNSSGMGSASPGLSSVSPSHLLLPPD TVSRTGLEKAAAGAVGLERRDWSPSPPATPEQGLSAFYLSYFDMLYPEDSSWAAKAPGAS SREEPPEEPEQCPVIDSQAPAGSLDLVPGGLTLEEHSLEQVQSMVVGEVLKDIETACKLL NITAGLGSDGTIRMRPSLTPPLKLLPMSTGHDPMDWSPSNVQKWLLWTEHQYRLPPMGKA FQELAGKELCAMSEEQFRQRSPLGGDVLHAHLDIWKSAAWMKERTSPGAIHYCASTSEES WTDSEVDSSCSGQPIHLWQFLKELLLKPHSYGRFIRWLNKEKGIFKIEDSAQVARLWGIR KNRPAMNYDKLSRSIRQYYKKGIIRKPDISQRLVYQFVHPI >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_6|1386_bp atgctggctgccccagctgggagggcccggcaaagcagtttattcagtttgaccttcctc ctgtcccagaaagcgctggaagtagtgccagtgaggccagaaggcctgtctgcccaacca tgtcgtggctgcccaatgacccggaagcccccaactctgtcccaggcccagcccactgtc cacagggcccctcatccccataaccagcttagcggcccccacagggatggagaagcagtc aaacctgagtcctctttcatttccgtagacacagccgccagcccaaacagcagcggcatg ggcagcgccagcccgggtctgagcagcgtatcccccagccacctcctgctgccccccgac acggtgtcgcggacaggcttggagaaggcggcagcgggggcagtgggtctcgagagacgg 
gactggagtcccagtccacccgccacgcccgagcagggcctgtccgccttctacctctcc tactttgacatgctgtaccctgaggacagcagctgggcagccaaggcccctggggccagc agtcgggaggagccacctgaggagcctgagcagtgcccggtcattgacagccaagcccca gcgggcagcctggacttggtgcccggcgggctgaccttggaggagcactcgctggagcag gtgcagtccatggtggtgggcgaagtgctcaaggacatcgagacggcctgcaagctgctc aacatcaccgcaggtcttggctcagatggcaccatccgcatgaggccttccctgacccct ccattaaaattgctgcccatgagcacaggacatgatcccatggactggagccccagcaat gtgcagaagtggctcctgtggacagagcaccaataccggctgccccccatgggcaaggcc ttccaggagctggcgggcaaggagctgtgcgccatgtcggaggagcagttccgccagcgc tcgcccctgggtggggatgtgctgcacgcccacctggacatctggaagtcagcggcctgg atgaaagagcggacttcacctggggcgattcactactgtgcctcgaccagtgaggagagc tggaccgacagcgaggtggactcatcatgctccgggcagcccatccacctgtggcagttc ctcaaggagttgctactcaagccccacagctatggccgcttcattaggtggctcaacaag gagaagggcatcttcaaaattgaggactcagcccaggtggcccggctgtggggcatccgc aagaaccgtcccgccatgaactacgacaagctgagccgctccatccgccagtattacaag aagggcatcatccggaagccagacatctcccagcgcctcgtctaccagttcgtgcacccc atctga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_7|391_aa MAIIKDTGYAGLMAWVWAAGRPQLIEIWQILEMSTFEGALYLAVIVTGTLVEDQLSVIPQ RRLREDGAEEGEVTQGPEMVHSTTPNPQWSSPPGSIKKDTLRFQGLADHQPGHVFDKCHL TPHSNTHRCLLCAYFNKSTDTDLLDEGKDNWEDHALDQVTVRGREWRQSCVTTEAAPTSP CRLVTLKLFCPLDTAFASCQRSQDPRAQSGTQLAAAAGSMTRPASRVSLTNLPGVATGRS YRTGGRHCTQPTSNPLTNPVVPIAQMGKLRPKLHPAGPSEQKGEVYLAEGTSLEDQHPDD DEQNGHEDVHNQGSDVQALGGGGIRLGPSQVTNHLPVPRLHGISKCYEAQAAGVHEEGVE QGSDDTVGHGDIHDGRYPQLLHTPPQTSPSQ >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_7|1176_bp atggccatcataaaggacacaggctacgcggggctcatggcctgggtgtgggctgctggg cggccccagctgatagaaatctggcaaatcttagaaatgagcaccttcgagggagccctg tacttggctgttatcgtcactggcaccttggttgaggatcagttatcagtcattccccag agaaggttgagggaggatggagcggaggaaggggaggtaacacaaggtcctgaaatggta cactcgaccactcctaatccccaatggtccagccctccagggtcaataaagaaagacacg ctgcgtttccagggccttgctgaccaccaacctggccacgtgtttgacaaatgccatctt actcctcacagcaacactcacaggtgtttactatgtgcctatttcaacaaatcaacagat accgacttgttggatgaaggaaaggacaactgggaggatcatgcgctggatcaagtcaca 
gtcaggggaagggaatggaggcagtcctgcgtgaccacagaagctgcccccaccagcccc tgccggctggtgacccttaaactcttctgccctcttgacacggcttttgcgtcatgtcag agaagccaggatccccgcgcccagtctgggacgcagttggcagcagctgcaggaagcatg acccgcccagcatctcgggtgtcactcacgaacctcccaggtgtcgctactggcaggtcc taccgcacagggggcaggcactgtacccagcccacctccaaccccctcaccaaccctgtt gttcccattgcacagatgggaaaactgaggcccaagctccatcctgcagggccctcggaa cagaagggtgaggtgtatctggctgaagggacaagcttggaagatcaacacccggatgat gatgagcagaatggtcatgaggatgtccacaatcagggctcagatgttcaggcacttggc ggtggaggcatacgcctgggccccagtcaggtcaccaaccatcttcctgtccctagactt cacggaataagcaaatgctatgaagcccaggcagcaggggttcacgaagagggtgttgaa cagggatcagacgacacggtcgggcacggagatatccacgatggtcgctaccctcagctg cttcacacaccacctcaaacatcaccttctcagtga >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_8|252_aa MISKDDGALASTDVIWVILSVEVGGLLGVTQQLSSFETEFNTQPHRKVEGNFNPFASPQK NRQSDENNLKDPGGSEFDSISKNTWAPAPDTWAPAPDQTEQDQNRLSQNSVNLSPSSHAN NLSVVTYSKAGIVVGYQNLLHQVLELQWNLKCGWASRFESTSIQKATTLGKLNVPSTLSP FPRTNHFCAHPWRRDELLAEGQNSFAPTHSLLVTADGYGVDEGMDGVQARGDTTQEPGCK CMEEWSQSPSRA >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_8|759_bp atgatttccaaagatgacggtgcattggcatcaactgatgtcatctgggtgattctcagt gtggaggtgggtggacttttaggagtaacgcagcagctgtcatcttttgaaacggagttc aacacacagccgcatcgtaaggtagaaggaaacttcaacccttttgcctctccccaaaag aaccgacaatcagatgaaaacaacttaaaagaccctgggggctccgagttcgactcgatc agcaaaaacacatgggctcctgctcctgacacatgggctcctgctcctgaccaaactgag caagaccagaatagactgtcacagaactctgtaaatctgtctcccagcagtcacgcaaac aacttatcagtagtgacttacagtaaggctgggattgtcgttggctaccagaacctgtta caccaggtcttagaactccagtggaatctgaaatgtggatgggcctcccggtttgaaagt acctccatccagaaggcaaccacacttggcaagttaaatgtcccaagcaccttgtctccc ttcccaagaacaaaccatttctgtgcacatccttggaggcgagatgagctccttgcagag ggccagaactcttttgcccccacacactccctgctggtcacagcagacggttatggagta gatgagggaatggatggtgtccaggccaggggtgacaccacccaggaaccaggctgcaag tgcatggaggagtggagccagagccctagtagggcctag >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_9|98_aa MEGVEEKKVPAVPETLKKKRRNFAELKINRLRKKFAQKYTLLKGRQDVLAKRNCSKEASS 
NVMKKNGTTYQTASTELHCPKELVATSHMVAIEYLGCS >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_9|297_bp atggagggtgtcgaagagaagaaggttcctgctgtgccagaaacccttaagaaaaagcga aggaattttgcagagctgaagatcaatcgcctgagaaagaagtttgcccaaaagtatacc ctgctgaaaggtagacaggatgtactggctaaaagaaactgtagtaaagaagccagtagt aatgtaatgaaaaaaaatggtacaacgtatcagacagcatccacagagctgcactgtcca aaagagctggtggccaccagccatatggtggctattgagtacttgggatgtagctag >gi568815592r:34438277_34644455|GENSCAN_predicted_peptide_10|33_aa XVRPSFNNNNNNNNNNNNNSSSSSSSNNNSNVK >gi568815592r:34438277_34644455|GENSCAN_predicted_CDS_10|102_bp nnagtgagacccagtttcaacaacaacaacaacaacaacaacaacaacaacaacaacagc agcagcagcagcagcagcaacaacaacagcaatgtgaaatga