GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:10:11 Sequence gi568815591r:44862742_45078961 : 216220 bp : 47.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 51 46 6 1.05 1.01 Sngl - 22607 21669 939 2 0 76 47 2062 0.961 197.41 1.00 Prom - 24814 24775 40 -3.06 2.07 PlyA - 25011 25006 6 1.05 2.06 Term - 25267 25210 58 0 1 100 46 67 0.193 0.86 2.05 Intr - 29906 29801 106 0 1 49 84 83 0.156 3.27 2.04 Intr - 30721 30666 56 0 2 119 58 27 0.115 1.42 2.03 Intr - 59448 59345 104 0 2 45 81 62 0.022 0.17 2.02 Intr - 73036 72842 195 1 0 48 80 161 0.620 10.91 2.01 Init - 74815 74702 114 1 0 76 83 33 0.439 1.83 2.00 Prom - 88162 88123 40 -3.16 3.22 PlyA - 88259 88254 6 1.05 3.21 Term - 100154 99998 157 1 1 112 53 222 0.999 18.51 3.20 Intr - 100383 100229 155 0 2 87 84 224 0.999 20.77 3.19 Intr - 101421 101308 114 0 0 90 89 253 0.984 26.24 3.18 Intr - 101778 101674 105 0 0 103 72 91 0.674 9.41 3.17 Intr - 102348 102204 145 2 1 91 84 209 0.998 21.08 3.16 Intr - 103119 102896 224 0 2 65 79 165 0.994 10.23 3.15 Intr - 103614 103332 283 2 1 102 100 300 0.521 30.02 3.14 Intr - 104097 103931 167 0 2 112 84 236 0.983 24.36 3.13 Intr - 104996 104864 133 1 1 47 95 175 0.975 14.75 3.12 Intr - 105217 105143 75 0 0 90 99 207 0.999 20.63 3.11 Intr - 106729 106672 58 2 1 89 85 87 0.777 6.44 3.10 Intr - 107134 106927 208 1 1 74 48 354 0.739 28.65 3.09 Intr - 107413 107299 115 0 1 59 49 287 0.999 22.35 3.08 Intr - 107996 107851 146 2 2 40 94 289 0.999 23.78 3.07 Intr - 108318 108094 225 0 0 61 75 317 0.999 25.88 3.06 Intr - 109048 108932 117 1 0 71 93 175 0.999 16.96 3.05 Intr - 112486 112433 54 1 0 106 82 78 0.981 8.18 3.04 Intr - 112908 112743 166 2 1 78 86 357 0.998 34.46 3.03 Intr - 113916 113823 94 1 1 96 105 97 0.999 11.22 3.02 Intr - 114330 114122 209 2 2 81 82 507 0.999 48.12 3.01 Init - 116220 116126 95 0 2 94 96 196 0.999 20.85 3.00 Prom - 120844 120805 40 -7.96 4.00 Prom + 121022 121061 40 -6.76 4.01 Init + 129618 129629 12 2 0 84 86 14 0.351 0.82 4.02 Intr + 136973 137116 144 1 0 117 96 53 0.748 9.48 4.03 Intr + 139856 139897 42 1 0 98 87 36 0.138 2.94 4.04 Intr + 169263 169346 84 2 0 88 49 101 0.342 6.22 4.05 Intr + 175512 175685 174 2 0 97 115 99 0.719 13.64 4.06 Intr + 201177 201260 84 2 0 77 86 30 0.614 1.72 4.07 Intr + 201722 201905 184 1 1 102 92 256 0.945 26.76 4.08 Intr + 205702 205838 137 2 2 41 100 76 0.950 4.39 4.09 Intr + 207085 207220 136 0 1 100 80 86 0.981 9.14 4.10 Intr + 209985 210042 58 1 1 98 59 102 0.945 6.24 4.11 Intr + 210719 210830 112 2 1 77 78 165 0.998 14.78 4.12 Intr + 211529 211667 139 1 1 117 97 354 0.999 39.24 4.13 Term + 213036 213316 281 1 2 89 45 525 0.999 43.91 4.14 PlyA + 213703 213708 6 -4.33 5.00 Prom + 214949 214988 40 -12.87 5.01 Init + 215286 215962 677 2 2 40 77 472 0.513 33.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:44862742_45078961|GENSCAN_predicted_peptide_1|312_aa MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQNAKGRFLK IAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRALK SEFLVRENRKYYLDLKENQRGRFLRIRQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLI EFRDALAKLIDDYGGEDDELAGGPGGGAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNK YGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFCRYADEMKEIQERQRDKLYERRGGGSGG GEESEGEEVDED >gi568815591r:44862742_45078961|GENSCAN_predicted_CDS_1|939_bp atggcggacggcgacagcggcagcgagcgcggcggcggcggtgggccgtgcgggttccag cccgcgtcccgcggcggcggcgagcaagagacgcaggagctggcctcgaagcggctggac atccagaacaagcgcttctacttagatgtgaagcagaacgccaagggccgcttcctcaag atcgccgaggtgggcgcgggcggttccaagagccgcctcacgctgtccatggcggtggcc gccgagttccgcgactcgctgggcgacttcatagaacactacgcgcagctgggccctagc agccccgagcagctggcggctggcgccgaggagggcggcgggccgcggcgcgcgctcaag agcgaattcttggtgcgtgagaaccgcaagtactacctggacctcaaggagaaccagcgc ggccgcttcctgcgcatccgccaaacggtcaaccgcggcggtggcggcttcggcgcgggc cccgggccgggcggcttgcagagcggccagaccatcgcgctgcctgcgcagggcctcatc gagttccgcgacgcgctggcgaagctcatagacgactacggaggcgaggacgacgagctg gcaggcggcccgggaggcggcgccgggggcccagggggcggcctgtatggagagctcccg gagggcacctccatcaccgtggactccaagcgcttcttcttcgatgtgggctgcaacaaa tacggggtgtttctgcgagtgagcgaggtgaagccgtcctaccgcaatgccatcaccgta cccttcaaagcctggggcaagttcggaggcgccttttgccggtatgcggatgagatgaaa gaaatccaggaacgacagagggataagctttatgagcgacgtggtgggggcagcggcggc ggcgaagagtcagagggtgaggaggtggatgaggattga >gi568815591r:44862742_45078961|GENSCAN_predicted_peptide_2|210_aa MGQQVLCSTGYTLCSIILGQRNTRASPVPSEKCKTQIGATAITQFSSPPADSEGTVEAAM LLQPKDLRQGYPHSGSGIPTYWSISREGTQTQASTILSTSPGWVSQDSATQGVESPCPPP PPPTICCRPLFHASAIPCIRHNRQRLTHYGKLSQPGQTEMKKHKDEKRNDTNTVMLTFWG ICVKNTIIAAVLKLMSNPPYLLAAPTPNYY >gi568815591r:44862742_45078961|GENSCAN_predicted_CDS_2|633_bp atggggcagcaggtcctctgctccactggctacaccctatgctccatcattttaggccag agaaacaccagggcctctcctgtgcccagtgagaaatgcaagactcaaatcggagccact gccatcacccagtttagctcaccacctgcagactcagaggggacagtggaggctgcaatg ttgctgcaaccaaaggacctgcgccaaggatacccccacagcggtagtgggatccccact tactggagcatctcccgtgaaggcacacagacccaagcatcaaccatcctcagcaccagc ccagggtgggtctcacaggactcagcaacccagggcgttgagtccccatgccccccacca ccacccccaaccatctgctgccggccacttttccatgcttcagccatcccctgcatcagg cacaacaggcaacgtctcactcactacgggaagttgagccagccaggacagacagagatg aagaaacataaagatgaaaagcgtaatgacacaaacaccgtgatgttaacattttgggga atctgtgtgaagaatactattattgcagctgttctgaaactgatgtcaaatccaccctac ctgctggcagctcccacacctaattactactga >gi568815591r:44862742_45078961|GENSCAN_predicted_peptide_3|1014_aa MEDEEGPEYGKPDFVLLDQVTMEDFMRNLQLRFEKGRIYTYIGEVLVSVNPYQELPLYGP EAIARYQGRELYERPPHLYAVANAAYKAMKHRSRDTCIVISGESGAGKTEASKHIMQYIA AVTNPSQRAEVERVKDVLLKSTCVLEAFGNARTNRNHNSSRFGKYMDINFDFKGDPIGGH IHSYLLEKSRVLKQHVGERNFHAFYQALDSDEQSHQAVTEAMRVIGFSPEEVESVHRILA AILHLGNIEFVETEEGGLQKEGLAVAEEALVDHVAELTATPRDLVLRSLLARTVASGGRE LIEKGHTAAEASYARDACAKAVYQRLFEWVVNRINSVMEPRGRDPRRDGKDTVIGVLDIY GFEVFPVNSFEQFCINYCNEKLQQLFIQLILKQEQEEYEREGITWQSVEYFNNATIVDLV ERPHRGILAVLDEACSSAGTITDRIFLQTLDMHHRHHLHYTSRQVPPAVPVPPQWADKTM EFGRDFRIKHYAGDVTYSVEGFIDKNRDFLFQDFKRLLYNSTDPTLRAMWPDGQQDITEV TKRPLTAGTLFKNSMVALVENLASKEPFYVRCIKPNEDKVAGKLDENHCRHQVAYLGLLE NVRVRRAGFASRQPYSRFLLRCGVGRVGSIISPPGLGHTALSCGDRYKMTCEYTWPNHLL GSDKAAVSALLEQHGLQGDVAFGHSKLFIRSPRTLVTLEQSRARLIPIIVLLLQKAWRGT LARWRCRRLRAIYTIMRWFRRHKVRAHLAELQRRFQAARQPPLYGRDLVWPLPPAVLQPF QDTCHALFCRWRARQLVKNIPPSDMPQIKAKVAAMGALQGLRQDWGCRRAWARDYLSSAT DNPTASSLFAQRLKTLQDKDGFGAVLFSSHVRKVNRFHKIRNRALLLTDQHLYKLDPDRQ YRVMRAVPLEAVTGLSVTSGGDQLVVLHARGQDDLVVCLHRSRPPLDNRVGELVGVLAAH CQGEGRTLEVRVSDCIPLSHRGVRRLISVEPRPEQPEPDFRCARGSFTLLWPSR >gi568815591r:44862742_45078961|GENSCAN_predicted_CDS_3|3045_bp atggaggacgaggaaggccctgagtatggcaaacctgactttgtgcttttggaccaagtg accatggaggacttcatgaggaacctgcagctcaggttcgagaagggccgcatctacacc tacatcggtgaggtgctggtgtccgtgaacccctaccaggagctgcccctgtatgggcct gaggccatcgccaggtaccagggccgtgagctctatgagcggccaccccatctctatgct gtggccaacgccgcctacaaggcaatgaagcaccggtccagggacacctgcatcgtcatc tcaggggagagtggggcagggaagacagaagccagtaagcacatcatgcagtacatcgct gctgtcaccaatccaagccagagggctgaggtggagagggtcaaggacgtgctgctcaag tccacctgtgtgctggaggcctttggcaatgcccgcaccaaccgcaatcacaactccagc cgctttggcaagtacatggacatcaactttgacttcaagggggacccgatcggaggacac atccacagctacctactggagaagtctcgggtcctcaagcagcacgtgggtgaaagaaac ttccacgccttctaccaagccttggacagtgatgagcagagccaccaggcagtgaccgag gccatgagggtcatcggcttcagtcctgaagaggtggagtctgtgcatcgcatcctggct gccatattgcacctgggaaacatcgagtttgtggagacggaggagggtgggctgcagaag gagggcctggcagtggccgaggaggcactggtggaccatgtggctgagctgacggccaca ccccgggacctcgtgctccgctccctgctggctcgcacagttgcctcgggaggcagggaa ctcatagagaagggccacactgcagctgaggccagctatgcccgggatgcctgtgccaag gcagtgtaccagcggctgtttgagtgggtggtgaacaggatcaacagtgtcatggaaccc cggggccgggatcctcggcgtgatggcaaggacacagtcattggcgtgctggacatctat ggcttcgaggtgtttcccgtcaacagtttcgagcagttctgcatcaactactgcaacgag aagctgcagcagctattcatccagctcatcctgaagcaggaacaggaagagtacgagcgc gagggcatcacctggcagagcgttgagtatttcaacaacgccaccattgtggatctggtg gagcggccccaccgtggcatcctggccgtgctggacgaggcctgcagctctgctggcacc atcactgaccgaatcttcctgcagaccctggacatgcaccaccgccatcacctacactac accagccgccaggtgcccccggctgtcccagtgccaccacagtgggctgacaagaccatg gagtttggccgagacttccggatcaagcactatgcaggggacgtcacgtactccgtggaa ggcttcatcgacaagaacagagatttcctcttccaggacttcaagcggctgctgtacaac agcacggaccccactctacgggccatgtggccggacgggcagcaggacatcacagaggtg accaagcgccccctgacggctggcacactcttcaagaactccatggtggccctggtggag aaccttgcctccaaggagcccttctacgtccgctgcatcaagcccaatgaggacaaggta gctgggaagctggatgagaaccactgtcgccaccaggtcgcatacctggggctgctggag aatgtgagggtccgcagggctggcttcgcttcccgccagccctactctcgattcctgctc aggtgtggggtgggcagggtgggctccatcatctctcccccaggcctgggccacactgcc ctgtcctgtggtgacaggtacaagatgacctgtgaatacacatggcccaaccacctgctg ggctccgacaaggcagccgtgagcgctctcctggagcagcacgggctgcagggggacgtg gcctttggccacagcaagctgttcatccgctcaccccggacactggtcacactggagcag agccgagcccgcctcatccccatcattgtgctgctattgcagaaggcatggcggggcacc ttggcgaggtggcgctgccggaggctgagggctatctacaccatcatgcgctggttccgg agacacaaggtgcgggctcacctggctgagctgcagcggcgattccaggctgcaaggcag ccgccactctacgggcgtgaccttgtgtggccgctgccccctgctgtgctgcagcccttc caggacacctgccacgcactcttctgcaggtggcgggcccggcagctggtgaagaacatc cccccttcagacatgccccagatcaaggccaaggtggccgccatgggggccctgcaaggg cttcgtcaggactggggctgccgacgggcctgggcccgagactacctgtcctctgccact gacaatcccacagcatcaagcctgtttgctcagcgactaaagacacttcaggacaaagat ggcttcggggctgtgctcttttcaagccatgtccgcaaggtgaaccgcttccacaagatc cggaaccgggccctcctgctcacagaccagcacctctacaagctggaccctgaccggcag taccgggtgatgcgggccgtgccccttgaggcggtgacggggctgagcgtgaccagcgga ggagaccagctggtggtgctgcacgcccgcggccaggacgacctcgtggtgtgcctgcac cgctcccggccgccattggacaaccgcgttggggagctggtgggcgtgctggccgcacac tgccagggggagggccgcaccctggaggttcgcgtctccgactgcatcccactaagccat cgcggggtccggcgcctcatctccgtggagcccaggccggagcagccagagcccgatttc cgctgcgctcgcggctccttcaccctgctctggcccagccgctga >gi568815591r:44862742_45078961|GENSCAN_predicted_peptide_4|528_aa MGLRRPELRLPGREAGWEVGRRKDGGKLKKLELLPRALPRALSGRSEQGAGQDTGISELA ENLWTKLRAEQTLPLKSPIILSENSSYPAFRAEWPGIVSPFKRVFLKGEKSRDKKAHEKV TERRPLHTVVLSLPERVEPDRLLSDYIEKEVKYLGQLTSIPGYLNPSSRTEILHFIDNAK RAHQLPGHLTQEHDAVLSLSAYNVKLAWRDGEDIILRVPIHDIAAVSYVRDDAAHLVVLK TAQDPGISPSQSLCAESSRGLSAGSLSESAVGPVEACCLVILAAESKVAAEELCCLLGQV FQVVYTESTIDFLDRAIFDGASTPTHHLSLHSDDSSTKVDIKETYEVEASTFCFPESVDV GGASPHSKTISESELSASATELLQDYMLTLRTKLSSQEIQQFAALLHEYRNGASIHEFCI NLRQLYGDSRKFLLLGLRPFIPEKDSQHFENFLETIGVKDGRGIITDSFGRHRRALSTTS SSTTNGNRATGSSDDRSAPSEGDEWDRMISDISSDIEALGCSMDQDSA >gi568815591r:44862742_45078961|GENSCAN_predicted_CDS_4|1587_bp atggggctgaggagacctgagctccgcctccctggacgcgaggcaggctgggaagtcggc cgccgtaaagatggcggcaaattgaaaaagttggagctgctcccgcgcgcgctcccgcgc gcgctgagtgggcggagcgaacagggggctggacaggacacaggaatcagtgagttggca gagaacctatggaccaaactgagagctgagcaaacattacctctgaaaagccccatcatc ctcagcgagaacagcagctaccctgcattccgtgctgaatggcctggaattgtctcgcca tttaaacgagtattcctaaaaggtgaaaagagtagagataagaaagcccatgagaaggtg acagagaggcgccctctgcacactgtggtgttgtcattgcctgagcgcgtcgagccagac agactgctgagcgactatattgagaaggaggtaaagtatttaggtcagttaacgtccata ccaggatacctgaatccctccagtaggactgaaatcctgcatttcatagacaatgcaaag agagcccaccagcttccgggacacttgactcaggagcacgatgctgtgctcagcctgtct gcgtacaacgtcaagctggcctggagggacggggaggatatcatcctcagggtgcccatc catgacatcgccgccgtctcctatgttcgggatgacgctgcacacctggtggtcctgaag acagcccaggacccagggatctcccccagccagagtctgtgtgcggaaagttccagaggc ctcagtgcaggctccctgtcggagagtgcagttgggcccgtggaggcatgctgcctggtc atcctggctgcagagagcaaggtcgctgcggaggagctttgctgtctgctaggccaggtc ttccaggttgtttacacggagtccaccatcgactttctggacagagcgatatttgatggg gcctctaccccgacccaccacctgtccctgcacagcgatgactcttctacaaaagtggac attaaggagacctacgaggtggaagccagcactttctgcttccctgaatctgtggatgtg ggtggtgcatcaccccacagcaagaccatcagtgagagcgagctgagcgccagcgccact gagctgctgcaggactacatgctgacgctgcgcaccaagctgtcatcacaggagatccag cagtttgcagcactgctgcacgagtaccgcaatggggcctctatccacgagttctgcatc aacctgcggcagctctacggggacagccgcaagttcctgctgcttggtctgaggcccttc atccctgagaaggacagccagcacttcgagaacttcctggagaccattggcgtgaaggat ggccgcggcatcatcactgacagctttggcaggcaccggcgggccctgagcaccacatcc agttccaccaccaatgggaacagggccacgggcagctctgatgaccggtcggcaccctca gagggggatgagtgggaccgcatgatctcggacatcagcagcgacattgaggcgctgggc tgcagcatggaccaggactcagcatga >gi568815591r:44862742_45078961|GENSCAN_predicted_peptide_5|226_aa MLAPPGSIPGVAPVLQSLVFASRKSRGHGADEGTPEGNEGTPESNEGTSEGGDRTPEGSE GTPGSNGTSGSDGTSEGDEGTPEGGEGTPESSEGTPESNEGTLEGEGTPEGGEGTPESNE GHERVVRGHQRVMGHQRAMRGHRRVLRGHQRAVKGRQRVLRGHQRVVRGHQRVMGTPEGE GTPEGSEGTPESIGTPEGSEGTPEGDEGTPESNEGTSESNEGTSRR >gi568815591r:44862742_45078961|GENSCAN_predicted_CDS_5|678_bp atgcttgcccctccagggtccatcccaggtgtggcccccgtgctgcagtcgctggtcttt gctagcagaaagagtagaggacatggagcagatgaagggacaccagagggtaatgagggg acaccagagagtaatgaagggacatcagagggtggtgacaggacaccagagggcagtgag gggacaccagggagtaatgggacatcagggagtgatgggacatcagaaggtgatgagggg acaccagagggtggtgaggggacaccagagagtagtgaggggacaccagagagtaatgag gggacactagaaggtgaggggacaccagagggcggtgaagggacaccagagagtaatgag gggcacgagagggtggtgaggggacatcagagggtgatgggacaccagagggcgatgagg ggacaccggagagtattgaggggacaccagagggcagtgaagggacgccagagagtactg aggggacaccagagggtggtgaggggacaccagagagtaatggggacaccagaaggtgag gggacgccagagggcagtgaagggacaccggagagtattgggacaccagagggcagtgag gggacaccagagggcgatgaggggacaccagagagtaatgaggggacatcagagagtaat gaggggacatcaaggcgn