GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:28:29 Sequence gi568815591r:44784413_44985266 : 200854 bp : 46.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11237 11285 49 1 1 69 80 63 0.524 2.72 1.02 Intr + 12278 12436 159 0 0 40 47 173 0.439 8.36 1.03 Intr + 14980 15068 89 2 2 61 96 46 0.944 2.39 1.04 Intr + 15290 15462 173 1 2 70 79 196 0.999 15.64 1.05 Term + 16875 17010 136 0 1 82 38 226 0.997 14.49 1.06 PlyA + 17198 17203 6 1.05 2.00 Prom + 24962 25001 40 -3.36 2.01 Init + 37591 37646 56 0 2 32 86 54 0.162 0.36 2.02 Intr + 38860 38963 104 1 2 94 63 40 0.253 1.92 2.03 Intr + 61779 61859 81 1 0 104 119 10 0.038 5.31 2.04 Term + 63523 63998 476 2 2 95 38 185 0.041 9.25 2.05 PlyA + 64808 64813 6 1.05 3.02 PlyA - 65030 65025 6 1.05 3.01 Sngl - 100936 99998 939 1 0 76 47 2062 0.959 197.41 3.00 Prom - 103143 103104 40 -3.06 4.07 PlyA - 103340 103335 6 1.05 4.06 Term - 103596 103539 58 2 1 100 46 67 0.193 0.86 4.05 Intr - 108235 108130 106 2 1 49 84 83 0.156 3.27 4.04 Intr - 109050 108995 56 2 2 119 58 27 0.115 1.42 4.03 Intr - 137777 137674 104 2 2 45 81 62 0.022 0.17 4.02 Intr - 151365 151171 195 0 0 48 80 161 0.620 10.91 4.01 Init - 153144 153031 114 0 0 76 83 33 0.439 1.83 4.00 Prom - 166491 166452 40 -3.16 5.22 PlyA - 166588 166583 6 1.05 5.21 Term - 178483 178327 157 0 1 112 53 222 0.999 18.51 5.20 Intr - 178712 178558 155 2 2 87 84 224 0.999 20.77 5.19 Intr - 179750 179637 114 2 0 90 89 253 0.984 26.24 5.18 Intr - 180107 180003 105 2 0 103 72 91 0.674 9.41 5.17 Intr - 180677 180533 145 1 1 91 84 209 0.998 21.08 5.16 Intr - 181448 181225 224 2 2 65 79 165 0.994 10.23 5.15 Intr - 181943 181661 283 1 1 102 100 300 0.521 30.02 5.14 Intr - 182426 182260 167 2 2 112 84 236 0.983 24.36 5.13 Intr - 183325 183193 133 0 1 47 95 175 0.975 14.75 5.12 Intr - 183546 183472 75 2 0 90 99 207 0.999 20.63 5.11 Intr - 185058 185001 58 1 1 89 85 87 0.777 6.44 5.10 Intr - 185463 185256 208 0 1 74 48 354 0.739 28.65 5.09 Intr - 185742 185628 115 2 1 59 49 287 0.999 22.35 5.08 Intr - 186325 186180 146 1 2 40 94 289 0.999 23.78 5.07 Intr - 186647 186423 225 2 0 61 75 317 0.999 25.88 5.06 Intr - 187377 187261 117 0 0 71 93 175 0.999 16.96 5.05 Intr - 190815 190762 54 0 0 106 82 78 0.981 8.18 5.04 Intr - 191237 191072 166 1 1 78 86 357 0.998 34.46 5.03 Intr - 192245 192152 94 0 1 96 105 97 0.999 11.22 5.02 Intr - 192659 192451 209 1 2 81 82 507 0.999 48.12 5.01 Init - 194549 194455 95 2 2 94 96 196 0.999 20.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 63558 63998 441 2 0 49 38 268 0.955 12.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:44784413_44985266|GENSCAN_predicted_peptide_1|201_aa MKRSSPALASAPPKAPDATAEENRVLLAMVNPTVFFDIAVDGEPLGRVSFEVGRAAACGN GAQKVGRGRENFRALSTGEKGFGYKGSCFHRIIPGFMCQGGDFTRHNGTGGKSIYGEKFE DENFILKHTGPGILSMANAGPNTNGSQFFICTAKTEWLDGKHVVFGKVKEGMNIVEAMER FGSRNGKTSKKITIADCGQLE >gi568815591r:44784413_44985266|GENSCAN_predicted_CDS_1|606_bp atgaagcgatcctccccggccttggcctccgcgcctcctaaagcgccagacgccaccgcc gaggaaaaccgtgtactattagccatggtcaaccccaccgtgttcttcgacattgccgtc gacggcgagcccttgggccgcgtctcctttgaggtcgggcgggcggcggcgtgcgggaat ggggcccagaaagtgggccggggtcgggaaaattttcgtgctctgagcactggagagaaa ggatttggttataagggttcctgctttcacagaattattccagggtttatgtgtcagggt ggtgacttcacacgccataatggcactggtggcaagtccatctatggggagaaatttgaa gatgagaacttcatcctaaagcatacgggtcctggcatcttgtccatggcaaatgctgga cccaacacaaatggttcccagtttttcatctgcactgccaagactgagtggttggatggc aagcatgtggtgtttggcaaagtgaaagaaggcatgaatattgtggaggccatggagcgc tttgggtccaggaatggcaagaccagcaagaagatcaccattgctgactgtggacaactc gaataa >gi568815591r:44784413_44985266|GENSCAN_predicted_peptide_2|238_aa MHGVRKKPSYNSTKSSMDGLILHPATGLVFVLSKQCEEIHQPVVWTCEQREAESLKSIKN DNSMSPAFLAYVNTYVFLATGPVPPAHPAALTMFSAPTPPPLGRAPSRCRPAPPPPLSQH RPPPPEPDNTPCPPRAAVANARLSCWFEPQRLFNRAPPSSQTPPPRHRLPNGCLFLHTAR GGGASSPNSDALSAFHQAPKPDEQKGKPMRLDASQTKANKNIAREQGRRATGKDSKIS >gi568815591r:44784413_44985266|GENSCAN_predicted_CDS_2|717_bp atgcatggagtaagaaagaaaccatcctacaatagcaccaaatccagcatggatggactc atcctccaccctgctactggacttgtctttgtactctcaaagcagtgtgaggagattcat caaccggtggtgtggacatgtgaacagcgtgaggcagagagtttaaaatctatcaagaac gacaactcaatgtctcctgcatttttagcttatgttaatacttatgtatttttagctact ggccccgtgcccccggcccacccggccgcccttaccatgttctcggcgccgactccgcct ccgctcggccgcgcgccctcccgctgccgacccgcgccgccgccgccgctctcgcagcac cgaccgccgccgccggagccggacaataccccgtgcccgcctcgcgctgctgtggccaat gcccgcttgtcttgctggttcgaaccccagcggctgttcaatcgcgcgcctccttctagc cagaccccgcccccccggcaccgccttcctaacggctgtttgtttttgcacacggcacgc ggaggcggggcctccagccccaatagtgacgcgctctctgcctttcaccaggcgcccaag cctgacgaacagaaaggcaaaccaatgagattggacgcctcgcagacgaaagccaataaa aatatcgcccgcgagcaagggaggcgggccactgggaaggacagcaaaattagctaa >gi568815591r:44784413_44985266|GENSCAN_predicted_peptide_3|312_aa MADGDSGSERGGGGGPCGFQPASRGGGEQETQELASKRLDIQNKRFYLDVKQNAKGRFLK IAEVGAGGSKSRLTLSMAVAAEFRDSLGDFIEHYAQLGPSSPEQLAAGAEEGGGPRRALK SEFLVRENRKYYLDLKENQRGRFLRIRQTVNRGGGGFGAGPGPGGLQSGQTIALPAQGLI EFRDALAKLIDDYGGEDDELAGGPGGGAGGPGGGLYGELPEGTSITVDSKRFFFDVGCNK YGVFLRVSEVKPSYRNAITVPFKAWGKFGGAFCRYADEMKEIQERQRDKLYERRGGGSGG GEESEGEEVDED >gi568815591r:44784413_44985266|GENSCAN_predicted_CDS_3|939_bp atggcggacggcgacagcggcagcgagcgcggcggcggcggtgggccgtgcgggttccag cccgcgtcccgcggcggcggcgagcaagagacgcaggagctggcctcgaagcggctggac atccagaacaagcgcttctacttagatgtgaagcagaacgccaagggccgcttcctcaag atcgccgaggtgggcgcgggcggttccaagagccgcctcacgctgtccatggcggtggcc gccgagttccgcgactcgctgggcgacttcatagaacactacgcgcagctgggccctagc agccccgagcagctggcggctggcgccgaggagggcggcgggccgcggcgcgcgctcaag agcgaattcttggtgcgtgagaaccgcaagtactacctggacctcaaggagaaccagcgc ggccgcttcctgcgcatccgccaaacggtcaaccgcggcggtggcggcttcggcgcgggc cccgggccgggcggcttgcagagcggccagaccatcgcgctgcctgcgcagggcctcatc gagttccgcgacgcgctggcgaagctcatagacgactacggaggcgaggacgacgagctg gcaggcggcccgggaggcggcgccgggggcccagggggcggcctgtatggagagctcccg gagggcacctccatcaccgtggactccaagcgcttcttcttcgatgtgggctgcaacaaa tacggggtgtttctgcgagtgagcgaggtgaagccgtcctaccgcaatgccatcaccgta cccttcaaagcctggggcaagttcggaggcgccttttgccggtatgcggatgagatgaaa gaaatccaggaacgacagagggataagctttatgagcgacgtggtgggggcagcggcggc ggcgaagagtcagagggtgaggaggtggatgaggattga >gi568815591r:44784413_44985266|GENSCAN_predicted_peptide_4|210_aa MGQQVLCSTGYTLCSIILGQRNTRASPVPSEKCKTQIGATAITQFSSPPADSEGTVEAAM LLQPKDLRQGYPHSGSGIPTYWSISREGTQTQASTILSTSPGWVSQDSATQGVESPCPPP PPPTICCRPLFHASAIPCIRHNRQRLTHYGKLSQPGQTEMKKHKDEKRNDTNTVMLTFWG ICVKNTIIAAVLKLMSNPPYLLAAPTPNYY >gi568815591r:44784413_44985266|GENSCAN_predicted_CDS_4|633_bp atggggcagcaggtcctctgctccactggctacaccctatgctccatcattttaggccag agaaacaccagggcctctcctgtgcccagtgagaaatgcaagactcaaatcggagccact gccatcacccagtttagctcaccacctgcagactcagaggggacagtggaggctgcaatg ttgctgcaaccaaaggacctgcgccaaggatacccccacagcggtagtgggatccccact tactggagcatctcccgtgaaggcacacagacccaagcatcaaccatcctcagcaccagc ccagggtgggtctcacaggactcagcaacccagggcgttgagtccccatgccccccacca ccacccccaaccatctgctgccggccacttttccatgcttcagccatcccctgcatcagg cacaacaggcaacgtctcactcactacgggaagttgagccagccaggacagacagagatg aagaaacataaagatgaaaagcgtaatgacacaaacaccgtgatgttaacattttgggga atctgtgtgaagaatactattattgcagctgttctgaaactgatgtcaaatccaccctac ctgctggcagctcccacacctaattactactga >gi568815591r:44784413_44985266|GENSCAN_predicted_peptide_5|1014_aa MEDEEGPEYGKPDFVLLDQVTMEDFMRNLQLRFEKGRIYTYIGEVLVSVNPYQELPLYGP EAIARYQGRELYERPPHLYAVANAAYKAMKHRSRDTCIVISGESGAGKTEASKHIMQYIA AVTNPSQRAEVERVKDVLLKSTCVLEAFGNARTNRNHNSSRFGKYMDINFDFKGDPIGGH IHSYLLEKSRVLKQHVGERNFHAFYQALDSDEQSHQAVTEAMRVIGFSPEEVESVHRILA AILHLGNIEFVETEEGGLQKEGLAVAEEALVDHVAELTATPRDLVLRSLLARTVASGGRE LIEKGHTAAEASYARDACAKAVYQRLFEWVVNRINSVMEPRGRDPRRDGKDTVIGVLDIY GFEVFPVNSFEQFCINYCNEKLQQLFIQLILKQEQEEYEREGITWQSVEYFNNATIVDLV ERPHRGILAVLDEACSSAGTITDRIFLQTLDMHHRHHLHYTSRQVPPAVPVPPQWADKTM EFGRDFRIKHYAGDVTYSVEGFIDKNRDFLFQDFKRLLYNSTDPTLRAMWPDGQQDITEV TKRPLTAGTLFKNSMVALVENLASKEPFYVRCIKPNEDKVAGKLDENHCRHQVAYLGLLE NVRVRRAGFASRQPYSRFLLRCGVGRVGSIISPPGLGHTALSCGDRYKMTCEYTWPNHLL GSDKAAVSALLEQHGLQGDVAFGHSKLFIRSPRTLVTLEQSRARLIPIIVLLLQKAWRGT LARWRCRRLRAIYTIMRWFRRHKVRAHLAELQRRFQAARQPPLYGRDLVWPLPPAVLQPF QDTCHALFCRWRARQLVKNIPPSDMPQIKAKVAAMGALQGLRQDWGCRRAWARDYLSSAT DNPTASSLFAQRLKTLQDKDGFGAVLFSSHVRKVNRFHKIRNRALLLTDQHLYKLDPDRQ YRVMRAVPLEAVTGLSVTSGGDQLVVLHARGQDDLVVCLHRSRPPLDNRVGELVGVLAAH CQGEGRTLEVRVSDCIPLSHRGVRRLISVEPRPEQPEPDFRCARGSFTLLWPSR >gi568815591r:44784413_44985266|GENSCAN_predicted_CDS_5|3045_bp atggaggacgaggaaggccctgagtatggcaaacctgactttgtgcttttggaccaagtg accatggaggacttcatgaggaacctgcagctcaggttcgagaagggccgcatctacacc tacatcggtgaggtgctggtgtccgtgaacccctaccaggagctgcccctgtatgggcct gaggccatcgccaggtaccagggccgtgagctctatgagcggccaccccatctctatgct gtggccaacgccgcctacaaggcaatgaagcaccggtccagggacacctgcatcgtcatc tcaggggagagtggggcagggaagacagaagccagtaagcacatcatgcagtacatcgct gctgtcaccaatccaagccagagggctgaggtggagagggtcaaggacgtgctgctcaag tccacctgtgtgctggaggcctttggcaatgcccgcaccaaccgcaatcacaactccagc cgctttggcaagtacatggacatcaactttgacttcaagggggacccgatcggaggacac atccacagctacctactggagaagtctcgggtcctcaagcagcacgtgggtgaaagaaac ttccacgccttctaccaagccttggacagtgatgagcagagccaccaggcagtgaccgag gccatgagggtcatcggcttcagtcctgaagaggtggagtctgtgcatcgcatcctggct gccatattgcacctgggaaacatcgagtttgtggagacggaggagggtgggctgcagaag gagggcctggcagtggccgaggaggcactggtggaccatgtggctgagctgacggccaca ccccgggacctcgtgctccgctccctgctggctcgcacagttgcctcgggaggcagggaa ctcatagagaagggccacactgcagctgaggccagctatgcccgggatgcctgtgccaag gcagtgtaccagcggctgtttgagtgggtggtgaacaggatcaacagtgtcatggaaccc cggggccgggatcctcggcgtgatggcaaggacacagtcattggcgtgctggacatctat ggcttcgaggtgtttcccgtcaacagtttcgagcagttctgcatcaactactgcaacgag aagctgcagcagctattcatccagctcatcctgaagcaggaacaggaagagtacgagcgc gagggcatcacctggcagagcgttgagtatttcaacaacgccaccattgtggatctggtg gagcggccccaccgtggcatcctggccgtgctggacgaggcctgcagctctgctggcacc atcactgaccgaatcttcctgcagaccctggacatgcaccaccgccatcacctacactac accagccgccaggtgcccccggctgtcccagtgccaccacagtgggctgacaagaccatg gagtttggccgagacttccggatcaagcactatgcaggggacgtcacgtactccgtggaa ggcttcatcgacaagaacagagatttcctcttccaggacttcaagcggctgctgtacaac agcacggaccccactctacgggccatgtggccggacgggcagcaggacatcacagaggtg accaagcgccccctgacggctggcacactcttcaagaactccatggtggccctggtggag aaccttgcctccaaggagcccttctacgtccgctgcatcaagcccaatgaggacaaggta gctgggaagctggatgagaaccactgtcgccaccaggtcgcatacctggggctgctggag aatgtgagggtccgcagggctggcttcgcttcccgccagccctactctcgattcctgctc aggtgtggggtgggcagggtgggctccatcatctctcccccaggcctgggccacactgcc ctgtcctgtggtgacaggtacaagatgacctgtgaatacacatggcccaaccacctgctg ggctccgacaaggcagccgtgagcgctctcctggagcagcacgggctgcagggggacgtg gcctttggccacagcaagctgttcatccgctcaccccggacactggtcacactggagcag agccgagcccgcctcatccccatcattgtgctgctattgcagaaggcatggcggggcacc ttggcgaggtggcgctgccggaggctgagggctatctacaccatcatgcgctggttccgg agacacaaggtgcgggctcacctggctgagctgcagcggcgattccaggctgcaaggcag ccgccactctacgggcgtgaccttgtgtggccgctgccccctgctgtgctgcagcccttc caggacacctgccacgcactcttctgcaggtggcgggcccggcagctggtgaagaacatc cccccttcagacatgccccagatcaaggccaaggtggccgccatgggggccctgcaaggg cttcgtcaggactggggctgccgacgggcctgggcccgagactacctgtcctctgccact gacaatcccacagcatcaagcctgtttgctcagcgactaaagacacttcaggacaaagat ggcttcggggctgtgctcttttcaagccatgtccgcaaggtgaaccgcttccacaagatc cggaaccgggccctcctgctcacagaccagcacctctacaagctggaccctgaccggcag taccgggtgatgcgggccgtgccccttgaggcggtgacggggctgagcgtgaccagcgga ggagaccagctggtggtgctgcacgcccgcggccaggacgacctcgtggtgtgcctgcac cgctcccggccgccattggacaaccgcgttggggagctggtgggcgtgctggccgcacac tgccagggggagggccgcaccctggaggttcgcgtctccgactgcatcccactaagccat cgcggggtccggcgcctcatctccgtggagcccaggccggagcagccagagcccgatttc cgctgcgctcgcggctccttcaccctgctctggcccagccgctga