GENSCAN 1.0	Date run: 4-Nov-116	Time: 15:04:01

Sequence gi568815593f:160099107_160338656 : 239550 bp : 47.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.02 Intr -   9508   9439   70  0  1   95   73    18 0.037   0.18
 1.01 Init -  20282  19699  584  2  2   75   81   862 0.788  76.86
 1.00 Prom -  40150  40111   40                              -2.56

 2.00 Prom +  67362  67401   40                              -2.36
 2.01 Init +  72984  72990    7  2  1   48  116     0 0.048  -0.25
 2.02 Term +  86463  86665  203  1  2   55   42   202 0.302   9.85
 2.03 PlyA +  87021  87026    6                               1.05

 3.00 Prom +  90865  90904   40                              -7.66
 3.01 Init +  94781  95376  596  1  2   88   47   310 0.028  19.63
 3.02 Intr +  99943 100051  109  1  1   97   70    97 0.133   9.09
 3.03 Intr + 119154 119237   84  2  0   80  100     7 0.081   1.12
 3.04 Intr + 130440 130518   79  2  1   88   97   124 0.170  12.42
 3.05 Intr + 132992 133167  176  0  2   55  100   268 0.981  24.36
 3.06 Intr + 135714 135803   90  2  0   79  105   121 0.757  13.09
 3.07 Term + 139500 139553   54  2  0  131   47    71 0.929   4.86
 3.08 PlyA + 139590 139595    6                               1.05

 4.09 PlyA - 140105 140100    6                               1.05
 4.08 Term - 154692 154272  421  2  1   96   47   336 0.897  25.06
 4.07 Intr - 156602 156443  160  0  1  100   78    73 0.185   6.55
 4.06 Intr - 160665 160363  303  1  0  132   56   392 0.131  36.96
 4.05 Intr - 170298 170227   72  1  0   75   28    81 0.037   0.18
 4.04 Intr - 170438 170357   82  2  1  113  105    70 0.927  10.41
 4.03 Intr - 173652 173555   98  1  2   49   77    81 0.909   2.83
 4.02 Intr - 181632 181419  214  0  1  134   82   420 0.970  44.29
 4.01 Init - 201034 200891  144  1  0   66   59    61 0.018   1.12
 4.00 Prom - 201737 201698   40                              -3.66

 5.07 PlyA - 202507 202502    6                               1.05
 5.06 Term - 206483 206382  102  2  0   67   45   110 0.772   2.88
 5.05 Intr - 212866 212752  115  0  1   76   86    87 0.982   7.75
 5.04 Intr - 213483 213258  226  1  1   42   66    80 0.016  -1.76
 5.03 Intr - 221479 221327  153  2  0   49   97    36 0.218   0.64
 5.02 Intr - 228457 228321  137  0  2  115   69   105 0.538  11.51
 5.01 Init - 232037 232021   17  2  2   87   89     8 0.214   0.61

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 130452 130518   67  2  1   91   97   111 0.817  13.53
S.002 Sngl + 159147 159482  336  2  0   91   37   343 0.936  25.53
S.003 Term - 160665 160175  491  1  2  132   42   388 0.865  33.42
S.004 Init - 213514 213258  257  1  2   30   66   171 0.884   3.82

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:160099107_160338656|GENSCAN_predicted_peptide_1|218_aa
MAAVAAEAAATAASPGEGGAGEAEPEMEPIPGSEAGTDPLPVTATEASVPDGETDGQQSA
PQADEPPLPPPPPPPGELARSPEAVGPELEAEEKLSVRVAESAAAAPQGGPELPPSPASP
PEQPPAPEEREEPPLPQPVAPALVPPAGGDSTVSQLIPGSEVRVTLDHIIEDALVVSFRF
GEKLFSGVLMDLSKRDHETYGAIALLLVGKIYSHSGEA

>gi568815593f:160099107_160338656|GENSCAN_predicted_CDS_1|654_bp
atggcggccgtggctgcagaggcggcagcgactgcagcgtcccccggggaggggggcgcc
ggcgaggccgagccggagatggagcccatccccggcagtgaggccggcactgaccccctc
ccggtcacggccactgaagcgtctgtgccggatggcgagactgacgggcagcaatccgct
cctcaggccgacgagccgccgctcccgccgccaccgccgccgccgggggagctcgcccgc
agcccagaggcggtggggccggagctggaggctgaggagaaactgtccgttcgggtggcg
gagtcggcggcagccgcgcctcagggagggccggaacttccaccttctcctgcatcgccg
ccggagcagcccccggctcccgaggagcgcgaggagccgccgctgcctcagcccgtagcc
ccggcgctcgtgccgccggcgggcggggactccacggtgtcgcaactgatcccgggctcg
gaggtgcgggtcacgctggaccacatcattgaggacgcgcttgtcgtgtcgttccgcttc
ggggagaagctcttctccggggtcctcatggatctgtccaaaagagaccatgaaacgtat
ggtgcaatagccctacttttggtgggaaaaatatacagccattctggtgaagca

>gi568815593f:160099107_160338656|GENSCAN_predicted_peptide_2|69_aa
MQAWEGITQEKSVDKEEKQFQDLTWGEHIIFSCQVEEEPVKESEKQQPVRQEENQDISVK
EQKLCKKAF

>gi568815593f:160099107_160338656|GENSCAN_predicted_CDS_2|210_bp
atgcaagcatgggaaggaatcactcaggagaagagcgttgacaaggaagagaagcagttc
caggacctcacctggggggagcacattatattttcttgccaagtagaggaggaacctgtg
aaggagagtgagaagcagcagccagtgaggcaggaggaaaaccaggacatttctgtgaaa
gaacaaaagctttgcaagaaagccttttaa

>gi568815593f:160099107_160338656|GENSCAN_predicted_peptide_3|395_aa
MRSHSSALGRSMGLGAVGQGAALVGEARAAQVPTEAGEGSCMAGCRSRALPHGKAAKARR
EIKRSAGGPALLGDPVHPPQPLAQVLSPSLPGASRAGRLLPVRDPPSPRPPATPAGPQAP
HASARSPSSCSRLSLHTSLQAEGVGSSLGQPRKGLPQCSGGLKGSSSAAKVGAQAEEAPR
VSEGCEDCQHAVISHLRLQLCTSGSRELTGRGWSSPEAMKTVTMMMVVEMQALTQIGTII
IQILQMRKLRHKKINKVPKATEPPPSSMAFTGKFEMESEKNYDEFMKLLGISSDVIEKAR
NFKIVTEVQQDGQDFTWSQHYSGGHTMTNKFTVGKESNIQTMGGKTFKATVQMEGGKLVV
NFPNYHQTSEIVGDKLVEVSTIGGVTYERVSKRLA

>gi568815593f:160099107_160338656|GENSCAN_predicted_CDS_3|1188_bp
atgcgctcgcactcctcagcccttgggcggtcgatgggactgggcgccgtggggcagggg
gcggcgctcgtcggggaggctcgggctgcacaggtgcccacggaggcgggggaaggctcc
tgcatggctggctgcaggtcccgagccctgccccatgggaaggcagctaaggcccggcga
gaaatcaagcgcagcgccggtgggccggcactgctgggggacccagtacaccctccgcag
ccgctggcccaggtgctaagtccctcattgcccggggccagcagggccggccggctgctc
ccagtgcgggacccaccaagcccacgcccaccggcaactccagctggcccgcaagcgccg
cacgcatccgcacgcagccccagttcctgctcgcgcctctccctccacacctccctgcaa
gctgagggagtgggctccagccttggccagcccagaaagggactcccacagtgcagtggt
gggctgaagggctcctcaagtgccgccaaagtgggagcccaggcagaggaggccccgaga
gtgagcgagggctgtgaggactgccagcacgctgttatctctcatttgaggctgcaactt
tgcacctctggctccagggagctcacaggcaggggctggtccagcccagaggcgatgaag
acagtgacgatgatgatggtggtggagatgcaggcgctgactcagataggcaccattatt
atccagattttgcagatgagaaaactgagacacaaaaaaataaataaggtgcccaaggcc
acagaaccacctcccagcagcatggctttcaccggcaagttcgagatggagagtgagaag
aattatgatgagttcatgaagctccttgggatctccagcgatgtaatcgaaaaggcccgc
aacttcaagatcgtcacggaggtgcagcaggatgggcaggacttcacttggtcccagcac
tactccgggggccacaccatgaccaacaagttcactgttggcaaggaaagcaacatacag
acaatggggggcaagacgttcaaggccactgtgcagatggagggcgggaagctggtggtg
aatttccccaactatcaccagacctcagagatcgtgggtgacaagctggtggaggtctcc
accatcggaggcgtgacctatgagcgcgtgagcaagagactggcctaa

>gi568815593f:160099107_160338656|GENSCAN_predicted_peptide_4|497_aa
MDTHAMPPAAKSGSGHQVIWVEGLGGHEEEGEDVIQNRSIGNWIDSVIELKLPTFRAHSP
LLKSRRFFVDILTLLSSHCQLCPAARHLAVYLLDHFMDRYNVTTSKQLYTVAVSCLLLAK
VLTQPTADETRFSVNLTSFVGAFGTVSIGGTVVGTKDQGPQCPHVQRSSAVRPGPEAAPG
DNSYPPYEDRSTCGMQAPDASYSGKFEDREDHVPKLEQINSTRILSSQNFTLTKKELLST
ELLLLEAFSWNLCLPTPAHFLDYYLLASVSQKDHHCHTWPTTCPRKTKECLKEYAHYFLE
VTLQDHIFYKFQPSVVAAACVGASRICLQLSPYWTRDLQRISSYSLEHLSTCIEILLVVY
DNVLKDAVAVKSQALAMVPGTPPTPTQVLFQPPAYPALGQPATTLAQFQTPVQDLCLAYR
DSLQAHRSGSLLSGSTGSSLHTPYQPLQPLDMCPVPVPASLSMHMAIAAEPRHCLATTYG
SSYFSGSHMFPTGCFDR

>gi568815593f:160099107_160338656|GENSCAN_predicted_CDS_4|1494_bp
atggatacccatgccatgccgccagctgctaagtctggaagtggtcatcaagtgatttgg
gtggaagggttggggggacacgaggaggaaggagaggatgtcattcagaacagaagcatc
gggaattggatagattcagttattgaactgaagctgcccaccttccgagcccactcccca
ctcctgaagagccgccggttcttcgtggacatcctgaccctgctgagcagccactgccag
ctctgccctgcagcccggcacctggccgtctacctgctggaccacttcatggatcgctac
aacgtcaccacctccaagcagctctacaccgtggccgtctcctgcctcctgcttgcaaaa
gttttaactcaaccaacagctgatgagacccgtttttccgtaaatttgacttcatttgtg
ggagcctttggaactgttagcattggtggtacagtggtcggaacaaaggatcagggtccc
cagtgcccacacgttcagcgcagcagtgccgtcagacctgggccagaggcagcccctgga
gataactcctatcctccgtacgaggacagaagcacttgtgggatgcaggccccagatgcc
agctattcgggtaagttcgaggatcgggaagaccacgtccccaagttggagcaaataaac
agcacgaggatcctgagcagccagaacttcaccctcaccaagaaggagctgctgagcaca
gagctgctgctcctggaggccttcagctggaacctctgcctgcccacgcctgcccacttc
ctggactactacctcttggcctccgtcagccagaaggaccaccactgccacacctggccc
accacctgcccccgcaagaccaaagagtgcctcaaggagtatgcccattacttcctagag
gtcaccctgcaagatcacatattctacaaattccagccttctgtggtcgctgcggcctgt
gttggggcctccaggatttgcctgcagctttctccctactggaccagagacctgcagagg
atctcaagctattccctggagcacctcagcacgtgtattgaaatcctgctggtagtgtat
gacaacgtcctcaaggatgccgtagccgtcaagagccaggccttggcaatggtgcccggc
acaccccccacccccactcaagtgctgttccagccaccagcctacccggccctcggccag
ccagcgaccaccctggcacagttccagacccccgtgcaggacctatgcttggcctatcgg
gactccttgcaggcccaccgttcagggagcctgctctcggggagtacaggctcatccctc
cacaccccgtaccaaccgctgcagcccttggatatgtgtcccgtgcccgtccctgcatcc
cttagcatgcatatggccattgcagctgagcccaggcactgcctcgccaccacctatgga
agcagctacttcagtgggagccacatgttccccaccggctgctttgacagatag

>gi568815593f:160099107_160338656|GENSCAN_predicted_peptide_5|249_aa
MASDKWLCLPESQTGSSYIAALEKALAAREESKLSPMAVAELSGGPSGCATVLIAPHCFL
ALWLVWTLTVKLRRFFPHKEEYFSPSKFLPKVGEWGKLDQEGPREGGPRRPGACEPRRDP
SARRGAATAGASRAAEAAVGSAERLGRGSNRRQAPEGTAGSALGGARRSQPPTPREDAVR
PESRVASDSGAAYAMMDEPWWEGRVASDVHCTLREKFVFWRSPTDVLSLNHPEKPAAMDV
GLAVGIGKG

>gi568815593f:160099107_160338656|GENSCAN_predicted_CDS_5|750_bp
atggcaagtgataagtggttgtgcttgcctgagagccaaactggttccagctacatcgct
gcattggagaaggctttggcagccagagaagagtcaaagctttcccctatggctgtggct
gaactcagtggtggcccaagtggctgtgcaacagtcctcatcgctccacactgcttcctg
gcactgtggctggtgtggacattgactgtcaaactcaggcggttttttcctcataaagag
gaatacttctctccctccaagtttcttccaaaggttggggagtgggggaagttagaccag
gagggcccgagggagggaggcccccggcggccgggagcctgcgagccgcggcgggacccg
agcgcacgcaggggcgcggcgacggcgggggctagtcgggctgcggaggcggccgtcggg
agtgcggagcgcctcggacgagggtccaaccgccggcaggcaccagagggcacggctggc
tcggcactgggaggggcccggcgctcgcagccccccacgcccagagaggatgcggtgcgc
cctgagagccgggtagcctcggatagcggcgctgcgtacgcgatgatggatgagccgtgg
tgggaagggcgcgtcgcctcggacgtccactgcaccctgcgcgagaagtttgttttctgg
cgctctccaacggatgtgctctctctaaaccatcctgagaagccagccgctatggacgtg
ggccttgctgtgggcattggaaagggctga