GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:23:55 Sequence gi568815591f:74115976_74324739 : 208764 bp : 49.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4531 4663 133 1 1 47 92 127 0.583 9.65 1.02 Intr + 4917 5074 158 2 2 63 105 271 0.999 25.11 1.03 Term + 5164 5326 163 1 1 83 52 169 0.994 10.21 1.04 PlyA + 6522 6527 6 1.05 2.00 Prom + 49432 49471 40 -0.36 2.01 Init + 58409 58467 59 1 2 56 72 209 0.663 16.88 2.02 Intr + 71636 71823 188 2 2 116 113 117 0.994 16.33 2.03 Intr + 72344 72430 87 0 0 55 75 100 0.906 5.34 2.04 Intr + 78766 78903 138 2 0 96 102 59 0.981 8.54 2.05 Term + 79194 79333 140 1 2 115 52 74 0.987 4.63 2.06 PlyA + 81098 81103 6 1.05 3.03 PlyA - 81140 81135 6 1.05 3.02 Term - 87503 87409 95 0 2 62 54 106 0.643 2.59 3.01 Init - 90599 90551 49 2 1 66 111 -5 0.249 0.79 3.00 Prom - 96619 96580 40 -1.56 4.00 Prom + 98751 98790 40 -7.76 4.01 Init + 100001 100094 94 1 1 104 115 110 0.960 13.84 4.02 Intr + 100850 100889 40 0 1 114 65 26 0.697 0.18 4.03 Intr + 104242 104279 38 1 2 151 96 14 0.746 6.41 4.04 Intr + 104729 104759 31 0 1 114 84 0 0.654 -0.61 4.05 Intr + 105662 105717 56 2 2 70 100 66 0.662 4.62 4.06 Intr + 107749 107808 60 2 0 127 94 104 0.999 13.61 4.07 Intr + 108043 108222 180 2 0 111 69 36 0.931 3.84 4.08 Term + 108664 108767 104 2 2 116 49 84 0.940 5.74 4.09 PlyA + 109415 109420 6 1.05 5.12 PlyA - 111245 111240 6 1.05 5.11 Term - 116241 116114 128 1 2 88 42 52 0.731 -0.96 5.10 Intr - 119670 119553 118 0 1 77 89 126 0.948 11.64 5.09 Intr - 121467 121387 81 0 0 112 58 120 0.996 11.23 5.08 Intr - 123013 122948 66 1 0 77 82 100 0.936 7.40 5.07 Intr - 124120 123963 158 2 2 84 91 251 0.947 24.73 5.06 Intr - 127271 127171 101 1 2 89 109 109 0.931 12.85 5.05 Intr - 130788 130687 102 2 0 58 80 88 0.941 4.29 5.04 Intr - 133143 133037 107 0 2 32 96 92 0.992 3.41 5.03 Intr - 133805 133764 42 2 0 79 101 44 0.901 3.24 5.02 Intr - 136523 136454 70 1 1 78 105 48 0.969 4.68 5.01 Init - 138408 138296 113 0 2 84 65 146 0.923 11.61 5.00 Prom - 142220 142181 40 -1.96 6.06 PlyA - 142918 142913 6 1.05 6.05 Term - 163866 163670 197 1 2 61 43 91 0.086 -0.53 6.04 Intr - 171007 170959 49 1 1 41 105 61 0.127 1.35 6.03 Intr - 173246 173087 160 1 1 132 100 66 0.549 12.19 6.02 Intr - 181192 181021 172 2 1 46 32 143 0.060 3.50 6.01 Init - 185119 185071 49 1 1 55 91 53 0.861 3.41 6.00 Prom - 185236 185197 40 -6.06 7.05 PlyA - 186228 186223 6 1.05 7.04 Term - 187109 186692 418 1 1 84 53 171 0.503 7.95 7.03 Intr - 196877 196551 327 1 0 69 113 99 0.400 5.31 7.02 Intr - 201791 201554 238 0 1 46 43 129 0.001 0.87 7.01 Init - 206461 206413 49 1 1 86 58 73 0.027 3.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 73698 73762 65 1 2 85 92 92 0.972 7.74 S.002 Intr + 73847 73943 97 1 1 54 45 122 0.920 4.08 S.003 Term + 201505 201681 177 0 0 151 36 139 0.815 12.69 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_1|151_aa XWPGPLPFGIPGTPMCSSADSRSLYPGRSYDEKVDVFSFGIVLCEIIGRVNADPDYLPRT MDFGLNVRGFLDRYCPPNCPPSFFPITVRCCDLDPEKRPSFVKLEHWLETLRMHLAGHLP LGPQLEQLDRGFWETYRRGESGLPAHPEVPD >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_1|456_bp nnctggcctggtcccctgccttttggcatccctggcacccccatgtgttcatctgctgac agtcggtctctttatccaggccgcagctatgatgagaaggtggatgtgttctcctttggg atcgtcctgtgcgagatcatcgggcgggtgaacgcagaccctgactacctgccccgcacc atggactttggcctcaacgtgcgaggattcctggaccgctactgccccccaaactgcccc ccgagcttcttccccatcaccgtgcgctgttgcgatctggaccccgagaagaggccatcc tttgtgaagctggaacactggctggagaccctccgcatgcacctggccggccacctgcca ctgggcccacagctggagcagctggacagaggtttctgggagacctaccggcgcggcgag agcggactgcctgcccaccctgaggtccccgactga >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_2|203_aa MADFDTYDDRAYSSFGGGRGSRGSAGGHGSRSQKELPTEPPYTAYVGNLPFNTVQGDIDA IFKDLSIRSVRLVRDKDTDKFKGCGSKEQEEEGIRAAFSCLQGFLKEVCLLCFRDDFLGG RGGSRPGDRRTGPPMGSRFRDGPPLRGSNMDFREPTEEERAQRPRLQLKPRTVATPLNQV ANPNSAIFGGARPREEVVQKEQE >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_2|612_bp atggcggacttcgacacctacgacgatcgggcctacagcagcttcggcggcggcagaggg tcccgcggcagtgctggtggccatggttcccgtagccagaaggagttgcccacagagccc ccctacacagcatacgtaggaaatctacctttcaatacggttcagggcgacatagatgct atctttaaggatctcagcataaggagtgtacggctagtcagagacaaagacacagataaa tttaaagggtgcgggagcaaagagcaggaagaagaagggatccgagcagcattctcctgc ctgcagggcttcctgaaggaagtgtgcctgctgtgcttcagggatgacttcttagggggc aggggaggtagtcgcccaggcgaccggcgaacaggcccccccatgggcagccgcttcaga gatggccctcccctccgtggatccaacatggatttcagagaacccacagaagaggaaaga gcacagagaccacgactccagcttaaacctcgaacagtcgcgacgcccctcaatcaagta gccaatcccaactctgctatcttcgggggtgccaggcctagagaggaagtcgttcaaaag gagcaagaatga >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_3|47_aa MPSALDSGPRFLPTVTEQNCSCHRKQKPNACGHQQNYRREEKLFNSC >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_3|144_bp atgccaagtgcactggattcaggaccacgtttccttcccacggtaaccgagcagaactgc tcctgccacagaaaacagaagcccaatgcttgcggccatcagcagaattatcggagagag gaaaaacttttcaacagctgctga >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_4|200_aa MSSGTELLWPGAALLVLLGVAASLCVRCSRPGAKRSEKIYQQRSLKDKLLQFYPSLEGSR HGSEEAYIDPIAMEYYNWGRFSKPPEDDDANSYENVLICKQKTTETGAQQEGIGGLCRGD LSLSLALKTGPTSGLCPSASPEEDEESEDYQNSASIHQWRESRKVMGQLQREASPGPVGS PDEEDGEPDYVNGEVAATEA >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_4|603_bp atgagctcggggactgaactgctgtggcccggagcagcgctgctggtgctgttgggggtg gcagccagtctgtgtgtgcgctgctcacgcccaggtgcaaagaggtcagagaaaatctac cagcagagaagtctgaaggacaagctgttgcaattctaccccagcctggagggaagcaga cacgggtcggaggaagcctacatagaccccattgccatggagtattacaactgggggcgg ttctcgaagcccccagaagatgatgatgccaattcctacgagaatgtgctcatttgcaag cagaaaaccacagagacaggtgcccagcaggagggcataggtggcctctgcagaggggac ctcagcctgtcactggccctgaagactggccccacttctggtctctgtccctctgcctcc ccggaagaagatgaggaatctgaggattatcagaactcagcatccatccatcagtggcgc gagtccaggaaggtcatggggcaactccagagagaagcatcccctggcccggtgggaagc ccagacgaggaggacggggaaccggattacgtgaatggggaggtggcagccacagaagcc tag >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_5|361_aa MEVEAVCGGAGEVEAQDSDPAPAFSKAPGSAGHYELPWVEKYRPVKLNEIVGNEDTVSRL EVFAREGNVPNIIIAGPPGTGKTTSILCLARALLGPALKDAMLELNASNDRGIDVVRNKI KMFAQQKVTLPKGRHKIIILDEADSMTDGAQQALRRTMEIYSKTTRFALACNASDKIIEP IQSRCAVLRYTKLTDAQILTRLMNVIEKERVPYTDDGLEAIIFTAQGDMRQALNNLQSTF SGFGFINSENVFKVCDEPHPLLVKEMIQHCVNANIDEAYKILAHLWHLGYSPEDIIGNIF RVCKTFQMAEYLKLEFIKVGNWIHSHENSGRSELSFADGRPPGKAVSEDNGPGGQLEQRL H >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_5|1086_bp atggaggtggaggccgtctgtggtggcgcgggcgaggtggaggcccaggactctgaccct gcccctgccttcagcaaggcccccggcagcgccggccactacgaactgccgtgggttgaa aaatataggccagtaaagctgaatgaaattgtcgggaatgaagacaccgtgagcaggcta gaggtctttgcaagggaaggaaatgtgcccaacatcatcattgcgggccctccaggaacc ggcaagaccacaagcattctgtgcttggcccgggccctgctgggcccagcactcaaagat gccatgttggaactcaatgcttcaaatgacaggggcattgacgttgtgaggaataaaatt aaaatgtttgctcaacaaaaagtcactcttcccaaaggccgacataagatcatcattctg gatgaagcagacagcatgaccgacggagcccagcaagccttgaggagaaccatggaaatc tactctaaaaccactcgcttcgcccttgcttgtaatgcttcggataagatcatcgagccc attcagtcccgctgtgcagtcctccggtacacaaagctgaccgacgcccagatcctcacc aggctgatgaatgttatcgagaaggagagggtaccctacactgatgacggcctagaagcc atcatcttcacggcccagggagacatgaggcaggcgctgaacaacctgcagtccaccttc tcaggatttggcttcattaacagtgagaacgtgttcaaggtctgtgacgagccccaccca ctgctggtaaaggagatgatccagcactgtgtgaatgccaacattgacgaagcctacaag attcttgctcacttgtggcatctgggctactcaccagaagatatcattggcaacatcttt cgagtgtgtaaaactttccaaatggcagaatacctgaaactggagtttatcaaggtcgga aattggatacactcacatgaaaatagcggaaggagtgaactctcttttgcagatggcagg cctcctggcaaggctgtgtcagaagacaatggccccggtggccagttagagcagagactt cactga >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_6|208_aa MHVGEDVEERERVYTVAALRGSLELTPGFRSCTDASLPLHPRPKLTEDTDSKEKAASQEP RYLCQKGDPLLISKPQLSFVGRPGFAHIHLRAGHLVVLTSRLVKRILQPCYACKDVSGHW EGRQLGKYYSEQQPAVNRPYLAAAQADSCTRNFHGTVSHLQTSSVTIRVGGQQIWGLGSA AFPNAASCDSILPLWQSNESVEDLQTRS >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_6|627_bp atgcatgttggagaggatgtggaggaaagggaacgcgtatacactgttgctgcccttaga ggctctcttgagcttacaccaggattccgcagctgtacagatgcatctcttcccctgcac cccaggcccaagctgactgaggacacagactccaaggagaaagcagcaagccaggagccc agatacctgtgtcagaaaggcgacccactgctgatcagcaagccgcagctgtcctttgtg gggcgtccagggttcgcccacatccacctccgcgctgggcacctggtggtgcttacatcg cgtttggtgaaacgcatcctccagccatgctatgcttgcaaagatgtctctggtcattgg gaaggacggcagcttgggaagtattattcagagcaacagcctgcggtcaacagaccttac ctggccgctgctcaagctgactcatgcaccaggaatttccacgggactgtgtcacacttg caaacgtcatctgtgactatccgggtgggaggacagcagatctggggactgggctctgca gccttcccaaatgcagcttcctgtgattccattcttccactttggcagtcgaatgagtca gtggaagatttgcaaacacggagctaa >gi568815591f:74115976_74324739|GENSCAN_predicted_peptide_7|343_aa MGFRHVAQAGLELLTSDWSQSGLCSWGSSRRTLIPCSQSPHPPSLVCHVPSLELAATAAE DEADPVDVRPMGLECFPPRPGGFRPLGFCMAVPLGRLAVLRAEENKLPPGTQQWSRHAGV RPREHWACLPALALRAAKSNIPTPLVPREFWGPGVNHAGVQGCSLRPQNLALLVACSMRE PDCELWAQLSESLVRLGQHFQCQQRPGRADGRVPGCLYCCLHPYLVSLLCQACGELQPQL CRQHTQLLASSRGTREPQEEWGGDKSRAALRAPIGLAAIQIVGTADSQRGDRALPQHHSR SRAGTQAPQAHVASNRREGDETGKTTVDSVDQVYTVQGSGLLG >gi568815591f:74115976_74324739|GENSCAN_predicted_CDS_7|1032_bp atggggtttcgccatgttgcccaggctggtctcgaactcctgacctcagattggtcacag agtggcctgtgctcctggggctcttccagaagaacactcatcccatgtagccagtcccct caccccccatccttggtgtgccacgtaccttccttggagctagcggccaccgccgccgag gatgaagctgacccagtagatgtccggcccatggggctggagtgcttccccccacggccg gggggcttcaggccgctgggcttctgcatggcggtgccactgggcagactggcagtttta agagcagaggaaaacaagctgcccccaggcacacagcagtggagccgacatgccggggtg cgcccacgagagcactgggcatgcctgccagccctggccctcagggctgctaaatccaac attccaacgcccctggtgccccgggaattctgggggccgggagtcaatcatgctggggta caaggctgctcccttcgtcctcagaacctggccctcctggtggcttgttcaatgagggag ccggactgtgagctttgggcacagctctccgagagcttggttaggctgggtcagcatttc caatgccagcaaaggcctgggagagctgatgggagggtgcccggctgtctgtactgctgt ctccacccatacttggtcagtcttctctgccaggcctgcggggagctgcagccccagctg tgcaggcaacacacacagctgctggccagcagtagaggtacgagggagccccaggaagaa tggggtggggataaatcccgagctgccctgagagcacccattggcctggcggccatccag atagtggggacagctgacagccagcgtggggatagggccttgccccagcatcacagcaga tccagggctggaacccaggctccacaagcccacgtggcttcaaacaggagggagggagat gaaactggcaagaccacagtggatagtgtggaccaggtttacacagtgcagggctctggg ctcctgggctga