GENSCAN 1.0 Date run: 8-Nov-116 Time: 00:52:00 Sequence gi568815591r:151460522_151695456 : 234935 bp : 44.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 383 492 110 1 2 54 94 95 0.608 6.44 1.02 Term + 6021 6177 157 0 1 68 42 161 0.704 6.91 1.03 PlyA + 6235 6240 6 1.05 2.08 PlyA - 6308 6303 6 -3.24 2.07 Term - 6690 6598 93 0 0 127 37 90 0.694 5.63 2.06 Intr - 10131 10050 82 2 1 71 92 68 0.629 5.14 2.05 Intr - 11084 11028 57 1 0 81 98 22 0.381 0.50 2.04 Intr - 16894 16812 83 1 2 110 116 22 0.599 5.64 2.03 Intr - 24283 24216 68 2 2 110 108 30 0.882 5.82 2.02 Intr - 30493 30422 72 2 0 85 98 79 0.156 7.98 2.01 Init - 58990 58939 52 1 1 104 92 79 0.453 11.44 2.00 Prom - 70674 70635 40 -3.86 3.00 Prom + 73507 73546 40 -4.26 3.01 Init + 78864 79066 203 2 2 107 86 95 0.943 9.55 3.02 Intr + 82280 82384 105 2 0 57 37 113 0.619 2.43 3.03 Term + 92954 93065 112 2 1 103 54 77 0.353 3.83 3.04 PlyA + 95373 95378 6 1.05 4.20 PlyA - 95510 95505 6 1.05 4.19 Term - 96390 96206 185 1 2 56 33 101 0.633 -0.89 4.18 Intr - 99302 99201 102 0 0 107 78 8 0.696 1.85 4.17 Intr - 100096 100003 94 1 1 159 98 116 0.971 19.24 4.16 Intr - 103703 103557 147 2 0 91 78 135 0.719 13.23 4.15 Intr - 104862 104825 38 1 2 99 87 16 0.994 0.58 4.14 Intr - 105364 105199 166 1 1 57 72 211 0.984 15.93 4.13 Intr - 108321 108195 127 2 1 84 78 88 0.978 8.08 4.12 Intr - 109704 109650 55 1 1 62 92 37 0.646 -0.46 4.11 Intr - 115931 115850 82 2 1 112 87 41 0.484 5.61 4.10 Intr - 134933 134824 110 0 2 107 108 55 0.216 9.40 4.09 Intr - 144346 144225 122 0 2 16 41 120 0.001 0.24 4.08 Intr - 149211 149156 56 0 2 68 81 51 0.048 0.18 4.07 Intr - 152298 152165 134 1 2 97 105 31 0.687 6.06 4.06 Intr - 153681 153579 103 0 1 67 44 36 0.073 -3.15 4.05 Intr - 164839 164706 134 2 2 101 66 38 0.048 3.26 4.04 Intr - 171617 171362 256 2 1 144 38 288 0.639 26.42 4.03 Intr - 215116 214899 218 2 2 115 97 111 0.762 12.92 4.02 Intr - 215835 215783 53 2 2 75 54 -1 0.430 -6.25 4.01 Init - 217326 217271 56 0 2 71 72 67 0.640 4.16 4.00 Prom - 222037 221998 40 -5.46 5.00 Prom + 224909 224948 40 -6.36 5.01 Init + 227053 227141 89 0 2 83 57 31 0.179 -0.39 5.02 Intr + 227242 227335 94 1 1 67 90 70 0.810 5.07 5.03 Intr + 229105 229267 163 0 1 100 50 69 0.458 3.85 5.04 Term + 232907 232980 74 0 2 114 54 50 0.335 2.27 5.05 PlyA + 234811 234816 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 42645 42942 298 0 1 35 42 237 0.851 8.54 S.002 Term - 68469 68360 110 1 2 77 48 98 0.944 3.37 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:151460522_151695456|GENSCAN_predicted_peptide_1|88_aa MAALQDSGATHSTDCCWRYTIMGTREAAETHGDPDTRPLRRESTVSSHLSTPPPVATSSL VSGIQPGAFLKPSEKLRTRLLFFESHIK >gi568815591r:151460522_151695456|GENSCAN_predicted_CDS_1|267_bp atggcagcactgcaggattccggagccacccacagcaccgactgctgctggagatacaca atcatgggcactcgtgaagctgcagaaactcacggggaccctgacaccaggccactgcgc agggagtccacggtcagcagccacctcagcacacctcctcctgtggccacatcttcactg gtttctggaatccaacctggggccttcctgaagccaagtgagaaactcaggacaaggctt ctcttctttgaaagccacatcaagtag >gi568815591r:151460522_151695456|GENSCAN_predicted_peptide_2|168_aa MPQSKSRKIAILGYRSVGKSSLTIQFVEGQFVDSYDPTIENTFTKLITVNGQEYHLQLVD TAGQDEYSIFPQTYSIDINGYILVYSVTSIKSFEVIKVIHGKLLDMVGKVQVISYEEGKA LAESWNAAFLESSAKENQTAVDVFRRIILEAEKMDGAASQGKSSCSVM >gi568815591r:151460522_151695456|GENSCAN_predicted_CDS_2|507_bp atgccgcagtccaagtcccggaagatcgcgatcctgggctaccggtctgtggggaaatcc tcattgacgattcaatttgttgaaggccaatttgtggactcctacgatccaaccatagaa aacacttttacaaagttgatcacagtaaatggacaagaatatcatcttcaacttgtagac acagccgggcaagatgaatattctatctttcctcagacatactccatagatattaatggc tatattcttgtgtattctgttacatcaatcaaaagttttgaagtgattaaagttatccat ggcaaattgttggatatggtggggaaagtacaggtgatcagttatgaagaagggaaagct ttggcagaatcttggaatgcagcttttttggaatcttctgctaaagaaaatcagactgct gtggatgtttttcgaaggataattttggaggcagaaaaaatggacggggcagcttcacaa ggcaagtcttcatgctcggtgatgtga >gi568815591r:151460522_151695456|GENSCAN_predicted_peptide_3|139_aa MDAGITEEWSDGERPPPTAVLGVMAERAGEQGCIPNRVPVLSTFVCRRLAVQLPDRSLFL FSTTLSLRARVQEMYSEEAQMDSSLLWDQDTLKEFKVFKEEHSSGKSATSAGPGSGHFFP IGPKGTGEDGDSRKGRKLI >gi568815591r:151460522_151695456|GENSCAN_predicted_CDS_3|420_bp atggatgcggggatcacggaggagtggagtgatggggagcgaccaccgcccactgctgtt cttggggtgatggcagaaagagcaggagagcagggctgtatccccaacagggttcctgtt ctaagcacgtttgtgtgcaggaggctggctgtgcagctgcctgaccgttctctgtttctt ttcagtacaaccctgtcactaagggctagagtccaggagatgtactcagaagaggcacag atggattctagtctcctgtgggatcaggacacattgaaagaattcaaggtcttcaaagag gaacacagctcaggcaagtcggccacaagtgcagggcccggctccggccacttcttcccc attggcccaaagggcacaggtgaggacggggacagcaggaaaggaaggaagctgatctga >gi568815591r:151460522_151695456|GENSCAN_predicted_peptide_4|745_aa MWWKKIDGQKKESDHRKRNEVLLLGTYSGDVHVARFASGLSSSPSTPTQVTKQHTFPLES YKHEPERLENRIYASSSPPDTGQRFCPSSFQSPTRPPLASPTHYAPSKAAALAAALGPAE AGMLEKLEFEDEGECAPPAAGPALPRPGRPRTGTRGDPAPHLRRCPTASCVLRKPRGRGG NRGGDAGAGELGLRATLAFISFLKLPCSSRPARLYTDSSCSLNHSFLSFGHLLVFSKSQT QLETASPPLTGRVGVLAGALSIAVGTGHLLVGSSLGLVTASGYCTISWGSPCTCAFGNGP IDGHSLSIILEMGLCTMKTKQEFQDWTKRSVELEGGNASSELQSFQGHHNKEGLRNCRSQ EEPEEMLPTIMWNPGRDPVEDSESGVYMRFMRSHKCYDIVPTSSKLVVFDTTLQVKKAFF ALVANGVRAAPLWESKKQSFVELYLQETFKPLVNISPDASLFDAVYSLIKNKIHRLPVID PISGNALYILTHKRILKFLQLFMSDMPKPAFMKQNLDELGIGTYHNIAFIHPDTPIIKAL NIFVERRISALPVVDESGKVVDIYSKFDVINLAAEKTYNNLDITVTQALQHRSQYFEGVV KCNKLEILETIVDRIVRAEVHRLVVVNEADSIVGIISLSDILQALILTPAAPVKYEQPSL CQGHMDPMWEQTACPLNSQNPILNEFTGMEQVMWHKVSARYVQITVPYVRIQQYVTAAAG AHACETTPSLNVEVFEPFTKSVCFL >gi568815591r:151460522_151695456|GENSCAN_predicted_CDS_4|2238_bp atgtggtggaagaagattgatggacagaaaaaggaaagtgaccacagaaagcggaatgaa gtcttgctactgggaacttacagtggtgatgttcatgttgcacgttttgcctccggcctc tcctcctctccgtcaacacccacccaagtgaccaagcagcacacgtttcccctggaatcc tataagcacgagcctgaacggttagagaatcgcatctatgcctcgtcttcccccccggac acagggcagaggttctgcccgtcttccttccagagcccgaccaggcctccactggcatca ccgacacactatgctccctccaaagccgcggcgctggcggcggccctgggacccgcggaa gccggcatgctggagaagctggagttcgaggacgaaggtgagtgcgccccgcccgctgcc ggcccggcgctcccacggcccggccgcccgaggaccgggacccgcggggaccccgcgccc catctccggcgctgccccacggcgagctgcgtcctgcggaagcccaggggccggggcggg aaccggggcggggatgcgggcgccggcgagctcgggctgcgggctactctggccttcatc agcttcctcaaactgccttgctcctccaggcctgccagactttacacagacagctcctgc tccttgaatcattccttcctctcctttggccacctgctcgttttctccaagtctcagaca cagctggagacagcgtcacctcccctgacaggccgtgttggagtgctggctggggcgctc agcatagccgtgggaacggggcacctcctggtcggctccagtctagggctggtaacagct tcaggttattgcactatttcttggggttccccatgcacctgtgcctttgggaatggtccc atagacggtcactcgctgagcatcatcttagaaatggggctgtgcactatgaagactaaa caagagttccaggactggaccaaaagaagtgtggaacttgagggtgggaatgcctcatca gaacttcaaagcttccaaggccatcacaacaaggagggtctgagaaactgtcgcagccag gaggagcccgaggagatgctcccgaccatcatgtggaatcctggacgggatccagtagaa gactcagaaagtggtgtttacatgcgattcatgaggtcacacaagtgttatgacatcgtt ccaaccagttcaaagcttgttgtctttgatactacattacaagttaaaaaggccttcttt gctttggtagccaacggtgtccgagcagcgccactgtgggagagtaaaaaacaaagtttt gtagagctttatttacaagaaacatttaagcctttagtgaatatatctccagatgcaagc ctcttcgatgctgtatactccttgatcaaaaataaaatccacagattgcccgttattgac cctatcagtgggaatgcactttatatacttacccacaaaagaatcctcaagttcctccag ctttttatgtctgatatgccaaagcctgccttcatgaagcagaacctggatgagcttgga ataggaacgtaccacaacattgccttcatacatccagacactcccatcatcaaagccttg aacatatttgtggaaagacgaatatcagctctgcctgttgtggatgagtcaggaaaagtt gtagatatttattccaaatttgatgtaattaatcttgctgctgagaaaacatacaataac ctagatatcacggtgacccaggcccttcagcaccgttcacagtattttgaaggtgttgtg aagtgcaataagctggaaatactggagaccatcgtggacagaatagtaagagctgaggtc catcggctggtggtggtaaatgaagcagatagtattgtgggtattatttccctgtcggac attctgcaagccctgatcctcacaccagcagccccagtcaaatatgaacaacccagcctt tgtcagggccacatggacccgatgtgggagcaaacagcatgccctttgaactctcagaat ccaatattaaacgaattcactggtatggaacaggtgatgtggcataaggtgagtgcacgg tatgttcagatcacagtgccttatgtccgaatacagcaatatgtcaccgccgcagccggg gcgcacgcgtgtgaaacaacaccgagcttgaatgtggaagtctttgaaccttttaccaaa tcagtttgttttctttag >gi568815591r:151460522_151695456|GENSCAN_predicted_peptide_5|139_aa MRIIGGKNGHDLGQTSQNLIFGGGSAGSERRFLTVTSSLWNVGEKTVCMGDRKKHTVAPG EVSQEARWRLQPVADTPFLGCWLQPAAMGLQPGLSWGSVDSTHHLTPPRQALWVSGPSPP RAISSHARPTRGFYAMAAP >gi568815591r:151460522_151695456|GENSCAN_predicted_CDS_5|420_bp atgagaatcataggtggtaagaatgggcatgaccttggacaaacatctcagaacctcatt ttcggtggaggaagtgcaggctctgagaggaggtttctgacggtgacctcatccctgtgg aatgttggggagaagaccgtttgtatgggcgacaggaaaaagcacacggtcgcaccaggc gaggtctcccaggaggccaggtggcggctgcagccggtggcagacactcccttcctgggg tgctggctgcagccggctgccatgggactccaaccagggctctcctggggttctgttgac tcaacacatcatctgacgccaccccggcaggccctctgggtgtcaggcccatcacctccc cgggccatcagcagccacgctaggcccacgagagggttctatgcaatggctgcaccttga