GENSCAN 1.0	Date run: 4-Nov-116	Time: 11:51:59

Sequence gi568815575r:52597991_52805749 : 207759 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    504    618  115  2  1   92   55   131 0.963  10.22
 1.02 Term +   1176   1279  104  1  2   73   37   114 0.916   3.24
 1.03 PlyA +   1514   1519    6                               1.05

 2.03 PlyA -   1799   1794    6                              -0.45
 2.02 Term -   3089   2560  530  0  2   52   33   174 0.393   2.72
 2.01 Init -   5757   5667   91  0  1   99  106    57 0.821   9.28
 2.00 Prom -   8373   8334   40                              -4.76

 3.00 Prom +   9027   9066   40                              -3.16
 3.01 Init +  12885  12949   65  2  2   79   80    30 0.086   1.92
 3.02 Intr +  13639  13926  288  1  0  118   52    89 0.025   4.66
 3.03 Intr +  26941  27076  136  1  1  109  100   115 0.385  15.37
 3.04 Intr +  27512  27626  115  1  1   97   60   110 0.999   9.12
 3.05 Intr +  28302  28397   96  1  0   98   96    75 0.970   9.28
 3.06 Intr +  30177  30307  131  1  2   55  113    17 0.617   1.31
 3.07 Term +  32281  32397  117  0  0   32   54   132 0.941   2.74
 3.08 PlyA +  32672  32677    6                               1.05

 4.03 PlyA -  32983  32978    6                              -1.75
 4.02 Term -  33135  33029  107  1  2   71   49   125 0.810   5.47
 4.01 Init -  35434  35344   91  1  1  103  106    31 0.716   6.21
 4.00 Prom -  42223  42184   40                              -4.96

 5.11 PlyA -  42631  42626    6                               1.05
 5.10 Term -  43095  43065   31  2  1   93   47    32 0.161  -3.07
 5.09 Intr -  46680  46591   90  2  0   95   57   118 0.347   8.51
 5.08 Intr -  47553  47457   97  1  1   92   -1   130 0.555   3.57
 5.07 Intr -  50406  50271  136  0  1   32  111   120 0.357   8.84
 5.06 Intr -  52398  52301   98  1  2   34   99    33 0.397  -1.27
 5.05 Intr -  54357  54262   96  1  0   87   96    41 0.496   4.78
 5.04 Intr -  54994  54880  115  1  1   97   60   133 0.917  11.42
 5.03 Intr -  55502  55414   89  0  2   79  100   107 0.916  10.69
 5.02 Intr -  58541  58373  169  2  1   90  110    93 0.330  11.22
 5.01 Init -  68622  68575   48  0  0   82   60    36 0.291   1.15
 5.00 Prom -  74202  74163   40                              -3.46

 6.03 PlyA -  74403  74398    6                               1.05
 6.02 Term -  77295  77189  107  1  2   58   49   104 0.776   2.07
 6.01 Init -  79916  79826   91  2  1   92  106    70 0.852   9.94
 6.00 Prom -  86979  86940   40                              -7.36

 7.11 PlyA -  88424  88419    6                               1.05
 7.10 Term -  89548  89368  181  0  1  -36   55   200 0.011   1.38
 7.09 Intr -  99528  99439   90  2  0   95   57    82 0.123   4.91
 7.08 Intr - 100098 100002   97  1  1   92   -1   105 0.210   1.07
 7.07 Intr - 102588 102453  136  0  1   29   99   157 0.597  11.04
 7.06 Intr - 104626 104529   98  2  2   37   99   104 0.941   6.13
 7.05 Intr - 106602 106507   96  1  0   94   96    71 0.993   8.48
 7.04 Intr - 107252 107138  115  2  1  133   60   145 0.999  16.22
 7.03 Intr - 107826 107691  136  2  1  123  100   139 0.953  19.17
 7.02 Intr - 111224 111052  173  2  2   77   37    84 0.721   0.94
 7.01 Init - 114153 114040  114  0  0   91   93    78 0.955   8.81
 7.00 Prom - 130809 130770   40                              -2.26

 8.00 Prom + 131637 131676   40                              -2.26
 8.01 Init + 148293 148406  114  2  0   91   93    78 0.955   8.81
 8.02 Intr + 151226 151398  173  1  2   77   37    84 0.721   0.94
 8.03 Intr + 154624 154759  136  1  1  123  100   139 0.953  19.17
 8.04 Intr + 155198 155312  115  1  1  133   60   145 0.999  16.22
 8.05 Intr + 155848 155943   96  2  0   94   96    71 0.993   8.48
 8.06 Intr + 157824 157921   98  1  2   37   99   104 0.940   6.13
 8.07 Intr + 159864 159999  136  2  1   29   99   157 0.597  11.04
 8.08 Term + 162354 162454  101  1  2  100   35   125 0.775   6.69
 8.09 PlyA + 163526 163531    6                               1.05

 9.08 PlyA - 163659 163654    6                               1.05
 9.07 Term - 168453 168357   97  2  1   96   43    95 0.094   3.24
 9.06 Intr - 182614 182525   90  0  0   95   57   109 0.069   7.61
 9.05 Intr - 191028 190913  116  0  2   67   75    20 0.261  -2.15
 9.04 Intr - 193065 192852  214  2  1   97   53   140 0.881  10.12
 9.03 Intr - 197951 197879   73  0  1   87   61     6 0.291  -3.74
 9.02 Intr - 198478 198248  231  2  0   19   72   316 0.538  20.84
 9.01 Init - 207361 207349   13  1  1   80   96     4 0.313   0.75

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 182548 182638 91 0 1 92 106 70 0.822 9.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_1|72_aa AFNDTAKYFSEKEWEKMKASEKIIHVYMKRKYEAMTKQGFTATIPTFMDDKGAADCQGND FDNDHNCGNQGE >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_1|219_bp gccttcaatgatactgccaaatacttctctgagaaagaatgggaaaagatgaaagcctca gagaaaatcatccatgtgtatatgaagagaaagtatgaggccatgactaaacagggtttc acggctaccatcccaactttcatggatgacaaaggggccgcagactgccaggggaatgat tttgataatgaccataactgcgggaatcagggtgagtag >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_2|206_aa MPTFVNGDHVLFLIMGMCHIPEAEARGDKETQNLLKLISNFSKVSGYKINVQKSQAFLYT NNRQTESQIMSELPFTIASNGIKYLGIQLTRDVKDLFKENYKPLLSEIKEDTNKWKNIPC SWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQK NKAGGITLPDFRSLIVIVGVERIWPV >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_2|621_bp atgcccacgttcgtgaatggtgaccatgttctgtttctcatcatgggcatgtgtcatatc cccgaggctgaggcaagaggagataaggaaacccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaatagcttcaaac ggaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaac tacaaaccactgctcagtgaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttataga ttcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactact ttaaagttcatatggaaccaaaaaagagcccgtatcgccaagtcaatcctaagccaaaag aacaaagctggaggcatcacgctacctgatttcagatctcttatagtgattgtgggagtt gaaagaatctggcctgtttag >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_3|315_aa MPQRQLDSSEQRPNGEKRAGSRNQVTPHCPLGHYYAPSRSPQLLYLLWYLSIIPSSLCTV VVSVHGTGTSLTCKRNRMRLSSHRTFGLVEGEISEGRPHSVTWNQVLHFSIRGLSVPETK GPVARKSQAVSLAGESAPGAMNGDDAFAKRPRDDDKASEKRSKAFNDIATYFSKKEWEKM KYSEKISYVYMKRNYEAMTKLGFNVTLPPFMCNKQATDFQGNYFDNDRNRRIQECPHATE VSLEFGNLYQQRKILMYSLPVERPQMTFGRLQRIIPKIMPKKPAEEGNDSKGVSEASGPQ NDGKQLRRPGKSKYF 
>gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_3|948_bp atgccacagagacagttggactcatcagaacagaggcctaatggagagaaacgtgcagga tccagaaaccaggtcaccccacactgtcccctgggccactactatgccccctccaggtca cctcagcttttgtatcttctctggtatttgagcatcatccctagctctctttgcacagtc gttgtctccgttcatggcaccgggactagtctgacctgcaagagaaacagaatgagactt tccagccacaggacctttggtcttgtggagggagaaatcagtgaaggccggccacactca gtcacctggaatcaggtgctgcatttctccatccggggcttatctgttcctgagaccaag ggtcctgtagctagaaagtctcaggctgtttctctcgcaggtgagagtgctcctggtgcc atgaacggagacgacgcctttgcaaagagacccagggatgatgataaagcatcagagaag agaagcaaggccttcaatgatattgccacatacttctctaagaaagagtgggaaaagatg aaatactcggagaaaatcagctatgtgtatatgaagagaaactatgaggccatgactaaa ctaggtttcaatgtcaccctcccacctttcatgtgtaataaacaggccacagacttccag gggaattattttgataatgaccgtaaccgcaggattcaggaatgccctcatgcgacagaa gtctctctagagtttggaaatctttaccaacaaagaaaaattctgatgtattctcttcca gttgaacgtcctcagatgacttttggcaggctccagagaatcatcccgaagatcatgccc aagaagccagcagaggaaggaaatgattcgaagggagtgtcagaagcatctggcccacag aacgatgggaaacagctgcgccgccccgggaaaagcaaatacttctga >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_4|65_aa MPMFVKGHHVLLLIMGMCHIPQAEARREKERTATATSPFSNHHLDQPAAINTEARPHTNK KSVTH >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_4|198_bp atgcccatgttcgtgaaaggtcaccatgttctgcttctcatcatgggcatgtgtcatatc ccccaggctgaggcgagaagagagaaggaaagaactgccacagccacttcacccttcagc aaccaccaccttgatcagccagcagccatcaacactgaggcaagaccccacaccaacaag aagagtgtgactcactga >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_5|322_aa MFLITLQVCVDVDAREHCRHRVSAPRKGLSEEDYIRSDIFMINSQPSTSSGQSYLYFSSN EQNQAGSTRWDEGQTAPGAMNGDDAFARRPRAGAQIPEKIQKSFDDIAKYFSKKEWEKMK SLEKISYVYMKRKYEAMTKLGFKATLPPFMHNTGATDLQGNDFDNDRNQGNQDDFLQAPE NLPEGECLSDLKDQRTFVPPWMQTLIMPKKPAEEGNDSKGVPEASGSQNDGKHLCPPGKP STSEKINKTSGPKRGKHAWTHRLRERKQLVIYEEISDPEEDDDLGDTTYAHDEKQNVVTF HEHGHGCGPLVIRNALIFLVDH >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_5|969_bp atgtttcttatcacacttcaagtctgtgttgatgttgatgccagagagcactgcagacat 
agagtatcagcccccagaaaaggcctttccgaggaggactatatcaggtctgacattttc atgatcaacagccagccatctaccagttctggccaatcctatctgtatttcagcagcaat gaacagaaccaagctgggagcacgagatgggatgagggtcagactgctcctggtgccatg aacggagacgacgcctttgcaaggagacctagggctggtgctcaaataccagagaagatc caaaagtccttcgatgatattgccaaatacttctctaagaaagagtgggaaaagatgaaa tccttggagaaaatcagctatgtgtatatgaagagaaagtatgaggccatgactaaacta ggcttcaaggccaccctcccacctttcatgcataatacaggggccacagacctccagggg aatgattttgataatgaccgtaaccaagggaatcaggatgactttttgcaggctccagag aatcttcccgaaggtgagtgtctctcagatctaaaggaccagagaacctttgtccctcca tggatgcaaacactgatcatgcccaagaagccagcagaggaaggaaatgattcgaaggga gtgccagaagcatctggctcacagaacgatgggaaacacctgtgccctccaggaaaacca agtacctctgagaagattaacaagacatccggacccaaaagggggaaacatgcctggacc cacagactgcgtgagagaaagcagctggtgatttatgaagagatcagcgaccctgaagaa gacgacgacctcggggatacgacatatgcccatgatgagaagcagaacgtggtgaccttt cacgaacatgggcatggctgcggacccctcgtcatcaggaatgccttaatatttctggtg gaccactga >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_6|65_aa MPVFMKGHHVLLLIMGMCRIPEAEARREKEKTATATPIFSNHHLDQPAAINTKARPSTSK KSVTH >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_6|198_bp atgcccgtgttcatgaaaggtcaccacgttctgcttctcatcatgggcatgtgtcgtatc cccgaggctgaggcaagaagagagaaggaaaaaactgccacagccactccaatcttcagc aaccaccaccttgatcagccagcagccattaacaccaaggcaagaccctccaccagcaaa aagagtgtgactcactga >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_7|411_aa MRYHYTLITIAKIKDIVTTPNGDKDAEKLDTSLSAAGKHCRHRVSTSRKGLSKEDYIRTD IFMINSQPSTSSGQSYLYFINNEQNQAGSMRWDEGKIKAAVAGKTQAISLAGQTAPGAMN GDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASEKIFYVYMKRKYEAMTKLG FKATLPPFMCNKRAEDFQGNDLDNDPNRGNQDDFRQAPGNLPEGECLSDLKDQRTFVPPR MRTLIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRGEHAWTH RLRERKQLVIYEEISDPEEDDDLRDTTHAHDEKQNVVTFHEHGHGCGPLVIRTGCLHEEP EAPSVFDHMMKKLDLTSDGQLDFQECLHLMDGMTVAYHDSFLKAAHSKKRI >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_7|1236_bp atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaagcactgc 
agacacagagtatcaacctccagaaagggcctttccaaggaggactatatcaggactgac attttcatgatcaacagccagccatctaccagttctggccaatcctatctgtatttcatc aacaatgaacagaaccaagctgggagcatgagatgggacgagggcaagatcaaagctgct gtggctggaaagactcaggctatttctcttgcaggtcagactgctcccggtgccatgaac ggagacgacgcctttgcaaggagacccacggttggtgctcaaataccagagaagatccaa aaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaagatgaaagcc tcggagaaaatcttctatgtgtatatgaagagaaagtatgaggctatgactaaactaggt ttcaaggccaccctcccacctttcatgtgtaataaacgggccgaagacttccaggggaat gatttggataatgaccctaaccgtgggaatcaggatgactttcggcaggctccagggaat ctccccgaaggtgagtgtctctcagatctaaaggaccagagaacctttgtccctccacgg atgcgaacactgatcatgcccaagaagccagcagaggaaggaaatgattcggaggaagtg ccagaagcatctggcccacaaaatgatgggaaagagctgtgccccccgggaaaaccaact acctctgagaagattcacgagagatctggacccaaaaggggggaacatgcctggacccac agactgcgtgagagaaaacagctggtgatttatgaagagatcagcgaccctgaggaagat gacgacctcagggatacgacacatgcccatgatgagaagcagaacgtggtgacctttcac gaacatgggcatggctgcggacccctcgtcatcagaactggatgccttcatgaagaacca gaggcccccagtgtctttgaccacatgatgaagaaactggacctcactagtgatgggcag ctggatttccaagaatgtctgcatctgatggatggcatgactgtggcttaccatgactct tttctcaaggctgcccattccaagaagcggatctga >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_8|322_aa MRYHYTLITIAKIKDIVTTPNGDKDAEKLDTSLSAAGKHCRHRVSTSRKGLSKEDYIRTD IFMINSQPSTSSGQSYLYFINNEQNQAGSMRWDEGKIKAAVAGKTQAISLAGQTAPGAMN GDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASEKIFYVYMKRKYEAMTKLG FKATLPPFMCNKRAEDFQGNDLDNDPNRGNQDDFRQAPGNLPEGECLSDLKDQRTFVPPR MRTLIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRGEHAWTH RLRERKQLVIYEEISDPEEDDE >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_8|969_bp atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaagcactgc agacacagagtatcaacctccagaaagggcctttccaaggaggactatatcaggactgac attttcatgatcaacagccagccatctaccagttctggccaatcctatctgtatttcatc aacaatgaacagaaccaagctgggagcatgagatgggacgagggcaagatcaaagctgct gtggctggaaagactcaggctatttctcttgcaggtcagactgctcccggtgccatgaac 
ggagacgacgcctttgcaaggagacccacggttggtgctcaaataccagagaagatccaa aaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaagatgaaagcc tcggagaaaatcttctatgtgtatatgaagagaaagtatgaggctatgactaaactaggt ttcaaggccaccctcccacctttcatgtgtaataaacgggccgaagacttccaggggaat gatttggataatgaccctaaccgtgggaatcaggatgactttcggcaggctccagggaat ctccccgaaggtgagtgtctctcagatctaaaggaccagagaacctttgtccctccacgg atgcgaacactgatcatgcccaagaagccagcagaggaaggaaatgattcggaggaagtg ccagaagcatctggcccacaaaatgatgggaaagagctgtgccccccgggaaaaccaact acctctgagaagattcacgagagatctggacccaaaaggggggaacatgcctggacccac agactgcgtgagagaaaacagctggtgatttatgaagagatcagcgaccctgaggaagat gacgagtaa >gi568815575r:52597991_52805749|GENSCAN_predicted_peptide_9|277_aa MGEEVQEEEDEGSSQEDEDLDSSAESSKQDEDLQLPEGSSQEDEDLGLSEGSSQEDEDLD SSEGSLMEEEDPDSSEGSSEEGSGDVYMFMQLYPAAHSSAECTYQRVYIARICLILPRSP DDDFICVSSHIHPNGPSQTLKPKIIQRAKAKMPNHLLQNPAPGLKCIVCIKIKTGGQSLA YEPLETLTGFLPYSVEGVYRWMILVVIAQLRGQRESLGDTTHAHDEKQNVVTFHEHGHGC GPLVIRALEPSLRNVNTEEDDVLVSLSPGEFSLDALL >gi568815575r:52597991_52805749|GENSCAN_predicted_CDS_9|834_bp atgggggaagaggtccaagaggaggaggacgaaggatcctcacaggaggacgaagaccta gactcatctgcagaatcttcaaagcaggatgaagacctacaattacctgaaggatcttca caggaggatgaagacctagggttatctgaaggatcttcacaagaggatgaagacctagac tcatctgaaggatctttgatggaggaagaagacccagactcatctgaaggatcgtcagag gagggaagtggagatgtgtatatgttcatgcaactgtacccggcagcacatagttctgct gaatgtacatatcaaagggtctacattgcacgcatctgccttattcttccacgttcccca gatgacgatttcatctgtgtctcctcccacatccacccaaatggaccgtcccagaccttg aaaccgaaaatcattcagagagcaaaggccaagatgcccaaccacctgctacagaatcct gctccaggactgaagtgtatagtctgtatcaaaataaaaactggaggacagtccctagcc tatgagcccctggagacccttacaggatttctgccttacagtgtggagggagtctacagg tggatgatattggttgtaatcgcccagctcagagggcagagggagagcctcggggatacg acacatgcccatgatgagaagcagaacgtggtgacctttcatgaacacgggcatggctgt ggacccctcgtcatcagggccttggagccatctcttcgaaatgtgaacactgaggaagat gatgtccttgtctccctgtcaccaggggagtttagcctagatgccttgctctaa