GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:22:56 Sequence gi568815576r:23473269_23680190 : 206922 bp : 50.14% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1183 1178 6 1.05 1.02 Term - 4565 4467 99 2 0 93 49 87 0.606 3.43 1.01 Init - 7184 7149 36 2 0 104 79 40 0.657 4.71 1.00 Prom - 7362 7323 40 -6.66 2.00 Prom + 10154 10193 40 -5.16 2.01 Init + 11148 11157 10 2 1 94 93 13 0.245 2.31 2.02 Intr + 27907 27995 89 2 2 37 55 144 0.116 5.69 2.03 Intr + 40268 40531 264 1 0 11 91 101 0.271 0.41 2.04 Term + 41376 41570 195 2 0 102 53 85 0.534 3.81 2.05 PlyA + 42399 42404 6 1.05 3.00 Prom + 44424 44463 40 -4.26 3.01 Init + 47771 47830 60 1 0 73 42 120 0.523 5.15 3.02 Intr + 48622 48710 89 0 2 57 48 64 0.068 -1.93 3.03 Intr + 56523 56624 102 0 0 78 41 107 0.481 4.29 3.04 Intr + 57216 57267 52 0 1 71 80 37 0.570 0.11 3.05 Intr + 59248 59366 119 0 2 63 94 53 0.155 2.66 3.06 Intr + 65493 65570 78 0 0 93 103 73 0.500 7.97 3.07 Intr + 65665 67208 1544 1 2 30 27 417 0.377 18.86 3.08 Term + 92549 92976 428 0 2 145 48 118 0.329 8.97 3.09 PlyA + 95071 95076 6 -0.45 4.28 PlyA - 99884 99879 6 1.05 4.27 Term - 100317 99998 320 1 2 129 39 460 0.981 40.24 4.26 Intr - 101801 101699 103 2 1 67 97 4 0.034 -1.05 4.25 Intr - 106860 106717 144 0 0 21 110 109 0.046 6.88 4.24 Intr - 116900 116703 198 2 0 34 46 170 0.077 6.95 4.23 Intr - 133875 133748 128 1 2 98 5 114 0.000 4.50 4.22 Intr - 140062 140021 42 2 0 91 91 57 0.020 4.51 4.21 Intr - 140392 140371 22 1 1 82 68 27 0.924 -2.78 4.20 Intr - 140946 140867 80 1 2 103 76 96 0.948 9.17 4.19 Intr - 143606 143585 22 2 1 92 75 -2 0.571 -3.98 4.18 Intr - 144389 144307 83 0 2 77 110 152 0.998 15.66 4.17 Intr - 146125 146096 30 2 0 103 70 34 0.686 1.20 4.16 Intr - 147347 147326 22 2 1 108 75 6 0.942 -1.58 4.15 Intr - 148908 148823 86 1 2 90 110 167 0.998 18.64 4.14 Intr - 151636 151615 22 1 1 102 83 5 0.845 -1.48 4.13 Intr - 152780 152713 68 0 2 108 116 85 0.999 11.92 4.12 Intr - 158705 158548 158 1 2 49 98 163 0.149 13.05 4.11 Intr - 165523 165384 140 1 2 11 26 212 0.133 6.56 4.10 Intr - 165674 165598 77 0 2 129 70 83 0.977 9.83 4.09 Intr - 167142 167040 103 0 1 61 84 -18 0.217 -5.15 4.08 Intr - 176085 175974 112 2 1 96 80 182 0.189 18.58 4.07 Intr - 176265 176158 108 2 0 55 80 82 0.661 3.50 4.06 Intr - 176438 176418 21 1 0 142 105 3 0.900 4.06 4.05 Intr - 179308 179211 98 1 2 97 59 57 0.956 2.51 4.04 Intr - 179676 179593 84 0 0 73 102 162 0.999 16.12 4.03 Intr - 180717 180653 65 1 2 88 57 -26 0.049 -7.26 4.02 Intr - 180879 180769 111 1 0 90 90 27 0.063 3.45 4.01 Init - 184712 184421 292 2 1 84 -22 459 0.032 31.81 4.00 Prom - 186934 186895 40 -9.46 5.00 Prom + 187496 187535 40 -9.06 5.01 Init + 190746 191131 386 2 2 88 44 295 0.320 21.41 5.02 Intr + 193175 194626 1452 2 0 9 8 553 0.038 28.19 5.03 Term + 199858 200011 154 1 1 53 54 127 0.122 3.19 5.04 PlyA + 206348 206353 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 68999 69120 122 0 2 81 80 67 0.872 5.41 S.002 Term + 134953 135094 142 1 1 55 43 277 0.931 17.30 S.003 Term - 140062 139944 119 2 2 91 55 75 0.894 3.20 S.004 Init - 158755 158548 208 1 1 87 98 139 0.841 13.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:23473269_23680190|GENSCAN_predicted_peptide_1|44_aa MASGERRYSHKKAPSKIPPFRFFHKFLIEIRAQNYDQYDQSYTH >gi568815576r:23473269_23680190|GENSCAN_predicted_CDS_1|135_bp atggcttctggagaaagaagatacagccacaagaaggctccctcaaagatcccacccttc aggttcttccacaagtttcttattgaaatccgagcacaaaactatgaccaatatgaccaa tcctatacccactga >gi568815576r:23473269_23680190|GENSCAN_predicted_peptide_2|185_aa MAPEVNVAADNWEIWQLIKSALKILAAENGSCWGERVSRALDNRQSLEAAWRGKGIPDGG TAEAKAQWHERPRSVEGTESSLVLGRHQRVELEPMRKDRPTRVQVAKTFYALLCQLQPAV VVSALQASQDSGFQRPQNTSILQARTEHMEKLRPKQGQWQSSWKPSSLVLLLQLQLLVGR PPGIC >gi568815576r:23473269_23680190|GENSCAN_predicted_CDS_2|558_bp atggcccctgaggttaacgttgcagctgacaattgggagatctggcagctgatcaaatcg gctctgaagatcctggcagctgaaaatgggagctgctggggtgagcgggtttccagggca ctcgacaaccggcagagtctggaagcagcctggaggggaaagggcattcctgacgggggc acagcagaagcaaaggcccagtggcatgaaagaccaaggagtgttgagggaacagagagc agcttggtgttggggcggcatcagcgggtggagctggagccgatgagaaaggataggcca actagggttcaagtggccaagaccttctatgccctgctctgccagttacagccagcagtg gtggtctcagcattgcaggcatcacaagacagcggattccagaggccccaaaacacttcc atcctccaagcacgcactgagcatatggagaagctgaggcccaagcagggccaatggcag agcagctggaagcccagctctcttgtgctcttgctgcagctccagctcctggtcggccgt ccaccaggcatctgctga >gi568815576r:23473269_23680190|GENSCAN_predicted_peptide_3|823_aa MGRLRIPPRSAGQARRPVPLVPLRMRRSGRRRPRQTARRHSGPVSVATGWLAINEQVSIC PLSTSRKGQVELVFSGHKPQLPHRKLGNRAPELWALEQKLQLGLCDLGQRTLPLWVSEPR EGIKPGACVKAQRGETWVLFRVKVLYSVVIEDKLTRESQVRPRSALRRGGVRASTENLIT LFQTIEQFCPWFPEQGTLDLKDWEKIGKELKQANREGKIIPLTVWNDWAIIKATLEPFQT GEDIVSVSDAPKSCVTDCEEEAGTESQQGTESSHCKYVAESVMAQSTQNVDYSQLQEIIY PESSKLGEGGPESLGPSEPKPRSPSTPPPVVQMPVTLQPQTQVRQAQTPRENQVERDRVS IPAMPTQIQYPQYQPVENKTQPLVVYQYRLPTELQYRPPSEVQYRPQAVCPVPNSTAPYQ QPTAMASNSPATQDAALYPQPPTVRLNPTASRSGQGGALHAVIDEARKQGDLEAWRFLVI LQLVQAGEETQVGAPARAETRCEPFTMKMLKDIKEGVKQYGSNSPYIRTLLDSIAHGNRL TPYDWEILAKSSLSSSQYLQFKTWWIDGVQEQVRKNQATKPTVNIDADQLLGTGPNWSTI NQQSVMQNEAIEQVRAICLRAWGKIQDPGTAFPINSIRQGSKEPYPDFVARLQDAAQKSI TDDNARKVIVELMAYENANPECTAPTTTPPPPAGLWGPPGPPAYCVSLDTASPLAELLSS FVKRGYDPSLRVTGKIGGINANYSPAASRQTHFSGRLSCCLLKEVDEATTTRFTDEETEA QGDLETIPGSWAPNPGLRAQGWLPPPAVVPPPPGQGPVVPDAR >gi568815576r:23473269_23680190|GENSCAN_predicted_CDS_3|2472_bp atggggcggctccggatcccgcctcgctcggcaggccaggcccggagacctgtccccctg gtcccgctgcgaatgcgcaggtcggggcgccggaggccgcggcagacagcgcggcgccac tccggcccggtctccgtggcaacgggctggctggctatcaatgagcaggtcagcatctgc cctctgtccacgtctagaaagggccaggtggagctggtgttctcaggccacaagccccag ctgccccacagaaagctggggaacagggcccctgagctctgggccctggagcagaaactt cagctggggctgtgtgaccttgggcagaggacgctgcctctctgggtctctgagccccgg gaaggtatcaagcctggtgcctgcgtgaaggctcagcgaggtgagacctgggtcctgttc agggtgaaggtactctacagtgtggtcattgaggacaagttgacgagagagtcccaagta cgtccacggtcagccttgcgaagagggggagttagagcttctacagaaaatctaattacg ctatttcaaacaatagaacaattctgcccatggtttccagaacagggaactttagatcta aaagattgggaaaaaattggcaaagaattaaaacaagcaaatagggaaggtaaaatcatc ccacttacagtatggaatgattgggccattattaaagcaactttagaaccatttcaaaca ggagaagatattgtttcagtttctgatgcccctaaaagctgtgtaacagattgtgaagaa gaggcagggacagaatcccagcaaggaacggaaagttcacattgtaaatatgtagcagag tctgtaatggctcagtcaacgcaaaatgttgactacagtcaattacaggagataatatac cctgaatcatcaaaattgggggaaggaggtccagaatcattggggccatcagagcctaaa ccacgatcgccatcaactcctcctcccgtggttcagatgcctgtaacattacaacctcaa acgcaggttagacaagcacaaaccccaagagaaaatcaagtagaaagggacagagtctct atcccggcaatgccaactcagatacagtatccacaatatcagccggtagaaaataagacc caaccgctggtagtttatcaataccggctgccaaccgagcttcagtatcggcctccttca gaggttcaatacagacctcaagcggtgtgtcctgtgccaaatagcacggcaccataccag caacccacagcgatggcgtctaattcaccagcaacacaggacgcggcgctgtatcctcag ccgcccactgtgagacttaatcctacagcatcacgtagtggacagggtggtgcactgcat gcagtcattgatgaagccagaaaacagggcgatcttgaggcatggcggttcctggtaatt ttacaactggtacaggccggggaagagactcaagtaggagcgcctgcccgagctgagact agatgtgaacctttcaccatgaaaatgttaaaagatataaaggaaggagttaaacaatat ggatccaactccccttatataagaacattattagattccattgctcatggaaatagactt actccttatgactgggaaattttggccaaatcttccctttcatcctctcagtatctacag tttaaaacctggtggattgatggagtacaagaacaggtacgaaaaaatcaggctactaag cccactgttaatatagacgcagaccaattgttaggaacaggtccaaattggagcaccatt aaccaacaatcagtgatgcagaatgaggctattgaacaagtaagggctatttgcctcagg gcctggggaaaaattcaggacccaggaacagctttccctattaattcaattagacaaggc tctaaagagccatatcctgactttgtggcaagattacaagatgctgctcaaaagtctatt acagatgacaatgcccgaaaagttattgtagaattaatggcctatgaaaatgcaaatcca gaatgcaccgcccccaccacaactccgccacctccagccgggctctggggtcctcctggg ccacctgcttactgtgtgtccctggacacggcgtctcccctcgctgagttactttcctca tttgtaaaacggggatatgacccgtctcttagagtgactgggaagatcggtggtatcaac gcaaactacagcccggcagcttccaggcaaactcacttctcaggacgcctgtcctgctgt cttctcaaagaggtggatgaagccaccaccacccggtttacagatgaggaaactgaggct cagggagatctggaaaccattcccggcagctgggctccgaacccaggcctgcgagcacaa ggctggctcccgcccccggctgtggtccctcctcctccggggcagggtccggtcgtgcca gacgctcgatga >gi568815576r:23473269_23680190|GENSCAN_predicted_peptide_4|912_aa MVSRAYEEEEEAEEGKKKEEEKETEQKELEEEEVEEEEGKEMEQEEEGKEMEEEEQEENE MEEEEEEKEMEEEKETGGGKRNGGGRGGGGQRNSQAGPKVQDTRAKLRGRGGTGVKSMGS DFKLLGCSPGSPSPVGLYWCLPQRVVSSIRMAPGTLVVEEWAQGTFKLNPNDEDIHTANK CHLKVVTDLRLWMWQTCFTLSGLLWELIRTMGDWAEVEVEVTVRCHSTRPTRPPRKAMFM AETKGVALNQLSLRELQTISPLFSGDMSRMWEYGHSVEQYSALDGTEHSSVDWQICQRGS WTGARCWPWGFQSKHNSVKHVFGSGTQLTVLGQPKTTPSVILFLPSCEEPQANKATLSVE KTTPSKQSNNKYVASSYLSLTPEQWRSRRSYSCQVMQEGSTVEKGKDAPCYESDTDIYET VAAATSESTTVEPGKLDVGATEGQDLQHISNQKMPTGPPEDRLSLKFLPSSEEDNDDAKI LPSPVQGSSEDNLSLVCLPRSEDDDCDDDDDDDAQILPSRVQGGCYRFDSSSCSSEDNLS LVCLPRSEDDDCDDDDDDAQILPSPVQACSEDSLFLRCSLRHKDEEEEDDDDIHITARIE SDLTLESLSDEEIHPGRRQRVQDELPERSSLYSFHISERIRPLLMTSWAPAQQERALRPG FMALEINKQKVLPVHTALGSVALEALIPKRGNLRPIPQEAASLATVGSLCQRTIDIRGSS QTIRAALAPAAAGSGRGNPWPAAPNSCIAEQGPGPWSPWRKQPVQPEEPVGQRGSWTGPR CWPRGFQSKHNSVTHVFGSGTQLTVLSQPKATPSVTLFPPSSEELQANKATLVCLMNDFY PGILTVTWKADGTPITQGVEMTTPSKQSNNKYAASSYLSLTPEQWRSRRSYSCQVMHEGS TVEKTVAPAECS >gi568815576r:23473269_23680190|GENSCAN_predicted_CDS_4|2739_bp atggtgtctcgtgcctacgaggaggaggaggaagcggaggaggggaaaaaaaaggaggag gaaaaagaaacggagcagaaggagttggaggaggaggaggtggaggaggaggagggaaaa gaaatggagcaggaggaggagggaaaagaaatggaagaggaggagcaggaggaaaatgaa atggaggaggaggaggaggaaaaagaaatggaggaggaaaaagaaacaggaggaggaaaa agaaacggaggaggaagaggaggaggaggacaaagaaacagccaggccggaccaaaagtt caggacaccagagccaagctcagaggccgtggtggaacaggggtgaagagcatgggttct gactttaaactgctgggctgcagccctggctctccctccccggtgggattgtattggtgc ctaccccagagagttgtgtcatcaattaggatggcacctggcaccttggtggttgaggag tgggcccagggcaccttcaaactcaaccccaatgatgaggacatccacacagccaacaag tgccacctgaaggtggtcacggacctcaggttgtggatgtggcagacctgcttcacgctc tcgggcctcctctgggagctcatcaggactatgggggattgggcagaggtggaggtggag gttacagtgagatgccattccactaggcccacgaggcctccaaggaaagccatgttcatg gccgagaccaagggggttgccctcaaccagctgtcactacgggagctgcagaccatcagc cccctgttctcaggcgacatgagccgcatgtgggaatacgggcacagcgtggagcagtac agtgccctggatggcactgagcactccagcgtcgactggcagatctgccagcgtggctcc tggactggcgccagatgctggccctgggggtttcaatccaagcataattcagtgaagcat gtgtttggcagtgggacccagctcactgttttaggtcagcccaagactaccccgtcggtc attctgttcctgccgtcctgtgaggagccccaagccaacaaggccacactgagcgtggag aagaccacgccctccaaacagagcaacaacaagtacgtggccagcagctacctgagcctg acgcccgagcagtggaggtcccgcagaagctacagctgccaggttatgcaagaagggagc accgtggagaaggggaaggacgcaccctgttatgaatctgatactgatatttatgagact gtggctgctgcaacatcagaatccactactgtagagcctggcaagctggatgtgggagcc acggagggccaagacctgcagcacatcagcaaccaaaagatgcccacaggtccccctgag gaccgcctgagtttaaaatttctgccatcaagtgaggaagacaatgatgatgccaagatt ttaccatcacctgtccagggttcttctgaggacaacctgagtttagtatgcctaccacga agtgaagatgatgactgtgatgatgatgatgatgatgatgcccagattttaccgtcacgt gtccagggtggctgttaccggtttgatagcagttcttgttcttctgaggacaacctgagt ttagtatgcctaccacgaagtgaagatgatgactgtgatgatgatgatgatgatgcccag attttaccgtcacctgtccaggcttgttctgaagatagcctgtttttaagatgctcactg agacacaaagatgaagaagaagaagatgatgatgacatccacataacagctcggatagaa agtgacttgacgctggagagtctaagtgatgaagagattcatccaggcaggagacagcgg gtccaggacgagctcccggaacgctcctccctctacagcttccacatcagtgagcggatc agacccttgctgatgacatcgtgggcccctgcacaacaggaaagggcactgaggccagga ttcatggctctggaaatcaacaaacagaaggtgcttcccgtccacacagctctgggctct gttgctttagaggctttaattcccaagagaggaaaccttcgaccaattccacaagaggct gcctccctggccactgtgggctccttatgccagagaacaatagacatccggggatcgagc caaactatcagggcagcgctggcccctgctgctgctgggtctggccgtggtaacccatgg cctgctgcgcccaacagctgcatcgcagagcagggccctgggccctggagcccctggagg aagcagccggtccagcctgaggagccggtggggcagcgcggctcctggactggccccagg tgctggccccgggggtttcaatccaagcataactcagtgacgcatgtgtttggcagcggg acccagctcaccgttttaagtcagcccaaggccaccccctcggtcactctgttcccgccg tcctctgaggagctccaagccaacaaggctacactggtgtgtctcatgaatgacttttat ccgggaatcttgacggtgacctggaaggcagatggtacccccatcacccagggcgtggag atgaccacgccctccaaacagagcaacaacaagtacgcggccagcagctacctgagcctg acgcccgagcagtggaggtcccgcagaagctacagctgccaggtcatgcacgaagggagc accgtggagaagacggtggcccctgcagaatgttcatag >gi568815576r:23473269_23680190|GENSCAN_predicted_peptide_5|663_aa MGKKQSRKTGNSKNQSASPPPKECSSSPAMEQSWTENDFDELREEGFRRSNYSELKEEVR TNGKEVKNLEKKLDEWLTRITNAEKSLKDLMEPKTTAQELRDECTSLSSRCDLLEEKVSV MEDEMNEMKPITSFEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSTEKE GILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIQHD QVGFIPGMQGWFNVRKSINVIQHINRTKDKNHMIISTDAEKAFDKIQQHFMLKTLNKLGI DGTYLKIIRAIYDKPTANITVNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL YTNNRQTESQITSELPFTIASKRIKYLGIQLTKDVKDLFKENYKPLLNEIKEDTNKWRNI PCSWIGRINIVKMAILPKVIYRFNAIPIKPPMTFFTELEKTTLKFIWNQKRTHIAKSILT KKNKARGITLPDFKLYYKATVIKTAWYWYQNRDIDQWNRPEPSEIMPHTYNYLIFDKPDK NKQWGKDSLFNKCPQPGVTIPGYENAQISVASPGLSTSLDWTLTTQTWSWNPSSTLSTHA VQC >gi568815576r:23473269_23680190|GENSCAN_predicted_CDS_5|1992_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagtgcctctcctcct ccaaaggaatgcagctcctcaccagcaatggaacaaagctggacagagaatgactttgac gagttgagagaagaaggcttcagaagatcaaactactcggagctaaaggaggaagttcga accaatggcaaagaagttaaaaaccttgaaaaaaaattagacgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagccgaaaaccacggcacaagaacta cgtgacgaatgcacaagcctcagtagccgatgcgatctactggaagaaaaggtatcagtg atggaagatgaaatgaatgaaatgaaaccaataacaagctttgaaattgaggcaataatt aatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaattctaccag aggtacaaggaggagcttgtaccattccttctgaaactattccaatcaacagaaaaagaa ggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcctggcaga gacacaacaaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggctaaccgaatccagcagcacatcaaaaagcttatccagcatgat caagtgggcttcatccctgggatgcaaggctggttcaacgtacgaaaatcaataaacgta atccagcatataaacagaaccaaagacaaaaaccacatgattatctcaacagatgcagaa aaggcctttgacaaaattcaacaacacttcatgctaaaaactctcaataaattaggtatt gatgggacatatctcaaaataataagagcaatctatgacaaacccacagccaatatcaca gtgaatggacaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctcgccagggcaatcaggcaggag aaggaaataaagggcattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattctta tacaccaataacagacaaacagagagccaaatcacgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttacaaaggatgtgaaggacctcttcaag gagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaggaacatt ccatgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaagccaccaatgactttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagaacccacattgccaagtcaatcctaacc aaaaagaacaaagctagaggcatcacgctacctgacttcaaactatactacaaggctaca gtaatcaaaacagcatggtactggtaccaaaacagagatatagaccaatggaacagacca gagccctcagaaataatgccgcatacctacaactatctgatctttgacaaacctgacaaa aacaagcaatggggaaaggattccctatttaataaatgcccccagcctggagtcactatc cctgggtacgagaacgcacagatctctgtggcttctccaggcctgtctacatccctggac tggacgcttactacccagacctggagttggaatccatcttccacgctgtctacccatgcg gtgcagtgttga