GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:17:55 Sequence gi568815586f:120587297_120796542 : 209246 bp : 48.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 2700 2695 6 1.05 1.10 Term - 7032 6716 317 1 2 16 32 299 0.895 12.20 1.09 Intr - 23610 23382 229 0 1 18 81 199 0.641 9.64 1.08 Intr - 25185 25048 138 0 0 48 111 13 0.429 0.26 1.07 Intr - 25362 25218 145 2 1 44 93 52 0.010 1.58 1.06 Intr - 35194 34990 205 2 1 97 70 99 0.365 7.26 1.05 Intr - 38646 38565 82 0 1 73 81 22 0.012 -0.79 1.04 Intr - 44048 43899 150 2 0 -18 80 120 0.061 0.96 1.03 Intr - 52681 52658 24 1 0 125 60 35 0.167 2.62 1.02 Intr - 53382 53275 108 0 0 48 86 153 0.364 11.58 1.01 Init - 54021 53518 504 0 0 78 50 430 0.356 31.19 1.00 Prom - 59896 59857 40 -3.56 2.00 Prom + 61207 61246 40 -2.86 2.01 Init + 68824 68976 153 0 0 62 57 242 0.213 17.39 2.02 Intr + 71672 71929 258 1 0 72 81 116 0.800 6.96 2.03 Intr + 72582 72612 31 2 1 93 93 17 0.863 0.40 2.04 Intr + 72900 73043 144 1 0 94 84 337 0.999 34.15 2.05 Intr + 73435 73544 110 2 2 88 82 162 0.999 15.60 2.06 Intr + 73775 73922 148 1 1 103 100 217 0.983 24.21 2.07 Term + 79579 79604 26 2 2 93 54 28 0.082 -1.71 2.08 PlyA + 80006 80011 6 1.05 3.00 Prom + 80631 80670 40 -3.56 3.01 Init + 100001 100412 412 1 1 103 77 581 0.935 53.18 3.02 Intr + 106795 106973 179 2 2 117 100 302 0.997 33.94 3.03 Intr + 107528 107704 177 1 0 50 86 169 0.999 13.02 3.04 Intr + 107799 107856 58 2 1 106 78 45 0.994 3.76 3.05 Term + 109020 109249 230 1 2 66 54 314 0.979 22.59 3.06 PlyA + 114543 114548 6 1.05 4.00 Prom + 117999 118038 40 -4.66 4.01 Init + 123179 123422 244 1 1 82 70 601 0.991 53.60 4.02 Intr + 125978 126091 114 0 0 96 96 93 0.999 11.32 4.03 Intr + 129332 129443 112 0 1 77 83 171 0.999 14.94 4.04 Intr + 129574 129746 173 1 2 0 36 214 0.973 6.99 4.05 Intr + 132624 132732 109 1 1 66 100 166 0.976 14.94 4.06 Intr + 138529 138635 107 1 2 76 105 46 0.096 4.96 4.07 Intr + 139730 139893 164 0 2 87 94 127 0.812 12.89 4.08 Intr + 149690 149839 150 1 0 98 66 315 0.755 30.66 4.09 Intr + 150060 150171 112 2 1 100 82 113 0.999 11.85 4.10 Intr + 150541 150692 152 2 2 71 86 128 0.999 10.78 4.11 Intr + 150984 151154 171 2 0 98 89 203 0.998 21.54 4.12 Intr + 151237 151374 138 0 0 28 95 301 0.993 25.46 4.13 Intr + 151524 151619 96 2 0 25 44 160 0.814 5.51 4.14 Intr + 151844 151900 57 1 0 28 109 112 0.977 6.38 4.15 Intr + 152000 152113 114 1 0 94 18 337 0.264 27.94 4.16 Intr + 152984 153262 279 1 0 40 53 183 0.232 7.67 4.17 Intr + 172642 172797 156 0 0 99 80 81 0.956 8.61 4.18 Term + 173893 174018 126 0 0 70 44 117 0.974 3.78 4.19 PlyA + 174791 174796 6 -0.45 5.09 PlyA - 175954 175949 6 1.05 5.08 Term - 177774 177703 72 0 0 71 45 52 0.213 -2.89 5.07 Intr - 179076 178967 110 1 2 128 100 -17 0.553 3.50 5.06 Intr - 180297 180098 200 2 2 56 82 229 0.958 18.09 5.05 Intr - 181192 181029 164 1 2 72 116 138 0.999 13.77 5.04 Intr - 181763 181657 107 0 2 121 94 122 0.368 16.03 5.03 Intr - 195471 195359 113 2 2 82 101 5 0.074 1.22 5.02 Intr - 196876 196816 61 2 1 62 75 55 0.254 -0.61 5.01 Intr - 204261 204173 89 2 2 104 55 99 0.547 7.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 138590 138635 46 1 1 71 105 120 0.880 10.90 S.002 Term + 141030 141080 51 2 0 68 43 114 0.881 2.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:120587297_120796542|GENSCAN_predicted_peptide_1|633_aa MGRSRWRNGSASDAAAGTESPRPRARREPRRPSAGDSPRSSPSRPRAGGSGDDGEGRDAA AGSASRGLQCAWGSTRWLPRAPSSRGAGRHRGLRELGSPDDCGVAGQEPGSRRLGSPGSR AGPWRGARLPPLAAAAAAFSSDVLDSLLAMCELIAAGPALAWPGGGGARSLYARSPELPA AGGADLGALPAVPGEPTPGSLAAQMLQKFLLSQSANSAVREQCGYICGTGSGAWPCTKKT YWYCTVKANWVQQRKLPLLHGQGDYSAGSLQSRPKIARAGNGAAVGLVGGARHSADDKLD SPRMTHRALDVHFVLTGESTAKRKEGQPRPSPWPPDTLHPDILLLTPLALQSRKKGPSCK VSAVVRGNAVWNTMTVDKALCESMDGSLGGSIACRIGKTVSRVSIPKRFNIVNLPPGSWL ITLRNAAISRAQYWSLLLANWALSSSCSQVSLHNVINVMDQYDILWKGKTIKISANQIMT QSQRVDIPLRQDSEGILLALPAEGKLLLVGLMDRNGDKGTCQINGCIPVQEAKAEEILEK GLEVWEYELRKNNFSDTGNFDFGIQEHINLGIKYDPSVGVYGLDFSAVLSRPGFGITDKK HRTGCTGAKHRISKEETMHWLQQKCDGIMLPCK >gi568815586f:120587297_120796542|GENSCAN_predicted_CDS_1|1902_bp atggggcgcagccggtggaggaacgggtccgcctcggacgcggcggctggaacggagtcc ccgcggccccgggctcgcagagagccccggaggccgagcgccggggacagtccccgttcc tccccatcccgcccgcgggccggcggcagcggcgacgacggcgaaggtcgggacgccgcg gccggcagcgcctcccggggcctgcagtgcgcctgggggagcacacgctggctcccccgc gcgccctcctctcgcggcgccggccggcacaggggcctccgcgaactggggtccccggac gactgcggcgtcgccgggcaggagccgggtagccggcggctaggcagccccgggtcccgg gcagggccgtggcggggagcccggctgcccccgctcgccgccgccgccgccgccttcagc agcgacgtcttggactcgcttttggcgatgtgcgagctcatcgcggcggggcccgcgctc gcatggcccggcgggggcggcgcgcgcagcctctacgcccggagcccggagctcccggcg gctggcggtgcagatctgggggcgctgccggctgtccccggagagcccacccccggctcc ctggcggcccagatgctacagaaattcctgctctcgcagtcagccaacagtgcagtccgt gagcagtgcggttacatctgtggcactgggagtggtgcctggccctgcaccaagaagacc tactggtactgcacggtgaaagcaaactgggttcagcaaaggaagctcccattgctgcat ggacagggcgattattcagcaggctctctccagtcccgccccaaaattgcgagggcaggc aacggtgcagcagtggggctggtcggcggtgccagacactctgctgatgacaagctagac agccccagaatgacccacagggcccttgatgtgcattttgtgctcactggagagtccaca gcaaagagaaaggagggtcagccacggccttctccttggccgccagacaccctccacccc gacatcctgctccttactcccctcgccctacagtccaggaagaagggccccagctgcaaa gtgagtgccgtggtcagaggtaatgctgtgtggaataccatgacggtggataaggcactc tgtgagtccatggatggtagtcttggcggaagcattgcatgcaggataggcaaaaccgta tccagagtgtctattccgaagaggttcaatatagtcaacctgcctccaggtagctggctg atcaccctgaggaatgctgccatatcgagggctcagtattggtctctgctgctggcaaat tgggcactcagcagtagctgtagccaggtcagcttgcataatgtcatcaatgtaatggac cagtatgatatcttgtggaagggaaaaacgatcaagatctctgccaaccaaataatgaca caaagccagagagttgatatacccctgaggcaggacagtgaaggtatattgctggccttg ccagctgaaggcaaattgcttctcgtgggccttatggacaggaatggagacaaaggcact tgccaaatcaatggctgcataccagttcaagaggccaaagcagaagaaatcctggagaag ggtctagaggtatgggagtatgagttgagaaaaaataacttctcagatactggaaacttt gattttgggatccaggaacacatcaatctgggtatcaaatatgacccaagtgtgggtgtc tatggcctggacttctctgcggtgctgagtaggccgggtttcggcatcacagacaagaag cacaggacaggctgcactggggccaaacacagaatcagcaaagaggagaccatgcactgg ctccaacagaagtgtgatgggatcatgcttccttgcaaataa >gi568815586f:120587297_120796542|GENSCAN_predicted_peptide_2|289_aa MVVQTSEEGLAADAELPGPLLMLAQNCAVMHNLLGPACIFLRKGFAENRQPAQGNWQLTN LSFLQACSLEMSLPSVLTPRFRVALIAQETWALLEAGDMCFLDPGSSQKHWWPPGSYVVS CDPMTGERTGMGVKLEKDRSLRPEEIEELREAFREFDKDKDGYINCRDLGNCMRTMGYMP TEMELIELSQQINMNLGGHVDFDDFVELMGPKLLAETADMIGVKELRDAFREFDTNGDGE ISTSELREAMRKLLGHQVGHRDIEEIIRDVDLNGDGRVDFEEFVRMMSR >gi568815586f:120587297_120796542|GENSCAN_predicted_CDS_2|870_bp atggtggtgcagacgagcgaggaggggctggcggctgacgccgagctcccgggaccgctc ctgatgctggcccagaactgcgcagtcatgcacaacctgctgggccctgcctgcattttc ctgcgcaagggcttcgctgagaacaggcagcctgcccagggcaactggcagctcaccaac ctgagcttcctccaggcctgctccttggagatgtctctgccatctgttctgacgcctaga ttcagggttgccttgattgcccaggagacctgggcactgcttgaggctggagacatgtgc tttttggaccctggctccagccagaagcactggtggcccccggggtcctatgtggtgtcc tgtgatcccatgacaggggaacgtacagggatgggggtcaaattggagaaggatagatca ctgcgaccagaggaaattgaagagctccgagaggccttcagagaattcgacaaggacaag gatggctacatcaactgccgggatctgggcaactgcatgcgcaccatgggctacatgccc accgagatggagctcatcgaactgtcccagcagatcaacatgaacctgggtggccatgta gattttgatgacttcgtggagctaatggggcctaaactcctggcagagacagcagatatg attggtgtaaaggaactgcgagatgctttccgagagtttgacaccaatggtgatggggaa ataagcaccagtgagctgcgagaggctatgaggaagctcctgggtcatcaggtgggacac cgagacatagaggaaattatccgagatgtggacctcaatggggatggacgagtggacttt gaagagtttgtccggatgatgtcccgctga >gi568815586f:120587297_120796542|GENSCAN_predicted_peptide_3|351_aa MLGAWAVEGTAVALLRLLLLLLPPAIRGPGLGVAGVAGAAGAGLPESVIWAVNAGGEAHV DVHGIHFRKDPLEGRVGRGESPPAEPRDPGPAVLGAAGRGLRAPSPFRPWGRVSGAKFLS AASSGARSELRAWHWLQASDYGMKLPILRSNPEDQILYQTERYNEETFGYEVPIKEEGDY VLVLKFAEVYFAQSQQKVFDVRLNGHVVVKDLDIFDRVGHSTAHDEIIPMSIRKGKLSVQ GEVSTFTGKLYIEFVKGYYDNPKVCALYIMAGTVDDVPKLQPHPGLEKKEEEEEEEEYDE GSNLKKQTNKNRVQSGPRTPNPYASDNSSLMFPILVAFGVFIPTLFCLCRL >gi568815586f:120587297_120796542|GENSCAN_predicted_CDS_3|1056_bp atgctgggagcctgggcggttgagggaaccgctgtggcgctcctgcgactgctgctgctg ctgctgccgccggcgatccggggacccgggctcggcgtggccggcgtggccggcgcggcg ggggccgggctgcccgagagcgtcatttgggcggtcaacgcgggtggagaggcgcatgtg gacgtgcacgggatccacttccgcaaggaccctttggaaggccgggtgggccgaggtgag agtccccctgccgagccgcgggatccagggcctgctgtgctgggcgcagccggccggggg ctgcgggccccgagcccctttcgaccctggggccgcgtctctggagcgaagtttctctct gcagcttcttcgggggcccgctctgagctcagggcctggcactggctccaagcctcagac tatggcatgaaactgccaatcctgcgttccaaccctgaggaccagatcctgtatcaaact gagcggtacaatgaggagacctttggctacgaagtgcccatcaaagaggagggggactac gtgctggtcttgaaatttgcagaggtctactttgcacagtcccagcaaaaggtatttgat gtacgattgaatggccacgtcgtggtgaaggacttggatatctttgatcgtgttgggcat agcacagctcacgatgaaattatacctatgagcatcagaaaggggaagctgagtgtccag ggggaggtgtccaccttcacagggaaactctacattgagtttgtcaaggggtactatgac aatcccaaggtctgtgcactctacatcatggctgggacagtggatgatgtaccaaagctt cagcctcatccgggattggagaagaaagaagaggaagaagaagaagaagaatatgatgaa gggtctaatctcaaaaaacagaccaataagaaccgggtgcagtcaggcccccgcacaccc aacccctatgcctcggacaacagcagcctcatgtttcccatcctggtggccttcggagtc ttcattccaaccctcttctgcctctgccggttgtga >gi568815586f:120587297_120796542|GENSCAN_predicted_peptide_4|857_aa MSGSNPKAAAAASAAGPGGLVAGKEEKKKAGGGVLNRLKARRQAPHHAADDGVGAAVTEQ ELLALDTIRPEHVLRLSRVTENYLCKPEDNIYSIDFTRFKIRDLETGTVLFEIAKPCVSD QEEDEEEGGGDVDISAGRFVRYQFTPAFLRLRTVGATVEFTVGDKPVSNFRMIERHYFRE HLLKNFDFDFGFCIPSSRNTCEHIYEFPQLSEDVIRLMIENPYETRSDSFYFVDNKLIMH NKADYAYNGGHTPEQRARSGRSRSLGLCLSPMAAALLARASGPARRALCPRAWRQLHTIY QSVELPETHQMLLQTCRDFAEKELFPIAAQVDKEHLFPAAQVKKMGGLGLLAMDVPEELG GAGLDYLAYAIAMEEISRGCASTGVIMSVNNSLYLGPILKFGSKEQKQAWVTPFTSGDKI GCFALSEPGNGSDAGAASTTARAEGDSWVLNGTKAWITNAWEASAAVVFASTDRALQNKG ISAFLVPMPTPGLTLGKKEDKLGIRGSSTANLIFEDCRIPKDSILGEPGMGFKIAMQTLD MGRIGIASQALGIAQTALDCAVNYAENRMAFGAPLTKLQVIQFKLADMALALESARLLTW RAAMLKDNKKPFIKEAAMAKLAASEAATAISHQAIQILGGMGYVTEMPAERHYRDARITE IYEGTSEIQRLDQPPGRRKEGPRLPESGSKAVEASSPRWENTEPREAAIVLTSALGGGPG QPVRGLALVLVPAGLSLGKGAPEAKVQALNPLGQCRSQQLEEKTGAVTGYVTICLGIKTP SRHDSNRRCEEIQSLMAKAQTQKLEAHVVATLQQLQERALDFKGKAEEETDDGWSGKNFV MGESLRMRQLESEPAEI >gi568815586f:120587297_120796542|GENSCAN_predicted_CDS_4|2574_bp atgagcgggtctaacccgaaggctgcggccgcggcgtcggcggctgggcccggggggctg gtggctggcaaggaggagaagaagaaggcgggcggcggcgtcctgaaccgcctgaaggcg cggcggcaggcgccccaccacgcggccgacgacggcgtcggggcagcggtcacggagcag gagctgctggcgctggacaccatccggcccgagcacgtcctgcgcctcagccgggtcacc gagaattatttatgtaaacccgaagacaacatctacagtattgatttcacccgcttcaaa attcgagatttggagacagggacagtactttttgagattgccaaaccttgcgtttcagac caggaggaggatgaggaggagggaggtggagacgtggacatcagcgcaggacgttttgtc cgctatcagttcacaccggcatttctccgcctccggacagtcggggctacggtggagttc acagtgggagacaaacctgtttcaaacttccggatgatcgaacggcactatttccgggaa cacttgctgaaaaactttgactttgattttggcttctgcatccccagcagtaggaacact tgtgaacatatctatgagtttccccagctttcggaggatgtcattcgtctaatgattgaa aatccttacgagacccgctctgacagcttctactttgttgacaacaagctgataatgcac aacaaggctgattatgcctataatggaggccacactccggaacagcgcgctcgcagcggg aggtcgcgaagcctgggactgtgtctgtcgcccatggccgccgcgctgctcgcccgggcc tcgggccctgcccgcagagctctctgtcctagggcctggcggcagttacacaccatctac cagtctgtggaactgcccgagacacaccagatgttgctccagacatgccgggactttgcc gagaaggagttgtttcccattgcagcccaggtggataaggaacatctcttcccagcggct caggtgaagaagatgggcgggcttgggcttctggccatggacgtgcccgaggagcttggc ggtgctggcctcgattacctggcctacgccatcgccatggaggagatcagccgtggctgc gcctccaccggagtcatcatgagtgtcaacaactctctctacctggggcccatcttgaag tttggctccaaggagcagaagcaggcgtgggtcacgcctttcaccagtggtgacaaaatt ggctgctttgccctcagcgaaccagggaacggcagtgatgcaggagctgcgtccaccacc gcccgggccgagggcgactcatgggttctgaatggaaccaaagcctggatcaccaatgcc tgggaggcttcggctgccgtggtctttgccagcacggacagagccctgcaaaacaagggc atcagtgccttcctggtccccatgccaacgcctgggctcacgttggggaagaaagaagac aagctgggcatccggggctcatccacggccaacctcatctttgaggactgtcgcatcccc aaggacagcatcctgggggagccagggatgggcttcaagatagccatgcaaaccctggac atgggccgcatcggcatcgcctcccaggccctgggcattgcccagaccgccctcgattgt gctgtgaactacgctgagaatcgcatggccttcggggcgcccctcaccaagctccaggtc atccagttcaagttggcagacatggccctggccctggagagtgcccggctgctgacctgg cgcgctgccatgctgaaggataacaagaagcctttcatcaaggaggcagccatggccaag ctggccgcctcggaggccgcgaccgccatcagccaccaggccatccagatcctgggcggc atgggctacgtgacagagatgccggcagagcggcactaccgcgacgcccgcatcactgag atctacgagggcaccagcgaaatccagcggctggaccagcctccaggcaggaggaaggaa ggccccagattgccagagtcggggagcaaagctgtggaggcctcgagccccaggtgggag aacacagaacctcgagaagcagccattgtgctcactagcgcgctcgggggcggccctggc cagccagtgcgggggcttgccctggtcctggtgcctgcagggctttctctgggcaaaggt gcccctgaagccaaggtccaagctcttaaccctctgggccagtgtcgatctcagcagctg gaagagaaaacaggagctgtcacaggatatgttactatctgcctggggataaaaacccca tcaaggcacgacagcaaccgcaggtgtgaggaaattcaatctctaatggcaaaggctcaa acacaaaagctagaagctcacgtggtggctacactccagcagctccaggaaagggctctg gatttcaagggaaaggctgaggaggaaacagatgatggctggagcgggaagaattttgtg atgggagagtctctgcgcatgagacagttggaatctgaacctgcagaaatttga >gi568815586f:120587297_120796542|GENSCAN_predicted_peptide_5|305_aa XSLNMDFENQDKEKDSNSSSGSFNGNSTNNNYQKVELGIEAIVSLYPRSCRISFGCCGRF TAAELLSFSLSVMLVLIWVLTGHWLLMDALAMGLCVAMIAFVRLPSLKVSCLLLSGLLIY DVFWVFFSAYIFNSNVMVKVATQPADNPLDVLSRKLHLGPNVGRDVPRLSLPGKLVFPSS TGSHFSMLGIGDIVMPGLLLCFVLRYDNYKKQASGDSCGAPGPANISGRMQKVSYFHCTL IGYFVGLLTATVASRIHRAAQPALLYLVPFTLLPLLTMAYLKGDLRRMWSEPFHSKSSSS RFLEV >gi568815586f:120587297_120796542|GENSCAN_predicted_CDS_5|918_bp nngtcccttaatatggactttgaaaatcaagataaggagaaagacagtaatagttcttct gggtctttcaatggcaacagcaccaataataattaccagaaggttgagcttggcatagag gccattgtcagcctctacccacgttcttgcaggatttcctttggttgctgtggacgtttc actgctgctgagttgctgtcattctctctgtctgtcatgctcgtcctcatctgggttctc actggccattggcttctcatggatgcactggccatgggcctctgtgtcgccatgatcgcc tttgtccgcctgccgagcctcaaggtctcctgcctgcttctctcagggcttctcatctat gatgtcttttgggtatttttctcagcctacatcttcaatagcaacgtcatggtgaaggtg gccactcagccggctgacaatccccttgacgttctatcccggaagctccacctggggccc aatgttgggcgtgatgttcctcgcctgtctctgcctggaaaactggtcttcccaagctcc actggcagccacttctccatgttgggcatcggagacatcgttatgcctggtctcctacta tgctttgtccttcgctatgacaactacaaaaagcaagccagtggggactcctgtggggcc cctggacctgccaacatctccgggcgcatgcagaaggtctcctactttcactgcaccctc atcggatactttgtaggcctgctcactgctactgtggcgtctcgcattcaccgggccgcc cagcccgcccttctctatttggtgccatttactttattgccactcctcacgatggcctat ttaaagggcgacctccggcggatgtggtctgagcctttccactccaagtccagcagctcc cgattcctggaagtatga