GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:31:21 Sequence gi568815592r:149405534_149646005 : 240472 bp : 43.23% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4044 4163 120 1 0 112 28 63 0.442 3.17 1.02 Intr + 18789 18856 68 1 2 51 103 68 0.000 3.22 1.03 Intr + 30642 30841 200 2 2 78 42 102 0.007 2.75 1.04 Intr + 38539 39236 698 1 2 53 82 367 0.310 23.73 1.05 Intr + 40613 40669 57 0 0 102 80 19 0.287 1.36 1.06 Term + 40871 40974 104 0 2 57 40 76 0.285 -1.86 1.07 PlyA + 42080 42085 6 1.05 2.07 PlyA - 42115 42110 6 1.05 2.06 Term - 45946 45150 797 2 2 118 49 772 0.973 69.75 2.05 Intr - 47189 47083 107 1 2 84 64 67 0.955 3.76 2.04 Intr - 51367 51133 235 2 1 99 94 816 0.960 80.05 2.03 Intr - 56437 56298 140 0 2 95 69 90 0.928 7.91 2.02 Intr - 69080 68706 375 1 0 85 55 467 0.146 37.23 2.01 Init - 81932 81880 53 2 2 99 73 18 0.044 2.03 2.00 Prom - 85296 85257 40 -4.56 3.10 PlyA - 85745 85740 6 1.05 3.09 Term - 100171 99920 252 1 0 105 48 139 0.562 7.24 3.08 Intr - 111917 111802 116 0 2 91 13 182 0.197 11.17 3.07 Intr - 115638 115527 112 0 1 31 86 62 0.646 0.25 3.06 Intr - 119676 119610 67 2 1 77 95 56 0.943 4.11 3.05 Intr - 121243 121119 125 1 2 75 84 33 0.937 0.98 3.04 Intr - 128041 127925 117 1 0 65 95 151 0.999 14.16 3.03 Intr - 129241 129145 97 0 1 19 95 53 0.989 -0.89 3.02 Intr - 130189 130063 127 2 1 83 80 100 0.796 8.44 3.01 Init - 140472 140403 70 0 1 121 65 160 0.984 18.21 3.00 Prom - 155565 155526 40 -1.76 4.00 Prom + 157243 157282 40 -2.56 4.01 Init + 160882 161001 120 0 0 93 91 247 0.999 23.69 4.02 Intr + 166974 167070 97 2 1 64 95 6 0.139 -1.52 4.03 Intr + 173289 173440 152 1 2 80 71 82 0.175 5.58 4.04 Intr + 174301 174457 157 0 1 81 88 0 0.175 -1.12 4.05 Intr + 175060 175190 131 2 2 68 94 40 0.142 3.01 4.06 Intr + 176907 177070 164 2 2 76 108 -31 0.096 -3.53 4.07 Term + 185194 185305 112 1 1 61 41 133 0.876 3.93 4.08 PlyA + 185600 185605 6 1.05 5.00 Prom + 187138 187177 40 -5.36 5.01 Sngl + 188786 189010 225 1 0 59 41 291 0.210 16.64 5.02 PlyA + 189031 189036 6 1.05 6.11 PlyA - 189367 189362 6 1.05 6.10 Term - 189701 189503 199 1 1 28 32 184 0.828 3.67 6.09 Intr - 191656 191530 127 2 1 33 80 86 0.740 2.04 6.08 Intr - 192108 191974 135 1 0 99 70 60 0.820 5.84 6.07 Intr - 192817 192691 127 1 1 91 25 31 0.761 -2.65 6.06 Intr - 196219 196061 159 1 0 49 23 119 0.597 1.68 6.05 Intr - 197840 197735 106 1 1 81 106 27 0.954 4.02 6.04 Intr - 199249 199128 122 1 2 65 115 76 0.915 7.29 6.03 Intr - 217750 217570 181 0 1 36 94 94 0.836 4.67 6.02 Intr - 227383 227226 158 1 2 59 115 58 0.882 4.41 6.01 Intr - 233027 232853 175 1 1 105 94 36 0.553 5.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_1|415_aa XQRSIIKTPKTQDTEDDEGAQWNCTACTFLNHPALIRCEQWHLHLRVPIYKADMEKIHIN YHHDLGAASCPIPGPKLCHLASCDFYSEAHSKTILPVPPDVPVSADTGAEKGESQCSELP GHLPSGVMGFQPQVLATLQPHQLVPLSQCVSQPVCQSASVSQPVCQSASVSHPVSVSQCV SHCQPMSASVSASQCQPVLVSQCVSQPVSASVSHPVSVSQCVSHCQPMSASVSASQCQPV LVSQCVSQPVSASVSHPVSVSQCVSHCQPMSASVSASQCQPVLVSQCVSQPVSVSQCVSH CQPMSASVSASQCQPAHVSASVSVSQSQPVSVRQCQSASVSPSVSHPVFQSASVHQPVSV SQWKAKGAPLSPQVAAASKPSGPYASKEPMFGFMLCRHLEIVTHFEQGTLQFDFA >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_1|1248_bp natcaaaggtccatcatcaaaacaccaaagactcaagacacagaagatgatgagggagct cagtggaattgtaccgcctgtacttttttgaaccatccagccttaattcgctgtgaacag tggcatttacacttacgtgtgcccatctacaaagcagacatggaaaaaatacacatcaac taccaccatgaccttggtgcagccagctgccccatcccaggcccaaagctctgccatctg gcttcttgtgatttctattccgaggctcacagcaagaccatcctccccgttcctccagat gttcctgtctctgcagacactggagcagagaagggtgagagccagtgctcagagctgccc ggccatttgccttcaggggtcatgggcttccagccacaggtcctggccacacttcagcct catcagctagtgccactcagccagtgtgtcagtcagccagtgtgtcagtcagccagtgtt agtcagccagtgtgtcagtcagccagtgtcagccacccagtgtcagtcagccagtgtgtc agccactgtcagccaatgtcagccagtgtgtcagccagccagtgtcagccagtgttagtc agccagtgtgtcagtcagccagtgtcagccagtgtcagtcacccagtgtcagtcagccag tgtgtcagccactgtcagccaatgtcagccagtgtgtcagccagccagtgtcagccagtg ttagtcagccagtgtgtcagtcagccagtgtcagccagtgtcagtcacccagtgtcagtc agccagtgtgtcagccactgtcagccaatgtcagccagtgtgtcagccagccagtgtcag ccagtgttagtcagccagtgtgtcagtcagccagtgtcagtcagccagtgtgtcagccac tgtcagccaatgtcagccagtgtgtcagccagccagtgtcagccagcccatgtgtcagcc agtgtgtcagtcagccagtctcagccagtgtcagtcaggcagtgtcagtcagccagtgtg tcacccagtgtcagccacccagtgtttcagtcagccagtgtccatcagccagtgtccgtc agccaatggaaagccaagggagccccacttagtcctcaggtggctgcagcaagcaaaccc tcagggccctacgcttcgaaggaacccatgtttggtttcatgctctgccgacatcttgaa attgttacacattttgaacagggaaccttgcagtttgattttgcatag >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_2|568_aa MASSTGLELKILTISYSISCLQGSERRKPGAGPAPVAAGHSMEHPSKMEFFQKLGYDRED VLRVLGKLGEGALVNDVLQELIRTGSRPGALEHPAAPRLVPRGSCGVPDSAQRGPGTALE EDFRTLASSLRPIVIDGSNVAMSHGNKETFSCRGIKLAVDWFRDRGHTYIKVFVPSWRKD PPRADTPIREQHVLAELERQAVLVYTPSRKVHGKRLVCYDDRYIVKVAYEQDGVIVSNDN YRDLQSENPEWKWFIEQRLLMFSFVNDRFMPPDDPLGRHGPSLSNFLSRKPKPPEPSWQH CPYGKKCTYGIKCKFYHPERPHHAQLAVADELRAKTGARPGAGAEEQRPPRAPGGSAGAR AAPREPFAHSLPPARGSPDLAALRGSFSRLAFSDDLGPLGPPLPVPACSLTPRLGGPDWV SAGGRVPGPLSLPSPESQFSPGDLPPPPGLQLQPRGEHRPRDLHGDLLSPRRPPDDPWAR PPRSDRFPGRSVWAEPAWGDGATGGLSVYATEDDEGDARARARIALYSVFPRDQVDRVMA AFPELSDLARLILLVQRCQSAGAPLGKP >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_2|1707_bp atggcatcttctacaggacttgagctgaagatattgacgatttcttactcaatctcatgt ctgcagggctctgagaggaggaagcctggggcaggacctgcgccagtggccgctgggcac agcatggagcaccccagcaagatggaattcttccagaagctgggctatgaccgggaggat gtgctccgggtgttgggcaagctgggcgagggcgccctggtcaacgacgtgctgcaggag cttatccgcacgggcagccgcccgggtgccctggagcacccggctgcacccaggctagtg cctcggggctcctgtggggtcccggactctgcccagcgtggcccggggacagccctggaa gaggacttcagaaccctggccagttctctgcgacccatagtgattgatggcagcaacgtg gcgatgagccatggaaataaagaaaccttctcttgccggggaatcaagctggctgttgac tggttcagggacagaggacacacctacatcaaagtttttgttccatcctggaggaaggac ccaccaagagctgacacccctatcagagagcagcacgtgctggcggagctggagcggcag gcggtgctggtgtacacgccgtcccgcaaggtgcacggcaagcgcctggtctgctacgac gaccgctacatcgtgaaggtggcctacgagcaggacggcgtcatcgtctccaacgacaac taccgggacctgcagagcgagaaccccgagtggaagtggttcatcgagcagaggctgctc atgttctccttcgtcaacgaccggttcatgccgcctgatgaccccctgggccgccatgga ccctccctgagcaacttcctgagcaggaagccgaagcccccagagccatcctggcagcat tgtccttatggcaagaaatgcacctatggcatcaagtgcaagttctaccacccggagagg ccgcaccacgcgcaactggcggtggccgacgagctccgcgccaagacaggggcccggcct ggcgcgggcgccgaggagcagcggccaccgagagccccgggcggctccgcaggagcccgg gcggccccccgggaaccatttgcgcacagcctcccgccggcgcgggggtccccggacctg gccgccctgcgagggagcttctctcggctggccttcagcgacgacctggggcccctgggg ccgcctctcccggtccccgcctgcagcctcacgccccgactgggcgggcccgactgggtg tccgcgggcggccgggtgccaggcccgctcagcctccctagcccggagagccagttctcc ccgggcgacctcccgcctccgcccggcctgcagctccagccgcggggcgaacaccgccct agggacctgcacggcgacttgctttccccgcgcaggccacccgacgacccgtgggcccgt ccaccccgctccgaccgcttccctgggcgctccgtctgggcggagccggcctggggcgac ggcgccactgggggactttcagtgtacgcgaccgaggacgacgagggggacgcgcgcgcc cgggctcgcatcgcgctctacagcgtcttcccgcgtgaccaggtggaccgcgtgatggcc gcgttcccggagctctcagacctcgccaggctcatcctcctggtacagagatgccagagc gcgggggcgcccctgggcaagccctaa >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_3|360_aa MAVLLETTLGDVVIDLYTEERPRGENLDYLDGVHTVFGEVTEGMDIIKKINETFVDKDFV PYQDIRINHTVILDDPFDDPPDLLIPDRSPEPTREQLDSGRIGADEEIDDFKGRSAEEVE EIKAEKEAKTQAILLEMVGDLPDADIKPPENVLFVCKLNPVTTDEDLEIIFSRFGPIRSC EVIRDWKTGESLCYAFIEFEKEEDCEKAFFKMDNVLIDDRRIHVDFSQSVAKVKWKGKGG KYTKSDFKEYEKEQDKPPNLVLKDKVKPKQEYPYQESDIYREMGFGHYEEEESCWEKQKS EKRDRTQNRSRSRSRERDGHYSNSHKSKYQTDLYERERSKKRDRSRSPKKSKDKEKSKYR >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_3|1083_bp atggcggttctactggagaccactttaggcgacgtcgtcatcgacttgtacaccgaagaa cggccgcgtggagaaaatctagattatcttgatggtgtccatacggtgtttggtgaggtg acagaaggcatggacataattaagaaaattaatgagacctttgttgacaaggactttgta ccatatcaggatatcaggataaatcatacggtgattttagatgatccatttgatgaccct cctgatttattaatccctgatcgatcaccagaacctacaagggaacaattagatagtggt cgaataggagcagatgaagaaattgatgatttcaaaggaagatcagctgaggaagtagaa gaaataaaggcagaaaaagaggctaaaactcaggctatacttttggagatggtgggagac ctacctgatgcagatattaaacctccagaaaatgtactgtttgtgtgtaaattgaaccca gtgaccacagatgaggatctggaaataatattctctagatttgggccaataagaagttgt gaagttatccgagactggaagacaggagagtccctctgttacgcttttattgaatttgaa aaggaagaagattgtgagaaagcattcttcaaaatggacaatgtgcttatagatgacaga agaatacatgtggattttagccagtcggttgcaaaggttaaatggaaaggaaaaggtggg aaatacaccaagagtgatttcaaggagtatgaaaaagaacaggataaaccacctaatttg gttctgaaagataaagtaaagcccaaacaggagtatccttaccaagaatcagatatctat agagaaatggggtttggtcactatgaagaagaagaaagctgttgggagaaacaaaagagt gaaaagagagaccgaactcagaaccgaagtcgtagccgatctcgagagagggatggccat tatagtaatagtcataaatcaaaataccaaacagatctttatgaaagagaaaggagtaaa aagagagaccgaagcagaagtccaaagaagtccaaagataaagaaaaatctaagtataga tga >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_4|310_aa MEGAPPGSLALRLLLFVALPASGWLTTGAPEPPPLSGAPQVVLNITYESGQVYVNDLPVN SGVTRISCQTLIVKNENLENLEEKEYFGIVSVRILVHEWPMTSGSSLQLIVIQEEVVEID GKQVQQKDVTEIDILVKNRGVLRHSNYTLPLEESMLYSISRDSDILFTLPNLSKKESVSS LQTTSQYLIRNVETTVDEDVLPGKLPETPLRAEPPSSYKVMCQWMEKFRKDLCRFWSNVF PVFFQFLNIMVVGITGAAVVITILKVFFPVSEYKGILQLDKVDVIPVTAINLYPDGPEKR AENLEDKTCI >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_4|933_bp atggagggcgctccaccggggtcgctcgccctccggctcctgctgttcgtggcgctaccc gcctccggctggctgacgacgggcgcccccgagccgccgccgctgtccggagccccacag gttgttcttaacataacctatgagagtggacaggtgtatgtaaatgacttacctgtaaat agtggtgtaacccgaataagctgtcagactttgatagtgaagaatgaaaatcttgaaaat ttggaggaaaaagaatattttggaattgtcagtgtaaggattttagttcatgagtggcct atgacatctggttccagtttgcaactaattgtcattcaagaagaggtagtagagattgat ggaaaacaagttcagcaaaaggatgtcactgaaattgatattttagttaagaaccgggga gtactcagacattcaaactataccctccctttggaagaaagcatgctctactctatttct cgagacagtgacattttatttacccttcctaacctctccaaaaaagaaagtgttagttca ctgcaaaccactagccagtatcttatcaggaatgtggaaaccactgtagatgaagatgtt ttacctggcaagttacctgaaactcctctcagagcagagccgccatcttcatataaggta atgtgtcagtggatggaaaagtttagaaaagatctgtgtaggttctggagcaacgttttc ccagtattctttcagtttttgaacatcatggtggttggaattacaggagcagctgtggta ataaccatcttaaaggtgtttttcccagtttctgaatacaaaggaattcttcagttggat aaagtggacgtcatacctgtgacagctatcaacttatatccagatggtccagagaaaaga gctgaaaaccttgaagataaaacatgtatttaa >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_5|74_aa MPAWFLNRQKDVKDGKYSQVLANGLDNKLCEDLERLKKIRARRGLRHFWGLRVRGQHTKT TGCRGRIVGVSKKK >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_5|225_bp atgccagcctggttcttgaacagacagaaggatgtaaaggatggaaaatatagccaggtc ctagccaatggtctggacaacaagctctgtgaagacctggagcgactgaaaaagattcgg gcccgtagagggctgcgtcacttctggggccttcgtgtccgaggccagcacaccaagacc acaggctgccgtggccgcatcgtgggtgtgtctaagaagaagtaa >gi568815592r:149405534_149646005|GENSCAN_predicted_peptide_6|496_aa XLQLNMSLLMISENVKLAREYALLGNYDSAMVYYQGVLDQMNKYLYSVKDTYLQQKWQQV WQEINVEAKHVKDIMKTLESFKLDSTPLKAAQHDLPASEGEVWSMPVPVERRPSPGPRKR QSSQYSDPKSHGNRPSTTVRVHRSSAQNVHNDRGKAVRCREKKEQNKGREEKNKSPAAVT EPETNKFDSTGYDKDLVEALERDIISQNPNVRWDDIADLVEAKKLLKEAVVLPMWMPEFF KGIRRPWKGVLMVGPPGTGKTLLAKAVATECKTTFFNVSSSTLTSKYRGESEKLVRLLFE MARFYSPATIFIDEIDSICSRRGTSEEHEASRRVKAELLVQMDGVGGTSENDDPSKMVMV LAATNFPWDIDEALRRRLEKRIYIPLPSAKGREELLRISLRELELADDVDLASIAENMEG YSGADITNVCRDASLMAMRRRIEGLTPEEIRNLSKEEMHMPTTMEDFEMALKKVSKSVSA ADIERYEKWIFEFGSC >gi568815592r:149405534_149646005|GENSCAN_predicted_CDS_6|1491_bp nncttacagttgaacatgagtcttcttatgattagtgagaatgtaaaattggctcgtgaa tatgcattgctgggaaactatgactctgcgatggtctattatcagggagttcttgaccaa atgaacaagtatctgtactcagtcaaagatacatacctccagcagaaatggcaacaggtt tggcaggaaataaatgtggaagctaaacatgttaaagatatcatgaaaacactagagagc tttaaactggacagcactcccttgaaagcggcacagcatgaccttccagcttctgaggga gaagtctggtccatgcctgtacctgttgaacgaagaccctcaccaggacctagaaaacgc caatcttctcagtacagtgaccctaaatcacatggtaatcgtccaagtacaactgtcaga gttcaccgttcatctgcacagaatgttcacaatgacagagggaaagctgttcgttgtcgt gaaaagaaagaacagaataaaggaagagaggaaaagaacaaatcacctgctgcagtaaca gaaccagagacaaataaatttgatagtaccggatatgataaagacttagtagaagctttg gaaagagatataatttcccagaatcccaatgttcgatgggatgatatcgctgatttagta gaagctaaaaagttgcttaaggaagccgtagtgttaccaatgtggatgcccgaattcttt aagggcattaggagaccatggaaaggagtactgatggtcggcccacctggcacggggaag acgctccttgctaaagcagtagctacagaatgcaagacaacattcttcaatgtctcttca tcaactttgacttccaaatacagaggagaatctgagaagcttgttcgtcttctgtttgaa atggctcgattttattctccagccaccatatttattgatgagatagactccatctgtagt cgccgagggacttctgaagaacatgaagcaagcagaagggtgaaagcggagctgctggtt cagatggatggtgttggaggtacttctgaaaatgatgacccttccaaaatggttatggtt ctggcagctactaattttccctgggatatagatgaggctttaagacgacgccttgagaaa cgaatctatattcctttgccgtcagcaaaaggcagggaggagctattacgaataagtcta cgtgagttggaattggctgatgatgttgaccttgcaagtatagcagaaaacatggaaggt tattcaggtgcggacattaccaacgtgtgcagggatgcgtccttgatggcaatgagaagg cgcattgaaggtttgactccagaggaaatccgaaatctttccaaagaagaaatgcacatg cctacaactatggaggatttcgagatggctttaaaaaaggtttctaagtcagtgtctgct gcagacattgaaagatacgagaaatggatatttgagtttggatcatgctaa