GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:49:23 Sequence gi568815589f:38295749_38497299 : 201551 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1126 1121 6 1.05 1.03 Term - 10324 10308 17 2 2 153 53 -3 0.107 1.10 1.02 Intr - 14242 14064 179 0 2 81 84 73 0.283 5.76 1.01 Init - 22779 22751 29 0 2 94 110 54 0.616 7.28 1.00 Prom - 23822 23783 40 -2.76 2.00 Prom + 38163 38202 40 -3.16 2.01 Init + 38302 38414 113 0 2 52 99 60 0.253 3.19 2.02 Term + 50627 51014 388 2 1 78 46 156 0.139 4.81 2.03 PlyA + 54818 54823 6 1.05 3.07 PlyA - 57551 57546 6 1.05 3.06 Term - 59184 59087 98 1 2 84 42 238 0.093 16.93 3.05 Intr - 67725 67669 57 1 0 92 82 17 0.094 0.36 3.04 Intr - 77221 77162 60 2 0 97 87 55 0.707 5.01 3.03 Intr - 83086 83002 85 1 1 24 113 66 0.011 2.09 3.02 Intr - 90058 89927 132 1 0 62 77 71 0.056 4.24 3.01 Init - 97212 97069 144 0 0 55 18 114 0.066 1.22 3.00 Prom - 99314 99275 40 0.24 4.00 Prom + 99724 99763 40 -9.95 4.01 Sngl + 100001 101554 1554 1 0 82 48 2328 0.937 221.58 4.02 PlyA + 101686 101691 6 1.05 5.09 PlyA - 103254 103249 6 1.05 5.08 Term - 115801 115652 150 1 0 106 45 136 0.963 9.01 5.07 Intr - 117605 117489 117 2 0 121 97 91 0.999 13.96 5.06 Intr - 118455 118346 110 1 2 99 63 112 0.780 9.80 5.05 Intr - 119330 119258 73 2 1 15 94 51 0.154 -2.62 5.04 Intr - 123432 123351 82 2 1 105 75 10 0.917 1.04 5.03 Intr - 124183 124070 114 0 0 64 61 179 0.970 12.36 5.02 Intr - 126746 126674 73 0 1 75 92 15 0.973 -0.94 5.01 Init - 128676 128217 460 0 1 83 109 728 0.995 68.32 5.00 Prom - 129888 129849 40 -5.16 6.04 PlyA - 134072 134067 6 1.05 6.03 Term - 136727 136540 188 0 2 64 50 145 0.404 5.95 6.02 Intr - 142607 142514 94 2 1 100 72 8 0.185 -0.06 6.01 Init - 144011 143955 57 2 0 75 121 19 0.229 5.21 6.00 Prom - 154110 154071 40 -4.26 7.00 Prom + 155301 155340 40 -5.36 7.01 Init + 158072 158082 11 1 2 117 43 18 0.030 -0.92 7.02 Intr + 160370 160445 76 2 1 91 89 57 0.031 5.62 7.03 Intr + 163460 163557 98 1 2 13 94 28 0.030 -5.29 7.04 Intr + 166355 166485 131 2 2 132 79 160 0.944 19.84 7.05 Intr + 174099 174217 119 1 2 76 99 96 0.903 9.68 7.06 Intr + 178679 178792 114 1 0 78 115 7 0.297 3.04 7.07 Intr + 182699 183021 323 1 2 -26 4 260 0.264 0.86 7.08 Term + 183104 183602 499 2 1 4 50 288 0.440 10.60 7.09 PlyA + 185680 185685 6 1.05 8.02 PlyA - 186072 186067 6 1.05 8.01 Term - 192548 192133 416 0 2 9 43 406 0.452 23.62 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 59034 59170 137 1 2 7 50 276 0.878 13.88 S.002 Init - 78602 78452 151 2 1 58 69 83 0.818 3.60 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_1|74_aa MTKGTGFGDRASGKCHLKLPSLSNQKEGGLEQSSSQDCPGNWTLKTVGAYRSACKQESEA HTDNSNQPQGLPIP >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_1|225_bp atgaccaagggcacaggatttggcgacagggcatctgggaaatgtcatcttaagctgccc agcctctccaaccagaaggagggtggactggagcaatccagttcacaggactgtccaggg aactggactctgaagactgtgggggcctataggagtgcatgcaaacaggaaagcgaggct cacacagacaacagcaatcagccacagggcctccctattccctga >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_2|166_aa MPLLSSHVERPDYPTELRFHVYTMGGKNLYLREIVTLRTRAVILIKTFGALHGLRNQAHA LQLTIKPSAIRLAWDFSGLIPHFPSISTHTLQGHSHGPTSYSPVPLVLSHLRALGHVVPS LGNTLPPHLCPSNLLSFQDSPKAISSMKSSLTTFQKAGTSQSSELL >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_2|501_bp atgccattgctgagcagccatgtagagagaccagattatcccactgagcttcggttccac gtctacacaatgggagggaaaaacctatatctcagagagattgtgacactcaggaccagg gccgttatccttatcaaaactttcggtgctctccatgggctgagaaaccaggctcatgct cttcagctgacaatcaagccctctgccatccggctggcctgggacttctctggccttatt ccccatttccccagcatctccacccatactcttcagggccactcacacgggcccacctcc tattctccagtacccctggttctctcccacctccgagccttgggtcatgtggtcccttca cttggaaacactcttcctcctcatctctgcccatctaatctactgtccttccaggacagc ccaaaggccatctcctccatgaagtcctctctgactaccttccagaaggcagggacctct cagtcctctgagctcctatga >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_3|191_aa MGLGDVERVLQCNGGKASPQMRTPRPREESPHAEPGAAASRGSRGETSTGEADRHGDWHV SLTQCEKENPESGAKRAEKQEMGQNNYTKPGKHMVDVAVVCPELVGSLLTDFKNEAVDPH DRREDLLINVCTSASEEKQGYPPSDPASVLSPPHHEAAAVPKVISEQQQQQQQQQQQQQQ QTSFEGPKKQA >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_3|576_bp atgggattaggggacgtggagagagtcctacagtgcaacggcggaaaggccagcccccag atgcggacgccgcggcccagagaggaaagcccccacgcggagccaggagctgcggcttcc cgagggagccgaggggagaccagcactggggaggcagacagacatggagactggcatgtt tccctcacacagtgtgagaaagagaacccagagagtggtgccaagagagcagagaaacag gaaatgggacaaaacaactacacaaagcctgggaagcatatggtagatgtagcggtggtg tgcccggaattggtgggttctttgctcactgacttcaagaatgaagccgtggaccctcac gaccgccgtgaagatttactaataaatgtctgcacatcagccagtgaagagaagcagggc tacccacccagtgatcctgcctctgtcctctcacccccgcatcatgaagcagctgccgtc ccaaaggtcatctcagagcagcagcagcagcagcagcagcagcagcagcagcagcagcag cagacaagctttgaggggcccaaaaagcaggcctag >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_4|517_aa MLRFLAPRLLSLQGRTARYSSAAALPSPILNPDIPYNQLFINNEWQDAVSKKTFPTVNPT TGEVIGHVAEGDRADVDRAVKAAREAFRLGSPWRRMDASERGRLLNRLADLVERDRVYLA SLETLDNGKPFQESYALDLDEVIKVYRYFAGWADKWHGKTIPMDGQHFCFTRHEPVGVCG QIIPWNFPLVMQGWKLAPALATGNTVVMKVAEQTPLSALYLASLIKEAGFPPGVVNIITG YGPTAGAAIAQHVDVDKVAFTGSTEVGHLIQKAAGDSNLKRVTLELGGKSPSIVLADADM EHAVEQCHEALFFNMGQCCCAGSRTFVEESIYNEFLERTVEKAKQRKVGNPFELDTQQGP QVDKEQFERVLGYIQLGQKEGAKLLCGGERFGERGFFIKPTVFGGVQDDMRIAKEEIFGP VQPLFKFKKIEEVVERANNTRYGLAAAVFTRDLDKAMYFTQALQAGTVWVNTYNIVTCHT PFGGFKESGNGRELGEDGLKAYTEVKTVTIKVPQKNS >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_4|1554_bp atgctgcgcttcctggcaccccggctgcttagcctccagggcaggaccgcccgctactcc tcggcagcagccctcccaagccccattctgaacccagacatcccctacaaccagctgttc atcaacaatgaatggcaagatgcagtcagcaagaagaccttcccgacggtcaaccctacc accggggaggtcattgggcacgtggctgaaggtgaccgggctgatgtggatcgggccgtg aaagcagcccgggaagccttccgcctggggtccccatggcgccggatggatgcctctgag cggggccggctgctgaaccgcctggcagacctagtggagcgggatcgagtctacttggcc tcactcgagaccttggacaatgggaagcctttccaagagtcttacgccttggacttggat gaggtcatcaaggtgtatcggtactttgctggctgggctgacaagtggcatggcaagacc atccccatggatggccagcatttctgcttcacccggcatgagcccgttggtgtctgtggc cagatcatcccgtggaacttccccttggtcatgcagggttggaaacttgccccggcactc gccacaggcaacactgtggttatgaaggtggcagagcagacccccctctctgccctgtat ttggcctccctcatcaaggaggcaggctttccccctggggtggtgaacatcatcacgggg tatggcccaacagcaggtgcggccatcgcccagcacgtggatgttgacaaagttgccttc accggttccaccgaggtgggccacctgatccagaaagcagctggcgattccaacctcaag agagtcaccctggagctgggtggtaagagccccagcatcgtgctggccgatgctgacatg gagcatgccgtggagcagtgccacgaagccctgttcttcaacatgggccagtgctgctgt gctggctcccggaccttcgtggaagaatccatctacaatgagtttctcgagagaaccgtg gagaaagcaaagcagaggaaagtggggaacccctttgagctggacacccagcaggggcct caggtggacaaggagcagtttgaacgagtcctaggctacatccagcttggccagaaggag ggcgcaaaactcctctgtggcggagagcgtttcggggagcgtggtttcttcatcaagcct actgtctttggtggcgtgcaggatgacatgagaattgccaaagaggagatctttgggcct gtgcagcccctgttcaagttcaagaagattgaggaggtggttgagagggccaacaacacc aggtatggcctggctgcggctgtgttcacccgggatctggacaaggccatgtacttcacc caggcactccaggccgggaccgtgtgggtaaacacctacaacatcgtcacctgccacacg ccatttggagggtttaaggaatctggaaacgggagggagctgggtgaggatgggcttaag gcctacacagaggtaaagacggtcaccatcaaggttcctcagaagaactcgtaa >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_5|392_aa MPRLSLLLPLLLLLLLPLLPPLSPSLGIRDVGGRRPKCGPCRPEGCPAPAPCPAPGISAL DECGCCARCLGAEGASCGGRAGGRCGPGLVCASQAAGAAPEGTGLCVCAQRGTVCGSDGR SYPSVCALRLRARHTPRAHPGHLHKARDGPCEFVPITRFYNCFPQPLIHRQFSLSPDRRQ SETLSKKKKKKEEEEEEEEEGEEEKEEEGCKSNFQHTINFKEISEGFGKIFSFQPSMIDI IDEASTLHVAQHAVVLDARVAELLSNAAPVVVVPPRSVHNVTGAQVGLSCEVRAVPTPVI TWRKVTKSPEGTQALEELPGDHVNIAVQVRGGPSDHEATAWILINPLRKEDEGVYQCHAA NMVGEAESHSTVTVLDLSKYRSFHFPAPDDRM >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_5|1179_bp atgccgcgcttgtctctgctcttgccgctgctgcttctgctgctgctgccgctgctgccg ccgctgtccccgagccttgggatccgcgacgtgggcggccggcgccccaagtgtggtccg tgccggccagagggctgcccggcgcctgcgccctgcccggcgcccgggatctcggcgctc gacgagtgcggctgctgcgcccgctgcctgggagccgagggcgcgagctgcgggggccgc gccggcgggcgctgtggccccggcctggtatgcgcgagccaggccgctggggcagcgccc gagggcaccgggctctgcgtgtgcgcgcagcgcggcaccgtctgcggctccgacggtcgc tcgtaccccagcgtctgcgcgctgcgcctgcgcgctcggcacacgccccgcgcgcacccc ggtcacctgcacaaggcgcgcgacggcccttgcgagttcgttcctatcactcgtttttat aactgctttcctcagccgttaattcacaggcaattctctttgtctccagacaggagacag agtgagaccctgtctaaaaagaagaagaagaaggaggaggaggaggaggaggaggaggag ggggaggaggagaaggaagaagaaggatgcaaaagcaatttccaacacaccattaacttt aaagaaatctcagagggatttgggaagattttttcattccagccatcaatgatcgatata attgacgaggcctctacactgcacgttgcccaacacgctgtggtgctggatgccagggtg gctgagttgctgtccaatgcagctcctgtggtcgtcgttcctccccgaagtgttcacaac gtcaccggggcgcaggtgggcctgtcctgtgaagtgagggctgtgcctaccccagtcatc acgtggagaaaggtcacgaagtcccctgagggcacccaagcactggaggagctgcctggg gaccatgtcaatatagctgtccaagtgcgagggggcccttctgaccatgaggccacggcc tggattttgatcaaccccctgcgaaaggaggatgagggtgtgtaccagtgccatgcagcc aacatggtgggagaggctgagtcccacagcacagtgacggttctagatctgagtaaatac aggagcttccacttcccagctcccgatgaccgcatgtga >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_6|112_aa MVLTSKEHQGSKGHSWHQKFKTDIVTPEVMVGFFVIFWSRLSVLGSKTFEVVTFIELKLL PYPPYLGHLLEHTEHGFLSYSPVSSTGQVYSEPSESLLDEEQMSQCISESTD >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_6|339_bp atggtgctgacttcaaaggaacatcaagggtccaaggggcacagctggcaccaaaagttc aaaacagatattgtgacgccagaagtgatggttggattttttgttatattttggagtagg ctttcagttcttggatctaaaacatttgaagtagtaacattcattgagctgaaattgctt ccctatccaccctacctgggccaccttcttgagcacacagagcatggcttcctcagctac tccccagtgtccagcacaggccaggtgtacagcgagccttctgagagcttgctggatgaa gaacaaatgagtcaatgcatcagtgaatctacagattag >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_7|456_aa MAMRTRQLNGTCTSSVDMELFLHYSLIPSAQAVDMLRSQPRALEKRESTDVRTEARGDLS CRPLDFVGTFSNRWSYGITFGATANKVMFLFSEGYQPLQIPQWAQAFELLIGGIEVGLSH FPFFACLSSEFQLVSSILGFCYSDLTAWAVPCIPSSGEMMVGFHICVASPALEITLSPCI SHQARRRPAGATEDEVARSAKKMDEIVQEKNTAGALGLLTELQNVLELLRSTGTGMLLNV SLKQSTDEEVTSLAKSFVKSWKTLPDEPSTEKDPNEKRIEPAMTSQNSKRNASNSVRMKY REMLAAALRTGDDCIEMGADEEELGSRIEEAVDPERGNTGMKYKNRVQSKISNLTDAKNP NLRKNASCGNIPPDLLARMSAEEMASDELKEMHKNLTKEAIREHQMAKTGGTQPDSLTCG KCKKNCTSTQVQACSVGEPMATFVDCNECGNQQKFC >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_7|1371_bp atggcgatgaggaccaggcagttaaatggcacttgtaccagctcggtggacatggagctc ttccttcattattccctcatcccatcagctcaagctgtggacatgctgaggtctcagcca agggctctggagaagagagaaagcacagatgtcagaactgaggcaaggggtgacctgagc tgcaggcctctggacttcgtgggcactttcagtaaccgatggtcctatggaatcaccttt ggggccacagctaataaggtcatgtttttgttctcagaaggctaccagcccctgcagatc ccgcagtgggcccaagcctttgagcttctgattggaggcattgaagtcggcctgtcccac ttccccttctttgcctgcctctcgtcggaattccagctggtcagctccatcttgggcttc tgctactctgatctgacagcatgggctgtaccgtgcattccctcctctggagaaatgatg gtaggtttccacatctgtgtagcatctcccgctctggagatcaccctgagcccttgcatc tcccaccaggcacggaggagacctgccggagccacggaggatgaggtggcccgcagtgct aagaagatggacgagatagtgcaggagaagaacacggccggagcactgggtttgttaacc gagcttcagaatgttctggaattactgcggtccacaggaactggaatgttacttaatgtt agtctcaagcagagtacagatgaagaagttacatctctagcaaagtctttcgtcaaatcc tggaaaacgttaccagatgagccatcaactgagaaagaccccaacgaaaagcgaatagaa cctgcaatgacatcacagaatagcaaaagaaacgcttctaattctgtgcggatgaagtac agggagatgcttgctgcagctcttcgaacaggagatgactgcattgaaatgggagctgat gaggaagaattaggatctcgaattgaggaagctgtagatccagaaagagggaatacaggc atgaagtacaaaaatagagtccaaagtaagatatcaaatcttacagatgcaaagaatcca aatttaaggaaaaatgcatcgtgtgggaatattcctcctgacttacttgctagaatgtcc gcagaagaaatggctagcgatgagctcaaagagatgcacaaaaacttgacgaaagaagcc atcagagagcatcagatggccaagacaggtggaacccagcctgattcgctcacatgtggc aaatgtaaaaagaattgcacttccacacaggtacaagcctgcagtgttggtgaaccaatg gcaacgtttgttgactgtaatgaatgtggaaatcaacagaagttctgttga >gi568815589f:38295749_38497299|GENSCAN_predicted_peptide_8|138_aa XWLALRFFVARDSAATFIGQWRTDLGEPEMLRELARETLGLCTRSAPQDKGLGGEVPRGR LRPVDLCEGAREPPDGAPGCGDPLERYLDSQDPRRRSSTTHRPPPSRVRTLSPQRVSPTP SPCPGSLVPARERRRSGP >gi568815589f:38295749_38497299|GENSCAN_predicted_CDS_8|417_bp nggtggctggcgctgcgcttcttcgtggcccgcgacagtgcggccaccttcatcggccag tggcgcactgatctgggcgagcccgaaatgctccgcgagctggcgagggagacactgggc ctctgcacccggtcagctccccaagataaaggcctcggaggggaagttccgcgtgggaga ctccggcctgttgatctctgtgagggtgcccgggagccgcctgacggcgcgcctgggtgt ggcgacccgctggagcgctacctggacagtcaggacccgcgccgccgctcctccaccacc caccgcccgcccccgtcgagggtccgcaccctgtccccgcagagggtgtcgcccacccct agcccctgccctggtagcctggtccccgcgagagagcgccggcgctccggaccctag