GENSCAN 1.0	Date run: 6-Nov-116	Time: 17:38:28

Sequence gi568815595f:179469822_179688007 : 218186 bp : 41.34% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    386    589  204  1  0   63   47   178 0.974   7.59
 1.02 PlyA +   2017   2022    6                               1.05

 2.09 PlyA -   2259   2254    6                               1.05
 2.08 Term -  21878  21662  217  1  1  106   38   143 0.859   6.73
 2.07 Intr -  22711  22510  202  2  1   48   23   201 0.526   7.02
 2.06 Intr -  35445  35284  162  1  0  108   71    50 0.227   4.33
 2.05 Intr -  38678  38523  156  0  0   71   54   113 0.125   5.26
 2.04 Intr -  52996  52621  376  1  1   80   30   199 0.042   6.96
 2.03 Intr -  71900  71819   82  1  1   69   52   100 0.011   3.22
 2.02 Intr -  77145  76749  397  1  1   77   39   200 0.053   6.71
 2.01 Init -  77201  77186   16  2  1   95   68    15 0.591   0.68
 2.00 Prom -  88601  88562   40                              -6.65

 3.00 Prom +  88687  88726   40                              -5.25
 3.01 Init +  93272  93296   25  1  1   97  105    57 0.293   8.06
 3.02 Intr + 100003 100079   77  2  2   89  115    89 0.945  10.02
 3.03 Intr + 100246 100420  175  0  1   84   51   151 0.996   9.59
 3.04 Intr + 105582 105648   67  1  1   80   15    97 0.716  -1.36
 3.05 Intr + 106396 106490   95  1  2   72   94   112 0.721   8.89
 3.06 Intr + 106799 106905  107  0  2  100  106    34 0.983   5.41
 3.07 Intr + 110819 110880   62  1  2   91   45    79 0.616   0.51
 3.08 Intr + 111073 111187  115  1  1   59   70   121 0.992   6.93
 3.09 Intr + 111319 111399   81  0  0  103  100    48 0.957   6.42
 3.10 Intr + 113532 113627   96  2  0   87  100    33 0.856   3.69
 3.11 Intr + 120660 120772  113  2  2   22  100   120 0.183   4.86
 3.12 Term + 121429 121501   73  1  1   53   42    69 0.150  -4.80
 3.13 PlyA + 121961 121966    6                               1.05

 4.07 PlyA - 122804 122799    6                               1.05
 4.06 Term - 124074 123910  165  0  0  112   54   134 0.597   9.33
 4.05 Intr - 128950 128854   97  0  1   54  116    92 0.108   7.69
 4.04 Intr - 134849 134706  144  1  0  -41  110   159 0.054   3.68
 4.03 Intr - 135130 134900  231  0  0   93   50   199 0.092  12.57
 4.02 Intr - 136393 136265  129  0  0   72   61    69 0.077   1.49
 4.01 Init - 143284 143217   68  1  2   99  115   -32 0.068   1.20
 4.00 Prom - 144232 144193   40                              -7.25

 5.00 Prom + 144381 144420   40                              -2.25
 5.01 Init + 148660 148700   41  0  2   85   70    96 0.325   7.21
 5.02 Term + 154099 154219  121  1  1   86   47   159 0.994   8.57
 5.03 PlyA + 154263 154268    6                               1.05

 6.03 PlyA - 154481 154476    6                               1.05
 6.02 Term - 171041 170922  120  2  0  108   31    72 0.707   0.99
 6.01 Init - 171759 171700   60  0  0   39   81    56 0.597   1.30
 6.00 Prom - 178384 178345   40                              -2.25

 7.03 PlyA - 179017 179012    6                               1.05
 7.02 Term - 179922 179641  282  0  0   50   48   286 0.954  15.24
 7.01 Init - 180956 180924   33  2  0   84   93    15 0.976   1.54
 7.00 Prom - 181443 181404   40                             -13.49

 8.00 Prom + 181637 181676   40                              -7.35
 8.01 Init + 183405 183572  168  2  0   86  105   218 0.986  20.88
 8.02 Intr + 183686 183929  244  1  1   65   77   130 0.938   5.75
 8.03 Intr + 188514 188635  122  1  2  102   87    10 0.227   1.59
 8.04 Intr + 212057 212182  126  1  0  123   78    62 0.010   8.66

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 128925 128854   72  0  0   70  116    90 0.889  11.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_1|67_aa
SILLQLIKRKKLTVKQPQPGLSGGIPEESMVIIGSDSSLCVIAPEAFQYRWQDVEEGDSD
IDDPNPV

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_1|204_bp
agtatactcctacaacttataaaaagaaaaaagttaactgtaaaacagccccagccaggt
ctttcaggaggtattccagaagaaagcatggttatcataggaagtgacagctccttgtgt
gttattgcccctgaagccttccagtatagatggcaagatgtggaggagggagacagtgat
attgatgaccccaaccctgtgtag

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_2|535_aa
MGLNQGPRSLQSADGKASQACVHPLRAASSPGPWAGPEMLSRSQGLEPEILELYLVLYCT
AAELVQKPQDKVLPILPSPFPRWRNFSPCPPPPQAHRKYFQGVADIHLRPKSSSVSLWWM
LPGMGLTAVDSGLSSGPELSFTFCHEWKQPDTLTRSQAVAGAMLFAGPYIIPDTTPDPHD
CISLIHLTFTPFPHISFFSVPHPDHTWFIDGNSTRPNRHSPAKAGYAIVSSTSIIEAIAL
PRSTISQQAELIALTQALTLAKGVRVNIYTDCKYAFFHSRPCEETTKQALFSFRKCKQIA
IYFNPISILYSACEAERSWEPHAAPAAASPVRGLLCPCVIHGALNQIYVWSFAMRGSAFL
YLFYIFLQLLCLLMPYWYRPVTGAGSFSPRLLLFTAGYVTQWQLRSQGGTRAARALGKRH
SSSMTLGVVEHTGKSGTGTRVWHAAQQQLQPHGWCSITVAQILGLKHVWDGYNSILDQED
KGPNLEMVEEKASVSLGSRGWYGLYTSSGMLPYGLLQERKITVTNFCVFVIPGTT

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_2|1608_bp
atgggcctcaaccaaggccccaggtctctgcaatcagcagatggcaaagccagccaggct
tgtgtccatcccttaagggcagcaagttcccctgggccctgggcaggtccagagatgttg
tccaggagccagggcctagagccagaaatcttagaactctacctggtgctctattgtact
gcagctgagctggtacagaaaccacaagacaaagtccttcctattcttccctcccctttc
cccaggtggaggaatttctccccatgtccaccaccaccacaggcccacaggaagtacttc
cagggtgtagctgatattcacttaaggcccaagagctcttcagttagcttgtggtggatg
ctgccaggcatgggactcaccgctgtggacagtgggctgtcctctggcccagaactctcc
ttcaccttctgccatgagtggaagcagcctgataccctcaccagaagccaagcagttgct
ggtgccatgctttttgctggcccctacattattcctgataccacacctgacccccatgac
tgtatctctctgatccacctgacattcactccatttccccatatttccttcttttctgtt
cctcaccctgatcacacttggtttattgatggcaattccaccaggcctaatcgccactca
ccagcaaaggcaggctatgctatagtatcttccacatctattattgaggctattgctctt
ccccgctccactatctctcagcaagccgaactcattgccttaactcaggccctcactctt
gcaaaaggagtacgtgtcaatatttatactgactgtaaatatgccttctttcactcgcgt
ccgtgcgaagagaccaccaaacaggctttgttttcttttcgcaaatgtaagcagatagct
atctattttaaccctatcagcatcttgtacagcgcttgcgaggctgagagatcatgggag
cctcatgctgccccagctgctgcctctccagtgaggggtctgctgtgcccatgtgtcatc
catggtgctctcaatcagatctacgtttggagctttgcgatgagagggtctgctttcctc
tacttgttctacatcttcctccagttgctctgtttactgatgccgtattggtataggcct
gtaacaggtgctggctccttctctcctagactgttgctgttcactgcagggtatgtgaca
cagtggcaactcaggtctcaaggaggtaccagggctgctcgggccctggggaaacggcac
agcagcagcatgactctgggggtggtggagcatactggcaagtctggcactggcactagg
gtttggcatgcagcacagcagcagctccagcctcatgggtggtgtagtatcacagtagct
cagatcttgggcttgaaacatgtctgggatggctacaactccattttggaccaggaagac
aagggacctaacttagaaatggtggaggagaaagcttcagtcagcttggggtccagagga
tggtatggcctctataccagctccggaatgcttccatatggacttttacaagaaagaaaa
ataactgtcactaatttttgtgtctttgtcattcctgggacaacctaa

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_3|361_aa
MSGGVYGGDEVGALVFDIGSYTVRAGYAGEDCPKVDFPTAIGMVVERDDGSTLMEIDGDK
GKQGGPTYYIDTNALRVPRENMEAISPLKNGMALFIIYFTKQDNLQGKGSFGAPLFANGR
STGLILDSGATHTTAIPVHDGYVLQQGIVKSPLAGDFITMQCRELFQEMNIELVPPYMIA
SKCVIQDFQASVLQVSDSTYDEQVAAQMPTVHYEFPNGYNCDFGAERLKIPEGLFDPSNV
KGLSGNTMLGVSHVVTTSVGMCDIDIRPGLYGSVIVAGGNTLIQSFTDRLNRELSQKTPP
VLNVNSKDLKAVATEEATVENKQGTLRKQGKSSDFDESTFSQEPFGSSALSLSGATAMGV
G

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_3|1086_bp
atgagcggcggcgtgtacgggggagatgaagttggagcccttgtttttgacattggatcc
tatactgtgagagctggttatgctggtgaggactgccccaaggtggattttcctacagct
attggtatggtggtagaaagagatgacggaagcacattaatggaaatagatggcgataaa
ggcaaacaaggcggtcccacctactacatagatactaatgctctgcgtgttccgagggag
aatatggaggccatttcacctctaaaaaatgggatggctttattcatcatctacttcact
aaacaggataaccttcagggaaaaggcagttttggtgcaccattatttgctaatggtcgt
tctactgggctgattttggacagtggagccactcataccactgcaattccagtccacgat
ggctatgtccttcaacaaggcattgtgaaatcccctcttgctggagactttattactatg
cagtgcagagaactcttccaagaaatgaatattgaattggttcctccatatatgattgca
tcaaaatgtgttatccaggattttcaagcttcggtacttcaagtgtcagattcaacttat
gatgaacaagtggctgcacagatgccaactgttcattatgaattccccaatggctacaat
tgtgattttggtgcagagcggctaaagattccagaaggattatttgacccttccaatgta
aaggggttatcaggaaacacaatgttaggagtcagtcatgttgtcaccacaagtgttggg
atgtgtgatattgacatcagaccaggtctctatggcagtgtaatagtggcaggaggaaac
acactaatacagagttttactgacaggttgaatagagagctgtctcagaaaactcctcca
gtcttaaatgtgaatagcaaagatttaaaagctgtagcaacagaagaagcaacagtggaa
aataagcaaggaacacttaggaaacaaggcaagagttctgacttcgatgaaagcaccttt
agccaggagcccttcggttcctctgccctgtctctctctggagccacagcaatgggagtt
ggatag

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_4|277_aa
MGSQKSLYTHYKRLNLKGFNIIRVSEIIKYTEYQAFDLCYDYSHRHDATHCLQGGPNSPI
KTVLVRPAPTSSLRKATSEEAPKSEASAKGPARQSCHRSNRNPPQQTHGRHGYYGQEEEG
RRVTEAERANLPMRTKAGCGHLNDAIVGGNAFCQLCENMAAAGLALLCRRVSSALKSSRS
LITPQVPACTGYVLLKERNMLLTLEQEAKRQRLPMPSPERLDKVVDSMDALDKVVQERED
ALRLLQTGQERARPGAWRRDIFGRIIWYVVNTVALLR

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_4|834_bp
atggggtcccaaaagagtctttacacacattacaaaaggctaaatctaaaaggattcaac
ataataagagtctctgagataataaagtacactgaatatcaagcatttgatctttgctat
gattattcacaccgccatgatgcaactcactgtctccaaggaggtccaaatagccctatt
aagactgtattggtcaggccagcacccaccagcagccttcggaaagccacgagtgaggaa
gcccccaaatccgaggcgagtgccaaggggccggccagacagagctgccaccgcagtaac
cgaaacccgccgcaacaaactcatggccgccatggctactacgggcaggaggaagaaggg
aggcgggtcacggaagctgagagagccaatttaccaatgaggacaaaggcaggatgtggc
catcttaatgacgctatcgttggcggaaacgcgttttgccagttatgcgaaaacatggct
gcggccggtttggcccttctttgtaggagagtttcatccgccctgaaatcttcccgatcg
ttaataactcctcaggtccctgcctgcacagggtatgtcttactgaaagaaagaaacatg
cttctaaccctagagcaggaggccaagcggcagagattgccaatgccaagtccagagcgg
ttagataaggtagtagattccatggatgcattagataaagttgtccaggaaagagaagat
gccctaaggcttcttcagactggtcaagaaagagctagacctggtgcttggagaagagac
atctttggaagaatcatctggtatgtagttaacactgtggctcttctaagatga

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_5|53_aa
MAVLQIEAEKAELRVKELEVRKLMHVRGDGPWYYYETIDKELIDHSPKATPDN

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_5|162_bp
atggccgtccttcagattgaagctgaaaaggctgaattacgggtaaaggagctggaagtg
cgaaaattgatgcatgtgagaggagatggaccctggtattactatgagacaattgacaag
gaacttattgatcattctccgaaagcaactcctgacaattaa

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_6|59_aa
MAEGKVGAGILHAKVGARNKAYLWAKYFIHISIIKGYSSNLDSWKQKTHDTSRAQEIES

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_6|180_bp
atggcagaaggcaaagtgggagcaggcatcttacatgcaaaagtgggagcaagaaacaag
gcttacctatgggccaagtattttatccacatcagtatcatcaaaggttactcttccaat
ctggattcttggaaacagaagacacatgataccagcagagcccaggagatcgaaagctaa

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_7|104_aa
MIVKPPQPHGTTWYCGTREIRRYQKSTELLIRKLPFQCLVREIAQDFKTDMHFQSAAVGA
LQEASEAYLVGLFEDTNLCAIHAEHVTIMPKDIQLTHHIPEECA

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_7|315_bp
atgattgtgaagcctccccagccacatggaactacctggtactgtggcactcgtgaaatt
agacgttatcagaagtccactgaacttctgattcgcaaacttcccttccagtgtctggtg
cgagaaattgctcaggactttaaaacagatatgcacttccagagtgcagctgttggtgct
ttgcaggaggcaagtgaggcctacctggttggtctttttgaagacaccaacctgtgtgct
atccatgccgaacatgtaacaattatgccaaaagacatccagctaacacaccacatacct
gaagaatgtgcttaa

>gi568815595f:179469822_179688007|GENSCAN_predicted_peptide_8|220_aa
MQRRGALFGMPGGSGGRKMAAGDIGELLVPHMPTIRVPRSGDRVYKNECAFSYDSPSWLR
NTAVRQTLSERPRAAAAEDWLVLVVLLRQPPQAGRARFPAACTQPRRRLKIDEIQEFPVP
NCTLQIVCVLRGAERRADFILSSECLHYVESVSCRTAVLVCPHSWLSCANSAPLPTCINS
EGGLYVCMNTFLAFGREHVERHFRKTGQSVYMHLKRHVRE

>gi568815595f:179469822_179688007|GENSCAN_predicted_CDS_8|660_bp
atgcagcgccggggcgccctgttcggcatgccgggcggcagcggaggcaggaagatggct
gcaggagacatcggcgagctgctagtgccccacatgcccacgatccgcgtgcccaggtcc
ggcgacagggtctacaagaacgagtgcgccttctcctacgactctcccagttggctcagg
aacactgcagttcggcagacacttagtgagcgccccagggctgctgcagccgaggactgg
ctcgtgctggtggttttgctccgccagcctccccaggctggaagggcccgattcccagca
gcttgcacacagccccgccgccgtttaaagatagatgaaatacaagagttccctgttccg
aactgcacgttgcagatcgtttgcgtcctccgcggggcagagcgcagggcggacttcatc
ctttcctctgaatgcctgcattacgtagagtctgtgtcatgccgcacagcggttctagtc
tgtcctcatagttggctctcttgtgctaattctgcccccctccccacctgcattaattct
gaaggtggactctatgtatgcatgaatacatttttggcctttggaagggaacatgttgaa
agacattttcgaaaaactggacagagtgtatacatgcacctgaaaagacatgtgcgagag