GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:47:15 Sequence gi568815596f:159629023_159867339 : 238317 bp : 37.97% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 330 369 40 -3.15 1.01 Init + 9356 9509 154 1 1 90 111 111 0.780 13.79 1.02 Term + 35942 36831 890 0 2 47 33 246 0.076 6.63 1.03 PlyA + 38071 38076 6 1.05 2.03 PlyA - 38278 38273 6 1.05 2.02 Term - 58372 58353 20 2 2 114 42 14 0.333 -3.10 2.01 Init - 83520 83292 229 0 1 97 107 188 0.929 18.67 2.00 Prom - 96509 96470 40 -2.65 3.00 Prom + 97854 97893 40 -5.95 3.01 Init + 102059 102148 90 1 0 82 64 67 0.344 2.24 3.02 Intr + 108892 109059 168 0 0 58 63 66 0.266 0.32 3.03 Intr + 109172 109336 165 1 0 11 101 146 0.689 7.44 3.04 Intr + 114039 114231 193 2 1 43 110 80 0.644 3.84 3.05 Intr + 118783 119881 1099 2 1 51 84 421 0.600 26.06 3.06 Intr + 123380 123549 170 2 2 56 99 120 0.963 8.57 3.07 Intr + 130204 130336 133 2 1 72 111 102 0.820 9.68 3.08 Term + 133317 133380 64 0 1 103 42 10 0.362 -5.72 3.09 PlyA + 133665 133670 6 1.05 4.06 PlyA - 134082 134077 6 1.05 4.05 Term - 143031 142829 203 1 2 119 38 88 0.911 3.47 4.04 Intr - 151156 150983 174 2 0 75 94 148 0.921 13.09 4.03 Intr - 151976 151860 117 0 0 59 115 64 0.193 5.72 4.02 Intr - 154447 154337 111 2 0 93 53 59 0.164 2.33 4.01 Init - 169176 169110 67 0 1 87 102 273 0.880 27.89 4.00 Prom - 171767 171728 40 -7.45 5.07 PlyA - 171871 171866 6 1.05 5.06 Term - 176200 176022 179 2 2 102 32 77 0.982 0.37 5.05 Intr - 178118 177951 168 0 0 69 106 81 0.988 7.00 5.04 Intr - 179549 179427 123 0 0 81 111 68 0.990 8.04 5.03 Intr - 181653 181504 150 1 0 45 101 77 0.937 3.91 5.02 Intr - 186551 186383 169 2 1 71 64 89 0.971 3.40 5.01 Init - 187903 187784 120 1 0 52 95 85 0.612 5.84 5.00 Prom - 190328 190289 40 -5.55 6.00 Prom + 191094 191133 40 -4.75 6.01 Init + 193961 194184 224 1 2 88 72 177 0.380 14.18 6.02 Intr + 194434 194681 248 1 2 -48 77 180 0.041 -0.52 6.03 Term + 197065 197933 869 2 2 73 37 219 0.121 7.14 6.04 PlyA + 199081 199086 6 1.05 7.12 PlyA - 199785 199780 6 1.05 7.11 Term - 202344 202127 218 1 2 17 49 101 0.760 -4.58 7.10 Intr - 202764 202648 117 1 0 90 100 62 0.922 7.12 7.09 Intr - 206623 206458 166 1 1 112 47 118 0.527 8.71 7.08 Intr - 211933 211707 227 2 2 54 95 173 0.590 11.38 7.07 Intr - 213352 213223 130 1 1 51 92 71 0.585 3.15 7.06 Intr - 221118 220958 161 1 2 66 101 63 0.467 4.19 7.05 Intr - 221445 221340 106 0 1 93 79 35 0.206 1.97 7.04 Intr - 225513 225374 140 1 2 125 -28 141 0.098 5.46 7.03 Intr - 225917 225882 36 0 0 106 109 46 0.741 5.82 7.02 Intr - 229454 229340 115 2 1 97 115 -10 0.685 1.70 7.01 Intr - 231867 231799 69 0 0 131 86 72 0.849 9.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 225513 225329 185 1 2 125 42 144 0.814 10.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_1|347_aa MTSPNELSKTPGNNPGEKEIYDLSDKEFRIAVVRKLQRIQDNTDKSFRILSAQNLLKLIS NFSKVSGYKINVQKSQAFLYTNNRQTESQIMGELPFTIASKRIKYLGIQLTRDVKDLFKE NYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKT TLKFIWNQKRAHITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTE PSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRW IKDLNVRPKTIKTLEENLGITIQDIGVSNHFLRWIPFYTSTISVCKF >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_1|1044_bp atgacctcaccaaatgaactaagtaagacaccagggaacaatcctggagaaaaagagata tatgacctttcagacaaagaattcagaatagctgttgtgaggaagcttcaaagaattcaa gataacacagataagtcattcagaattctatcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaacaacagacaaacagagagccaaatcatgggtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggag aactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttac agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacatcaccaagtcaatcctaagccaa aagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataacgccgcatatctacaactatctgatctttgacaaacctgagaaaaac aagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatgg attaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcatt accattcaggacataggcgtgtcaaaccattttctaaggtggattccattttacacgtcc accatcagtgtttgcaaattctag >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_2|82_aa MAPAPLPDPAQNQPPPLSAPPCSSSPHFADFLASRESGGAAGTESCGRAGGTPPRGGALT AGTVEELPPGSQRGATEVWGKD >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_2|249_bp atggcgcccgcccctctcccggatccggcgcagaaccagccaccaccgttaagtgcccct ccctgctcctcctccccccacttcgcggacttcctggcttctcgcgagagtggcggcgcg gcggggaccgagagctgcgggcgggctggaggtaccccaccgcggggtggtgctctgact gcgggaaccgttgaagagctgcccccggggagtcagcgaggcgccactgaggtctgggga aaggattga >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_3|693_aa MGFRYVAQAGLTLLGSTILPTFPPRVLGLQLEASVAGGAFAQVLLGPAGCVLPTRTGRLC STRAISSDPTPAKGEPDMDRQGVHGQAAASVGTEERSGTQKLGDTRNYRAPKRLSQPWFR ELLGLDSPKGQSSSVLVAGSMSTSASASASPFQSAWYSESEITQGARSRSQNQQRDHDSK RPKLSCTNCTTSAGRNVGNGLNTLSDVQDRVPSYSQGARPKENSMSTLQLNTSSTNHQLP SEHQTILSSRDSRNSLRSNFSSRESESSRSNTQPGFSYSSSRDEAPIISNSERVVSSQRP FQESSDNEGRRTTRRLLSRIASSMSSTFFSRRSSQDSLNTRSLNSENSYVSPRILTASQS RSNVPSASEVPDNRASEASQGFRFLRRRWGLSSLSHNHSSESDSENFNQESEGRNTGPWL SSSLRNRCTPLFSRRRREGRDESSRIPTSDTSSRSHIFRRESNEVVHLEAQNDPLGAAAN RPQASAASSSATTGGSTSDSAQGGRNTGISGILPGSLFRFAVPPALGSNLTDNVMITVDI IPSGWNSADGKSDKTKSAPSRDPERLQKIKESLLLEDSEEEEGDLCRICQMAAASSSNLL IEPCKCTGSLQYVHQDCMKKWLQAKINSGSSLEAVTTCELCKEKLELNLEDFDIHELHRA HANEQVSIFCLIWLTQQNSFELKNCFYQLQISV >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_3|2082_bp atggggtttcgctatgttgcccaggcaggtctcacactcctgggctcaaccattctgccc accttccctcccagagtgctgggattacagctagaagcctctgtggctggtggtgccttt gcccaggttttgctcgggcctgctgggtgtgttctgcctactcggactggcaggctttgt tcgactcgtgctatcagctcggatcccacgcctgccaagggtgagccagacatggatcgg cagggggtgcatgggcaagcagcagcttctgtgggcactgaggaacgcagtggcacccaa aagcttggagacacaaggaactacagagccccaaagaggttgtcacagccctggttcagg gagctcctaggtctggactccccaaagggtcagagctcttccgtccttgtcgccggcagc atgtctacatcagcatcagcatctgcgtcaccatttcaatctgcatggtatagtgaatct gagataactcagggagcacgctcaagatcgcagaaccagcaacgggatcatgattcaaaa agacctaaactttcctgtacaaactgtactacctcagctgggagaaatgttggaaatggt ttaaacacattatcagatgttcaagacagagttccttcatattcacaaggagcaagacca aaagaaaactcaatgagcactttacagttgaatacatcatccacaaaccaccaattgcct tctgaacatcagaccatactaagttctagggactccagaaattctttaagatcaaatttt tcttcaagagaatcagaatcttcccgaagcaatacgcagcctggattttcttacagttca agtagagatgaagccccaatcataagcaattcagaaagggttgtttcatctcaaagacca tttcaagaatcttctgacaatgaaggtaggcggacaacgaggagattgctgtcacgcata gcttctagcatgtcatctacttttttttcacgaagatctagtcaggattccttgaataca agatcattgaattctgaaaattcttacgtttctccaagaatcttgacagcttcacagtcc cgtagtaatgtaccatcagcttctgaagttcccgataatagggcatctgaagcttctcag ggatttcgatttcttaggcgaagatggggtttgtcatctcttagccacaatcatagctct gagtcagattcagaaaattttaaccaagaatctgaaggtagaaatacaggaccatggtta tcttcctcacttagaaatagatgcacacctttgttctctagaaggaggcgagagggaaga gatgaatcttcaaggatacctacctctgatacatcatctagatctcatatttttagaaga gaatcaaatgaagtggttcaccttgaagcacagaatgatcctcttggagctgctgccaac agaccacaagcatctgcagcatcaagcagtgccacaacaggtggctctacatcagattcg gctcaaggtggaagaaatacaggaatatcagggattcttcctggttccttattccggttt gcagtccctccagcacttgggagtaatttgaccgacaatgtcatgatcacagtagatatt attccttcaggttggaattcagctgatggtaaaagtgataaaactaaaagtgcgccttca agagatccagaaagattgcagaaaataaaagagagcctccttttagaggactcagaagaa gaagaaggtgacttatgtagaatttgtcaaatggcagctgcatcatcatctaatttgctg atagagccatgcaagtgcacaggaagtttgcagtatgtccaccaagactgtatgaaaaag tggttacaggccaaaattaactctggttcttcattagaagctgtaaccacctgtgaacta tgtaaagagaagttggagcttaacctggaggattttgatattcatgaactacatagagct catgcaaatgaacaagttagtatattttgcctaatttggttaacacaacagaacagtttt gaattgaagaactgtttttaccaactacaaatttctgtgtga >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_4|223_aa MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTANNHILISALVIASTVILTVLGAII WFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_4|672_bp atgctccgggccgcgctgcccgcgctcctgctgccgttgctgggcctcgccgctgctgcc gtcgcggactgtccttcatctacttggattcagttccaagacagttgttacatttttctc caagaagccatcaaagtagaaagcatagaggatgtcagaaatcagtgtactgaccatgga gcggacatgataagcatacataatgaagaagaaaatgcttttatactggatactttgaaa aagcaatggaaaggcccagatgatatcctactaggcatgttttatgacacagatgatgcg agtttcaagtggtttgataattcaaatatgacatttgataagtggacagaccaagatgat gatgaggatttagttgacacctgtgcttttctgcacatcaagacaggtgaatggaaaaaa ggaaattgtgaagtttcttctgtggaaggaacactatgcaaaacagctaataaccacatt ttaatatcagcattggtgattgctagcacggtaattttgacagttttgggagcaatcatt tggttcctgtacaaaaaacattctgattctcgtttcaccacagttttttcaaccgcaccc caatcaccttataatgaagactgtgttttggtagttggagaagaaaatgaatatcctgtt caatttgactaa >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_5|302_aa MCSQSGGHLASVHNQNGQLFLEDIVKRDGFPLWVGLSSHDGSESSFEWSDGSTFDYIPWK GQTSPGNCVLLDPKGTWKHEKCNSVKDGAICYKPTKSKKLSRLTYSSRCPAAKENGSRWI QYKGHCYKSDQALHSFSEAKKLCSKHDHSATIVSIKDEDENKFVSRLMRENNNITMRVWL GLSQHSVDQSWSWLDGSEVTFVKWENKSKSGVGRCSMLIASNETWKKVECEHGFGRVVCK VPLGPDYTAIAIIVATLSILVLMGGLIWFLFQRHRLHLAGFSSVRYAQGVNEDEIMLPSF HD >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_5|909_bp atgtgttctcaaagtggaggtcacttggcaagcgttcacaaccaaaatggccagctcttt ctggaagatattgtaaaacgtgatggatttccactatgggttgggctctcaagtcatgat ggaagtgaatcaagttttgaatggtctgatggtagtacatttgactatatcccatggaaa ggccaaacatctcctggaaattgtgttctcttggatccaaaaggaacttggaaacatgaa aaatgcaactctgttaaggatggtgctatttgttataaacctacaaaatctaaaaagctg tcccgtcttacatattcatcaagatgtccagcagcaaaagagaatgggtcacggtggatc cagtacaagggtcactgttacaagtctgatcaggcattgcacagtttttcagaggccaaa aaattgtgttcaaaacatgatcactctgcaactatcgtttccataaaagatgaagatgag aataaatttgtgagcagactgatgagggaaaataataacattaccatgagagtttggctt ggattatctcaacattctgttgaccagtcttggagttggttagatggatcagaagtgaca tttgtcaaatgggaaaataaaagtaagagtggtgttggaagatgtagcatgttgatagct tcaaatgaaacttggaaaaaagttgaatgtgaacatggttttggaagagttgtctgcaaa gtgcctctgggccctgattacacagcaatagctatcatagttgccacactaagtatctta gttctcatgggcggactgatttggttcctcttccaaaggcaccgtttgcacctggcgggt ttctcatcagttcgatatgcacaaggagtgaatgaagatgagattatgcttccttctttc catgactaa >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_6|446_aa MGRNQCKKAENSKNQNTSSPPKDHNSSPASEQNWMEKEFDELTEVGFRRWVITNCSELKE HVLTQCKEAKNLEKRPNLRLTGVPESDEENETKLENTLQDIIWENFTNLARQANTEIQKT QRTTQRYSSRGATPKHIIIKFTKVEMKEKMLRTAREKVLEVLARAIWQEKEMKGIQMGRG EVKLSLFADDMIVYLESPITSAQILLKLIRNLSKVSRYKINVQKSQAFLYTNNRQIESQI MNELPFTTVTKRIKYLGIQLTWDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWIGRINIV KMAILPKVIYRFNAIPMKLPLTFFTELEKTTLNFIWNQKRARIAKTILDKKNKDGGIMLP DFKLYYEATVTKTSWYWYQNRDIDQWNRTEASEITPHIYNHLIFDKSDTNKHWVKYSLFN KWCWENWLAICRKLKLDPPSSHLIQK >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_6|1341_bp atggggagaaaccagtgcaaaaaggctgaaaattccaaaaaccagaacacttcttctcct ccaaaggatcacaactcttcaccagctagcgaacaaaactggatggagaaagagtttgat gaattgacagaagtgggcttcagaaggtgggtaattacaaactgctccgagctaaaggag catgttctaacccaatgcaaggaagctaagaaccttgaaaaaagaccaaatctacgtttg actggtgtacctgaaagtgatgaggagaatgaaaccaagttggaaaacacacttcaggat attatctgggagaacttcaccaacctagcaagacaggccaacactgaaattcagaagaca cagagaacaacccaaagatactcctcgagaggagcaaccccaaaacacataatcatcaaa tttaccaaggttgaaatgaaggaaaaaatgttaagaacagccagagagaaagtattggaa gttctggccagggcaatctggcaagagaaagaaatgaagggcattcaaatgggaagaggg gaagtcaaattgtctctgtttgcagatgacatgattgtatatttagaaagccccatcacc tcagcccaaattctccttaagctgataaggaacctcagcaaagtctcaagatacaaaatc aatgtgcaaaaatcacaagcattcctgtacaccaataacagacaaatagagagccaaatc atgaatgaactcccattcacaactgttacaaagagaataaaatacctaggaatacaactt acctgggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggacacaaacaaatggaaaaacattccatgctcatggataggtagaatcaatatcgtg aaaatggccatactgcccaaagtaatttatagattcaatgctatccccatgaagctacca ctgactttcttcacagaactggaaaaaactactttaaacttcatatggaaccaaaaaaga gcccggatagccaagacaatcctggacaagaagaacaaagatggaggcatcatgctacct gacttcaaactatactatgaggctacagtaaccaaaacatcatggtactggtaccaaaac agagatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaac catctgatctttgacaaatctgacacaaacaagcactgggtaaaatattccctatttaat aaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggaccccccttcc tcacaccttatacaaaaataa >gi568815596f:159629023_159867339|GENSCAN_predicted_peptide_7|494_aa VSTIIMPNEFQQDYDIRDCAAVKVFHRPWRRGWHFYDDREFIYLRPFACDTKLEWVCQIP KGRTPKTPDWYNPDRAGIHGPPLIIEGSEYWFVADLHLNYEEAVLYCASNHSFLATITSF CFLKIKPVSLTFSQASDTCHSYGGTLPSVLSQIEQDFITSLLPDMEATLWIGLRWTAYEK INKWTDNRELTYSNFHPLLVSGRLRIPENFFEEESRYHCALILNLQKSPFTGTWNFTSCS ERHFVSLCQKYSEVKSRQTLQNASETVKYLNNLYKIIPKTLTWHSAKRECLKSNMQLVSI TDPYQQAFLSVQALLHNSSLWIGLFSQDDELNFGWSDGKRLHFSRWAETNGQLEDCVVLD TDGFWKTVDCNDNQPGAICYYSGNPKSHILSIRDEKENNFVLEQLLYFNYMASWVMLGIT YRSFTESMAGQPQETYNHVRGEWETSRFYYGGAGERVKEEVLHTFEQPELIRTHSLSREQ QGGKSPPTRPLLQD >gi568815596f:159629023_159867339|GENSCAN_predicted_CDS_7|1485_bp gtgtctactattatcatgccaaatgagtttcagcaggattatgacatcagagactgtgct gctgtcaaggtatttcataggccatggcgaagaggctggcatttctatgatgatagagaa tttatttatttgaggccttttgcttgtgatacaaaacttgaatgggtgtgccaaattcca aaaggccgtactccaaaaacaccagactggtacaatccagaccgtgctggaattcatgga cctccacttataattgaaggaagtgaatattggtttgttgctgatcttcacctaaactat gaagaagccgtcctgtactgtgccagcaatcacagctttcttgcaactataacatctttt tgttttctaaagatcaaacccgtgtctctcacattttctcaagcaagcgatacctgtcac tcctatggtggcacccttccttcagtgttgagccagattgaacaagactttattacatcc ttgcttccggatatggaagctactttatggattggtttgcgctggactgcctatgaaaag ataaacaaatggacagataacagagagctgacgtacagtaactttcacccattattggtt agtgggaggctgagaataccagaaaatttttttgaggaagagtctcgctaccactgtgcc ctaatactcaacctccaaaaatcaccgtttactgggacgtggaattttacatcctgcagt gaacgccactttgtgtctctctgtcagaaatattcagaagttaaaagcagacagacgttg cagaatgcttcagaaactgtaaagtatctaaataatctgtacaaaataatcccaaagact ctgacttggcacagtgctaaaagggagtgtctgaaaagtaacatgcagctggtgagcatc acggacccttaccagcaggcattcctcagtgtgcaggcgctccttcacaactcttcctta tggatcggactcttcagtcaagatgatgaactcaactttggttggtcagatgggaaacgt cttcattttagtcgctgggctgaaactaatgggcaactcgaagactgtgtagtattagac actgatggattctggaaaacagttgattgcaatgacaatcaaccaggtgctatttgctac tattcaggaaatccaaaatcacatattctgagtattcgagatgaaaaggagaataacttt gttcttgagcaactgctgtacttcaattatatggcttcatgggtcatgttaggaataact tatagaagctttacagaaagcatggctgggcagcctcaggaaacttacaatcatgtcaga ggcgaatgggaaacaagcaggttttactatggtggagcaggagagagagtgaaggaggag gtgctacacacttttgaacaaccagaactcattagaactcactcactatcacgagaacag caagggggaaaatcacctcccaccaggcccctcctccaagactga