GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:55:25 Sequence gi568815594r:163428751_163713504 : 284754 bp : 36.77% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 911 987 77 1 2 90 88 51 0.566 5.91 1.02 Term + 34289 34607 319 2 1 -2 39 235 0.167 2.97 1.03 PlyA + 35340 35345 6 1.05 2.02 PlyA - 36399 36394 6 1.05 2.01 Sngl - 44981 43104 1878 2 0 80 37 1927 0.605 180.49 2.00 Prom - 54087 54048 40 -6.55 3.10 PlyA - 54569 54564 6 1.05 3.09 Term - 65986 65835 152 2 2 26 43 178 0.989 4.19 3.08 Intr - 66146 66006 141 0 0 112 89 197 0.999 21.60 3.07 Intr - 69499 69387 113 0 2 46 97 70 0.444 2.80 3.06 Intr - 93339 93231 109 1 1 57 94 80 0.122 3.92 3.05 Intr - 94430 94206 225 0 0 81 87 89 0.030 5.13 3.04 Intr - 100296 100054 243 1 0 84 56 129 0.044 5.85 3.03 Intr - 108357 108286 72 1 0 60 98 79 0.478 4.56 3.02 Intr - 115833 115686 148 0 1 76 50 175 0.873 11.39 3.01 Init - 129747 129613 135 0 0 73 60 84 0.304 4.29 3.00 Prom - 145047 145008 40 -3.65 4.05 PlyA - 146236 146231 6 1.05 4.04 Term - 155771 155673 99 2 0 75 36 94 0.955 0.05 4.03 Intr - 157179 156999 181 2 1 86 87 180 0.982 16.65 4.02 Intr - 184288 183521 768 0 0 72 82 409 0.262 28.36 4.01 Init - 184691 184564 128 2 2 68 111 21 0.302 2.08 4.00 Prom - 192850 192811 40 -4.05 5.00 Prom + 200451 200490 40 -5.95 5.01 Sngl + 204749 205405 657 1 0 71 43 374 0.931 27.12 5.02 PlyA + 205633 205638 6 1.05 6.00 Prom + 205802 205841 40 -6.15 6.01 Init + 206749 207924 1176 0 0 70 86 403 0.049 31.77 6.02 Term + 216263 216466 204 1 0 75 37 161 0.219 6.09 6.03 PlyA + 216748 216753 6 1.05 7.00 Prom + 216781 216820 40 -4.15 7.01 Sngl + 216882 217055 174 2 0 56 36 177 0.705 3.94 7.02 PlyA + 218221 218226 6 1.05 8.00 Prom + 231331 231370 40 -4.15 8.01 Init + 231508 231652 145 0 1 83 72 107 0.913 8.93 8.02 Intr + 238626 238699 74 1 2 118 84 55 0.817 6.31 8.03 Intr + 267687 267763 77 2 2 22 82 78 0.132 -2.01 8.04 Term + 273327 273450 124 0 1 74 33 141 0.376 4.08 8.05 PlyA + 274063 274068 6 1.05 9.02 PlyA - 275008 275003 6 1.05 9.01 Term - 282795 282595 201 0 0 89 32 132 0.651 4.11 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 86563 86711 149 1 2 104 50 132 0.958 9.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_1|131_aa MVPASAQLLGRPHGAFTHGRRRSRSRRFCQNQYFEVLVGAVTISQLSMNNNIKNGKKYAF LSKSRKRVIMKDNIGNDDRGIPKQISFRNKNMDFRTWKQTHLKSSMPTYQDPSKVGLPQF EDCELHSLEAL >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_1|396_bp atggtgccagcatctgctcagcttctgggaaggcctcatggagcttttactcatggcaga aggcgaagcaggagcaggaggttttgccagaaccagtattttgaagtattggttggtgct gtgacaatttcacaactgtcaatgaataataacattaagaatggaaagaagtatgccttt ctttcgaaaagtagaaaaagggtaattatgaaagacaatattggaaatgatgacaggggt attcccaaacagatcagttttagaaacaaaaacatggacttcaggacctggaagcagacg cacttaaaatcttccatgccgacataccaagatccttccaaagtaggcctacctcagttt gaagactgtgaacttcactcgctagaagcactttaa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_2|625_aa MANDAKPDVKTVQVLRDTANRLRIHSIRATCASGSGQLTSCCSAAEVVSVLFFHTMKYKQ TDPEHPDNDRFILSRGHAAPILYAAWVEVGDISESDLLNLRKLHSDLERHPTPRLPFVDV ATGSLGQGLGTACGMAYTGKYLDKASYRVFCLMGDGESSEGSVWEAFAFASHYNLDNLVA VFDVNRLGQSGPAPLEHGADIYQNCCEAFGWNTYLVDGHDVEALCQAFWQASQVKNKPTA IVAKTFKGRGIPNIEDAENWHGKPVPKERADAIVKLIESQIQTNENLIPKSPVEDSPQIS ITDIKMTSPPAYKVGDKIATQKTYGLALAKLGRANERVIVLSGDTMNSTFSEIFRKEHPE RFIECIIAEQNMVSVALGCATRGRTIAFAGAFAAFFTRAFDQLRMGAISQANINLIGSHC GVSTGEDGVSQMALEDLAMFRSIPNCTVFYPSDAISTEHAIYLAANTKGMCFIRTSQPET AVIYTPQENFEIGQAKVVRHGVNDKVTVIGAGVTLHEALEAADHLSQQGISVRVIDPFTI KPLDAATIISSAKATGGRVITVEDHYREGGIGEAVCAAVSREPDILVHQLAVSGVPQRGK TSELLDMFGISTRHIIAAVTLTLMK >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_2|1878_bp atggccaacgacgccaagcccgacgtgaagaccgtgcaggtgctgcgggacacagccaac cgcctgcggatccattccatcagggccacgtgtgcctctggttctggccagctcacgtcg tgctgcagtgcagcggaggtcgtgtctgtcctcttcttccacacgatgaagtataaacag acagacccagaacacccggacaacgaccggttcatcctctccaggggacatgctgctcct atcctctatgctgcttgggtggaggtgggtgacatcagtgaatctgacttgctgaacctg aggaaacttcacagcgacttggagagacaccctaccccccgattgccgtttgttgacgtg gcaacagggtccctaggtcagggattaggtactgcatgtggaatggcttatactggcaag taccttgacaaggccagctaccgggtgttctgccttatgggagatggcgaatcctcagaa ggctctgtgtgggaggcttttgcttttgcctcccactacaacttggacaatctcgtggcg gtcttcgacgtgaaccgcttgggacaaagtggccctgcaccccttgagcatggcgcagac atctaccagaattgctgtgaagcctttggatggaatacttacttagtggatggccatgat gtggaggccttgtgccaagcattttggcaagcaagtcaagtgaagaacaagcctactgct atagttgccaagaccttcaaaggtcggggtattccaaatattgaggatgcagaaaattgg catggaaagccagtgccaaaagaaagagcagatgcaattgtcaaattaattgagagtcag atacagaccaatgagaatctcataccaaaatcgcctgtggaagactcacctcaaataagc atcacagatataaaaatgacctccccacctgcttacaaagttggtgacaagatagctact cagaaaacatatggtttggctctggctaaactgggccgtgcaaatgaaagagttattgtt ctgagtggtgacacgatgaactccaccttttctgagatattcaggaaagaacaccctgag cgtttcatagagtgtattattgctgaacaaaacatggtaagtgtggcactaggctgtgct acacgtggtcgaaccattgcttttgctggtgcttttgctgccttttttactagagcattc gatcagctccgaatgggagccatttctcaagccaatatcaaccttattggttcccactgt ggggtatccactggagaagatggagtctcccagatggccctggaggatctagccatgttc cgaagcattcccaattgtactgttttctatccaagtgatgccatctcgacagagcatgct atttatctagccgccaataccaagggaatgtgcttcattcgaaccagccaaccagaaact gcagttatttataccccacaagaaaattttgagattggccaggccaaggtggtccgccac ggtgtcaatgataaagtcacagtaattggagctggagttactctccatgaagccttagaa gctgctgaccatctttctcaacaaggtatttctgtccgtgtcatcgacccatttaccatt aaacccctggatgccgccaccatcatctccagtgcaaaagccacaggcggccgagttatc acagtggaggatcactacagggaaggtggcattggagaagctgtttgtgcagctgtctcc agggagcctgatatccttgttcatcaactggcagtgtcaggagtgcctcaacgtgggaaa actagtgaattgctggatatgtttggaatcagtaccagacacattatagcagccgtaaca cttactttaatgaagtaa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_3|445_aa MTKLSSRRPILKPENLLSSPKHLLHLLKHLLRTTEKLFLLPHSEEVAAAHASVHRPEHFK KPKGAAGIKGRPGPTIGIVDGVSVRQRGDGDSVPEKNRVHLFEGKKKSWLSVKDGHLDSV LEWPFWTKLVVVAIGFTGGLVFMYVQCKVYVQLWRRLKAYNRVIFVQNCPDTAKKLEKNF SCNVNTDIKDAVVVPVPQTEDQEANDILHIPMLWEDYRTVFQGSVRSSAAKPCPAVDQVD THKAIRVLATGARTVARRPVKKLLHIGQARHGDFATICYPQGITKMIAAGEKKTRPRAKV FYEWAEGSSGRHIYTCTKGGINNQCSSIAMSVVAKECKQFKGLSQIGLSSRNRTLPLCLL NKSLPGNQPSGSGEQEGAHHGDVLVATEQLRTLDPGAAAHKSTRAESHVAPEKPAPTQPE QERRVRFSKASPLTETASFSPVLGF >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_3|1338_bp atgaccaagttgagtagcagaagacccatcctgaagccagagaaccttctatcttctccg aagcatctcctccacttgctcaagcatctcctcagaacaacagaaaaactcttcctactg cctcactcagaagaggtggcagcagcacacgccagcgtgcaccgccctgagcatttcaaa aagccgaaaggggctgcggggataaagggaaggccaggacccacgataggaattgtggat ggggtttctgtgcgtcaaagaggggatggtgacagcgttccagaaaaaaatcgtgtccac ttatttgaaggcaagaaaaagagctggctgtctgttaaagatggtcatttggacagtgtc cttgaatggccattttggacaaaactggttgtggtagccattggcttcacaggaggtctt gtcttcatgtacgtacagtgtaaagtctatgttcagttgtggcgcaggctgaaggcctac aaccgtgtgatctttgtacaaaattgcccagacactgccaaaaaactggagaagaacttc tcatgtaatgtaaacacagacatcaaagatgctgtggtagtgcctgtaccacaaacagaa gatcaggaggcaaatgatatcctccatatccccatgctctgggaagattataggacagtc ttccaaggctcagtcaggtcatccgcagcaaaaccctgcccagctgtggatcaagtggac acacacaaggccatcagggtgctggcaacaggagcaagaacagtggcaagaaggccagtt aagaagctactgcacataggacaggcaagacatggcgactttgccactatttgttatcca caaggaatcactaaaatgattgctgcaggggagaagaaaactaggccacgagctaaagta ttctatgaatgggcagaaggttcttctgggagacatatttacacgtgtaccaaagggggg ataaataatcaatgcagcagtattgcaatgtctgtggtagcaaaagaatgtaaacaattt aaaggtctatcacagatcgggttaagttcccgaaaccgcaccctccctctctgcctgctc aacaagagccttccggggaaccaaccgagcggttcgggggagcaggagggggcccaccat ggtgacgtcctcgtggccacggagcagctccgcactctagacccaggagcagcagctcac aagagcacccgggccgaatcccacgtggcgcctgagaagcccgcgccgacccaaccggaa caggagaggcgagtgcgtttctctaaagcgtccccgttaactgagaccgcgtctttttcg cccgtgttgggcttttaa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_4|391_aa MYYLNQDAKLSNLFLQASSPTTGTAPRSQSRLSVCPSTQDICRSAILHDMSEESFEYCTP VMVLSPARKESGKKSVIQRPRRRRKASERYEHAAEEQIRGRKNDFHLQISSPRWRELYTD SSDSSSTDESHWIQAKRRAQVKFRLSRRRRRNNNKPCENLAGSSTPNGIELVDLGSKGKE QQELIECESCSLNLHRGKHTRYQECNPSLTQGSSEIKRSSRNHEKSKGRYHHRDPQLLQS LRKNEIMKKTFSETDSSTEILGVPEGSKDMNDAGLQVNNPVQKPPATYDDGSDNLEVCRI CHCEGDEESPLITPCRCTGTLRFVHQSCLHQWIKSSDTRCCELCKYDFIMETKLKPLRKG PEVSMPEIEMDPDSLSFASSAGQMLSIVCSS >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_4|1176_bp atgtattaccttaaccaagatgccaaattatctaacttgtttctccaggcaagcagccca acaacagggacagctcccaggagccagtcaaggttgtctgtctgtccatccactcaggac atctgcagatctgccatcttgcatgacatgtcagaagagtcatttgagtattgcacccct gtcatggtgctgagccctgctaggaaggagtctggtaagaaatcagtaatacaaagacct aggaggaggagaaaagcctctgagagatatgagcatgctgcagaagaacaaataagaggg agaaaaaatgactttcaccttcaaatctcaagccccaggtggagggagctttacacagat tcttcagattcatcttccacagatgagagtcattggattcaggcaaaaagaagagcccag gttaaattcagactgtcaagaagaaggaggagaaataataataagccgtgtgaaaactta gctgggtcttccactcctaacggaattgagctcgttgatctgggatccaaaggtaaagag caacaagagctgattgaatgtgagagttgctctttaaatctccacagaggaaaacataca aggtaccaagaatgcaacccttctctgactcaagggtcctctgaaattaagagatcctca aggaatcatgagaagagcaaaggcagatatcaccacagagatcctcagctcttgcaaagt cttaggaaaaatgaaataatgaagaagacattttcagaaacagattccagcactgaaata ctgggagttccagaaggcagcaaggacatgaatgatgcagggctccaggtgaataaccct gttcagaagcctcctgccacctatgacgatgggtctgataatttagaagtatgcagaatc tgtcactgcgaaggggatgaagagagccccctcatcacaccctgtcgctgcactgggaca ctgcgctttgtccaccagtcctgcctccaccagtggataaagagctcagatacacgctgc tgtgagctctgcaagtatgacttcataatggagaccaagctcaaacccctccggaagggc ccagaggtgtcaatgccagaaatagaaatggaccctgattccttgtcctttgcttcatcc gctggacagatgctgagcattgtctgcagtagttaa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_5|218_aa MEDEMNEMKREGKFREKRIKRNEQSLQEIWDYVKRPNLRLIGIPESDGENGTKLENTLQD IIQENFPNLARQDNIQIQEIQRMPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV TLKGKPIRLTADLMADTLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGVIKYFTDK QMLRDSVTTRPALKELLKEALNMERNNRYQLLQNHAKM >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_5|657_bp atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa agaaatgagcaaagcctccaagaaatatgggactatgtaaaaagaccaaatctacgtctg attggtatacctgaaagtgatggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggacaacattcagattcaggaaata cagagaatgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt accctcaaagggaagcccatcagactaacggcggatctcatggcagacaccctacaagcc agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt tcatatccagccaaactaagcttcataagtgaaggagtaataaaatactttacagacaag caaatgctgagagattctgtcaccaccaggcctgccctaaaagagctcctgaaggaagca ctaaacatggaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_6|459_aa MDKFLDTYTLPRLNQEEVESPNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFHEASIILIPKPGRHTTKKENFRPISLMNIDAKILNKILA KRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIQHINRAKDKNHMIISIDAEKAFDKI QQPFMLKTLNKLGIDGTYFKTIRAIYDKPTANIILNGKKLEAFRLKTGTRQGCPLSPPLF NIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQSLLKLISNFSKV SGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPL LKKIKEDTNKWKNIPCSWVGRINIVKMAILPKLQPRQYCLPRDPPSDIAEALPGTWRKPH LSINLVTGPLSSDSKVDPHPSTSPTDQGPEDSSVHLGTT >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_6|1380_bp atggataaattccttgacacatatactctcccaagactaaaccaggaagaagttgaatct ccgaatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca tttcatgaggccagcatcattctgataccaaagccaggcagacacacaaccaaaaaagag aattttagaccaatatccctgatgaacattgatgcaaaaatcctcaataaaatactggca aaacgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggttggttcaatatatgcaaatcgataaatgtaatccagcatataaacaga gccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaacccttcatgctaaaaactctcaataaattaggtatcgatgggacgtatttcaaa acaataagagctatctatgacaaacccacagccaatatcatactgaatgggaaaaaactg gaagcattccgtttgaaaaccggcacaagacagggatgccctctctcaccacccctattc aacatagtgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtatt caattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatcta gaaaaccccattgtctcagcccaaagtctccttaagctgataagcaacttcagcaaagtc tcaggatacaaaatcaatgtgcaaaaatcacaagcattcctatacaccaacaacagacaa acagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatac ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg ctcaagaaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtagga agaatcaatattgtgaaaatggccatactgcccaagttacaaccaaggcagtactgcctg cccagggacccacccagtgacatagcagaagccctcccaggaacctggaggaagccacat ttatccataaacctggtaacaggcccattgtcttcagactcaaaagtggaccctcatccc agtaccagccctacagaccaaggtcctgaagatagttcagtccacctagggaccacatag >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_7|57_aa MHEHNEKFNKDIKLKNTKTEVKNSIESFSSRLNHGEERIKKLEDRSFEICQFKEQKE >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_7|174_bp atgcacgaacacaatgagaagtttaataaagatataaagctgaagaatacaaagacagaa gtaaaaaactcaatagagagcttcagcagcagactcaatcatggagaagaaagaattaaa aagcttgaagacagatcatttgaaatttgccaattcaaggaacaaaaagaataa >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_8|139_aa MDEGGSHYSQQTDTGTENQTLHVLIHKWELNNENTWTQGEECHTPGPVVISSTTTKIPYI LEDVENSVLQIEHINFSVSKPDDTSVTAVEHQDSHELPWLTDIPAEQYFFLYMQHPSHTT TDVTVEHVITDRDESQRYG >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_8|420_bp atggatgaaggtggaagccattattctcagcaaactgacacaggaacagaaaaccaaaca ctgcatgttctcattcataagtgggagttgaacaatgagaacacatggacacaaggagag gaatgtcacacaccggggcctgtcgtgatatcttcgacaacaacaaaaataccatatata ttagaagatgttgaaaatagtgttctacagattgaacatatcaatttctctgtttctaag ccagatgacacttcagtgactgctgttgagcaccaagactctcatgaacttccctggctt acagatattccagctgaacagtactttttcctttacatgcaacatccatctcatacaacc actgatgtcactgtggagcatgttattacagaccgagatgagtcacaacgatacggttag >gi568815594r:163428751_163713504|GENSCAN_predicted_peptide_9|66_aa DSALILIGVMKLKARSKGALVRSNRVCLLGTDQGGEQWDMDLHGLVEDTPSTPVKITFET QMNMIL >gi568815594r:163428751_163713504|GENSCAN_predicted_CDS_9|201_bp gactctgcattaattctcattggtgtaatgaaactgaaagccaggagcaaaggagccttg gtgagatccaaccgggtgtgcctcctgggcacagatcagggtggggaacagtgggacatg gatctacacggactggtggaagatacacctagcacacctgtgaaaataacatttgaaacg caaatgaacatgatactttaa