GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:21:06 Sequence gi568815596f:105141859_105342974 : 201116 bp : 44.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1712 1829 118 0 1 75 100 18 0.311 1.22 1.02 Intr + 2235 2317 83 0 2 84 88 26 0.561 1.48 1.03 Intr + 10899 10979 81 1 0 113 33 36 0.374 0.21 1.04 Term + 14265 14446 182 1 2 79 38 88 0.706 0.67 1.05 PlyA + 15113 15118 6 1.05 2.00 Prom + 15272 15311 40 -2.86 2.01 Init + 27452 27537 86 1 2 50 92 40 0.059 0.89 2.02 Intr + 35705 35772 68 2 2 59 94 58 0.151 2.05 2.03 Term + 36162 36268 107 1 2 110 48 66 0.811 3.37 2.04 PlyA + 37692 37697 6 1.05 3.04 PlyA - 40236 40231 6 1.05 3.03 Term - 43842 43734 109 2 1 116 47 73 0.480 3.98 3.02 Intr - 46678 46528 151 2 1 30 31 89 0.145 -3.38 3.01 Init - 54458 54371 88 2 1 87 81 21 0.154 2.11 3.00 Prom - 64431 64392 40 -2.46 4.03 PlyA - 65195 65190 6 1.05 4.02 Term - 66775 66342 434 2 2 44 41 249 0.934 11.26 4.01 Init - 68856 68751 106 0 1 62 59 117 0.344 4.61 4.00 Prom - 70191 70152 40 -4.96 5.06 PlyA - 70360 70355 6 1.05 5.05 Term - 70675 70588 88 0 1 46 43 118 0.055 0.23 5.04 Intr - 92414 92230 185 2 2 32 81 104 0.250 2.69 5.03 Intr - 93748 93633 116 2 2 69 116 -4 0.236 0.67 5.02 Intr - 95085 94890 196 0 1 84 96 75 0.208 6.89 5.01 Init - 95438 95304 135 2 0 68 97 41 0.302 3.20 5.00 Prom - 97764 97725 40 -8.26 6.00 Prom + 97890 97929 40 -8.16 6.01 Sngl + 100001 101119 1119 1 0 99 50 2110 0.962 205.04 6.02 PlyA + 103253 103258 6 1.05 7.17 PlyA - 103936 103931 6 1.05 7.16 Term - 116575 116543 33 1 0 105 50 28 0.185 -1.71 7.15 Intr - 127847 127414 434 0 2 60 115 683 0.686 61.57 7.14 Intr - 131156 130997 160 2 1 98 85 214 0.997 21.66 7.13 Intr - 131832 131686 147 0 0 103 99 107 0.999 13.73 7.12 Intr - 133845 133702 144 0 0 92 68 233 0.999 22.18 7.11 Intr - 135813 135756 58 2 1 89 109 18 0.998 2.89 7.10 Intr - 138865 138524 342 0 0 24 87 509 0.937 38.95 7.09 Intr - 142540 142458 83 1 2 105 91 59 0.728 6.34 7.08 Intr - 154652 154498 155 0 2 84 72 80 0.896 5.79 7.07 Intr - 156847 156653 195 2 0 97 73 229 0.950 21.69 7.06 Intr - 157704 157641 64 0 1 30 72 38 0.016 -5.31 7.05 Intr - 166460 165625 836 0 2 105 54 1271 0.008 116.51 7.04 Intr - 181452 181434 19 0 1 107 90 29 0.004 1.18 7.03 Intr - 186994 186804 191 2 2 50 45 158 0.024 7.00 7.02 Intr - 187896 187767 130 0 1 54 117 65 0.915 6.27 7.01 Init - 188421 188359 63 0 0 109 88 75 0.996 8.81 7.00 Prom - 190069 190030 40 -7.46 8.03 PlyA - 190660 190655 6 1.05 8.02 Term - 195819 195664 156 0 0 28 45 225 0.473 10.13 8.01 Intr - 196125 195964 162 0 0 126 30 35 0.347 1.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_1|154_aa XCHSVSLRPRAFQLPGLSFAAHKMVGPSESEVCGRVEGGSCGSGSSRRRRGRAAFVSGRR ARGAQRRRRRREERACIDDFDCDCVYVQHTVKNMEFHNMLSRKIQLLIPSEDFHHMEDSK KHSNANNQKKEPTDLGHCICCDFRVAKGDLHSRH >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_1|465_bp ncctgccactcagtatccctgcgtcctcgggcatttcaactcccaggtctcagttttgct gctcataaaatggttggaccatccgaaagcgaggtgtgcgggcgggtggaaggaggaagc tgcggctcggggagcagccggcggcgccgcggccgcgcagcctttgtctcgggccgccgg gcgcgcggggcccagcgcaggcgaaggagaagagaagagagggcgtgtatagatgacttt gactgtgactgtgtttatgtccaacacactgtgaaaaatatggaatttcataatatgttg agcaggaagatccagctactaatcccaagtgaggatttccaccacatggaggacagcaag aagcacagcaatgccaataaccagaagaaggagcctacagaccttggccactgcatctgc tgcgatttcagagttgccaagggggacttacactcaaggcactga >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_2|86_aa MGFKTCYPNTLAFEKTAEAGRSLSPSPQSRMDLVTTLAKECDRNPAGDSDTGISQRQAAS SLRTLILIARQRPTPNPIALGSGVLT >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_2|261_bp atggggttcaagacctgctaccccaacaccttggcgtttgagaaaacagcagaagcagga aggtctctctcaccttccccccagagccgaatggaccttgtgactaccttggccaaagaa tgtgacagaaatcctgctggagactctgacacaggtatcagccagcggcaggcagcaagc tctctccggactctcatccttatcgcccgccaaaggcccacccctaatcccatcgcactg gggagtggtgttttaacataa >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_3|115_aa MVLSMNQAEGCHREPSLLCLDLGTSSLQNALYSSSKQDSIAGATLKVPKVDDETFQYLNS PGFQPFIVRKGSSTKLTLLKENGIIVGRERAVAALSSKAVRAGVTVKAQGTKSCS >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_3|348_bp atggtgctgtctatgaaccaggcagagggctgtcatcgagaaccaagtttactatgcctt gatcttgggacttcgagcctccagaatgctttatattcatcttcaaagcaagattctata gcaggagctacactgaaagtaccaaaagtggatgatgaaacatttcagtatctcaattcc cctggttttcaaccattcattgtaagaaaaggctctagtaccaagctgactttgttaaaa gagaacggaataattgttgggagagagagagcagtggcggccctcagtagcaaggccgtg agagctggggtcactgtgaaggcccagggcaccaagagctgctcctga >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_4|179_aa MASRPIREVRGASARPPLLGSEEPLCPASRPVREGEIQTTIRDYYKHLYANKLENLEEMD KFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAGFYQRYKEELVP FLLKLFQSTEKEGILPNSFYEASIILIPKPGRDTTEKENFRPISLMNIDAKILNKILAN >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_4|540_bp atggccagccgccccatccgggaggtgaggggcgcttctgcccggccgcccctactggga agtgaggagcccctctgcccggccagccgccccgtccgggagggagaaatacaaactacc atcagagattactacaaacacctctatgcaaataaactagaaaatctagaagaaatggat aaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatctctgaat agaccaataacaggagctgaaattgtggcaataatcaatagcttaccaaccaaaaagagt ccaggaccagatggattcacagctggattctaccagaggtacaaggaggaactggtacca ttccttctgaaactattccaatcaacagaaaaagagggaatcctccctaactcattttat gaggccagcatcatcctgataccaaagccgggcagagacacaaccgaaaaagagaatttt agaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaactga >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_5|239_aa MPCEGTKAGRTRETCGGTVGAAPWQGAEQQRRLPRGGGRLPGNRSQLHLPPVTLPLYLQQ KLPWARPCSHVASSVGASIPIGPFPSSASKGVACIPVPLHLDPVGSRLRPGYAERGHSDN DVTSSSYQTFISGMSQHWEPQEGCLSAGKEWNPIVNCAREGSRLRTPYVNLMPDDLMPDD LLRWNSFIPKPSAPGPVEKLPPNKRIPGAKRPALKELLKEALNMERNNRYQLLQNHAKM >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_5|720_bp atgccttgcgaggggacaaaggcgggcagaaccagagagacctgcggaggcacagtcggg gcagccccgtggcagggagccgagcagcagcggcgattgccacgtggaggaggtagactt cctggcaaccggtcgcaactccacttgccccctgtgacactgcccctgtacctccagcaa aagctgccctgggctcgtccttgttcccacgtggcttcgtctgtcggagcatccattcct attggccccttccccagctctgcgtcgaaaggtgttgcgtgcatcccggtgcctctgcac ctggaccctgtgggatcgaggctgcgcccaggctatgcagagagaggccactcagataat gatgtaactagcagttcctatcagacttttatatcaggaatgagtcagcattgggagcca caggaaggatgtctttcagcaggaaaggagtggaaccctattgtgaattgtgcacgtgag ggatctaggttgcgcactccttatgtgaatctaatgcccgatgatctaatgcctgatgat ctgctgagatggaacagtttcatcccaaaaccatccgctcctggtcctgtggaaaaattg cctcccaacaaacggatccctggtgccaaaaggcctgccctgaaagagctcctgaaggaa gcactaaacatggaaaggaacaaccggtaccagctgctgcaaaatcatgccaaaatgtaa >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_6|372_aa MACNSTSLEAYTYLLLNTSNASDSGSTQLPAPLRISLAIVMLLMTVVGFLGNTVVCIIVY QRPAMRSAINLLLATLAFSDIMLSLCCMPFTAVTLITVRWHFGDHFCRLSATLYWFFVLE GVAILLIISVDRFLIIVQRQDKLNPRRAKVIIAVSWVLSFCIAGPSLTGWTLVEVPARAP QCVLGYTELPADRAYVVTLVVAVFFAPFGVMLCAYMCILNTVRKNAVRVHNQSDSLDLRQ LTRAGLRRLQRQQQVSVDLSFKTKAFTTILILFVGFSLCWLPHSVYSLLSVFSQRFYCGS SFYATSTCVLWLSYLKSVFNPIVYCWRIKKFREACIELLPQTFQILPKVPERIRRRIQPS TVYVCNENQSAV >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_6|1119_bp atggcctgcaacagcacgtcccttgaggcttacacatacctgctgctgaacaccagcaac gcctcagactcggggtccacccagttgcccgcacccctcaggatctccttggccatagtg atgctgctgatgaccgtggtggggttcctgggcaacactgtggtctgcatcatcgtgtac cagaggccggctatgcgctcggccatcaacctgctgctggccaccctggccttctccgac atcatgctgtccctctgctgcatgcccttcaccgccgtcaccctcatcaccgtgcgctgg cactttggggaccacttctgccgcctctcagccacgctctactggttttttgtcctggag ggcgtggccatcctgctcatcatcagcgtggaccgcttcctcatcatcgtccagcgccag gacaagctgaacccgcgcagggccaaggtgatcatcgcggtctcctgggtgctgtccttc tgcatcgcggggccctcgctcacgggctggacgctggtggaggtgccggcgcgggcccca cagtgcgtgctgggctacacggagctccccgctgaccgcgcctacgtggtcaccttggtg gtggccgtgttcttcgcgccctttggcgtcatgctgtgcgcctacatgtgcatcctcaac acggtccgcaagaacgccgtgcgcgtgcacaaccagtcggacagcctggacctgcggcag ctcaccagggcgggcctgcggcgcctgcagcggcagcaacaggtcagcgtggacttgagc ttcaagaccaaggccttcaccaccatcctgatcctcttcgtgggcttctccctctgctgg ctgccccactccgtctacagcctcctgtctgtgtttagccagcgcttttactgcggttcc tccttctacgccaccagcacctgcgtcctgtggctcagttacctcaagtccgtcttcaac cccatcgtctactgctggagaatcaaaaaattccgcgaggcctgcatagagttgctgccc cagaccttccaaatcctccccaaagtgcctgagcggatccgaaggagaatccagccaagc acagtctacgtgtgcaatgaaaaccagtctgcggtttag >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_7|1017_aa MGLVGGLRIPTMPSSPGGAPARLAAAPRDGKRRLRGRASPGAAGRSGAAGPQAAGRRGTA GAGAGLLSDPQEASYLAAKNGGSQITQQALLHVDCTRSCGQNMSEQSLITDEVLMGKHLV YQEGRQLREADKARADQPVDMMSIKAFTLVSAVERELLMGDKERVNIECVECCGRDLYVG TNDCFVYHFLLEERPVPAGPATFTATKQLQRHLGFKKPVNELRAASALNRLLVLCDNSIS LVNMLNLEPVPSGARIKGAATFALNENPVSGDPFCVEVCIISVKRRTIQMFLVYEDRVQI VKEVSTAEQPLAVAVDGHFLCLALTTQYIIHNYSTGVSQDLFPYCSEERPPIVKRIGRQE FLLAGPGGLGEEDAFVRLIFLGGLGDCVLKLESEIGAVQNKSSVTLGVCGKTEFSGWAMF SLKQVNAVTLFDLQGMFATVAGISQRAPVHWSENVIGAAVSFPYVIALDDEFITVHSMLD QQQKQTLPFKEGHILQDFEGRVIVATSKGVYILVPLPLEKQIQDLLASRRVEEALVLAKG ARRNIPKEKFQVMYRRILQQAGFIQFAQLQFLEAKELFRSGQLDVRELISLYPFLLPTSS SFTRSHPPLHEYADLNQLTQGDQEKMAKCKRFLMSYLNEVRSTEVANGYKEDIDTALLKL YAEADHDSLLDLLVTENFCLLTDSAAWLEKHKKYFALGLLYHYNNQDAAAVQLWVNIVNG DVQDSTRSDLYEYIVDFLTYCLDEELVWAYADWVLQKSEEVGVQVFTKRPLDEQQKNSFN PDDIINCLKKYPKALVKYLEHLVIDKRLQKEEYHTHLAVLYLEEVLLQRASASGKGAEAT ETQAKLRRLLQKSDLYRVHFLLERLQGAGLPMESAILHGKLGEHEKALHILVHELQDFAA AEDYCLWCSEGRDPPHRQQLFHTLLAIYLHAGPTAHELAVAAVDLLNRHATEFDAAQVLQ MLPDTWSVQLLCPFLMGAMRDSIHARRTMQVALGLARSENLIYTYDKALSGLDDAHT >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_7|3054_bp atgggcctcgtgggagggctgcgcatcccaaccatgccttcgtctcctggtggggcaccg gcgcgcctggcggccgccccacgtgacgggaagcggcggctgcggggtcgggccagccca ggagccgcgggccggagcggggcggcggggccccaggccgcggggcggcgcgggacggcg ggcgccggcgccggactgttgagtgaccctcaggaagcctcctacctggctgctaaaaat gggggctcacagatcacacagcaggccttactgcacgtggactgcacgcgcagctgtggt caaaacatgtctgagcagagtttaatcactgatgaagtgctcatgggcaagcaccttgtc tatcaggaaggaaggcagctgcgggaagctgacaaagccagagcagatcagccagtagac atgatgagcatcaaagcctttacgcttgtctctgctgtggagcgggagctgctgatgggc gacaaggagcgcgtcaacatagagtgcgtggagtgctgcggcagggacctctacgtgggc accaacgactgcttcgtctaccacttcctgttggaggagaggccagtgcctgctgggcca gccacgttcactgccaccaaacagctgcagagacacttgggcttcaagaagcccgtgaac gagctgcgtgcggcctcagcactcaacaggctgctggtgctgtgtgacaactccatcagc ctggtcaacatgctgaacctcgagccagtgccttcgggggcccgcatcaagggggcagcc acgtttgcactgaacgagaaccctgtgagtggggaccccttctgtgtagaagtttgcatc atctctgtcaaacgcagaaccatccagatgtttctggtgtacgaggaccgggtgcagatc gtcaaggaggtgtcgactgccgagcagcccctcgctgtggctgtggacggccacttcctg tgtctggctctgaccactcagtacatcatccacaattacagcacaggcgtctcccaggac ctgtttccctactgcagtgaggagaggccgccgatcgtcaagaggatagggagacaggag ttcctgctggcgggccccggagggctgggtgaggaggatgcatttgtgcgcctgatcttt ttgggtggcctcggtgactgcgtgttgaaactggagagtgagattggcgctgtgcagaat aaatcctcagtgacattgggcgtgtgtggcaagacagagttctctggttgggcgatgttc agccttaagcaggtaaatgcagtgaccctctttgacctccagggcatgtttgccacagtc gcagggatatcccagcgcgcccccgtgcactggtcggagaatgtgattggggcggctgtg tcctttccatacgtcatagcgctcgatgacgaattcatcacagtccacagcatgttggat cagcaacagaagcagacgctgccctttaaggagggccatatcctacaggactttgaagga agagtgatcgttgccacaagtaaaggagtttacatcttggttccattacctttggaaaaa caaatacaggatcttctagcaagccgcagagtagaagaggctttggttttagcaaaagga gcccggaggaacattccaaaggaaaaatttcaggtaatgtacagaaggattctgcagcag gcgggatttatacagtttgcacaacttcagttcctggaagctaaagagctcttcagaagc ggccagcttgatgtccgggagctgatctctctctaccccttcctgttgcccacctcctcc tccttcacccggtcccaccctcctcttcatgagtacgcagacctgaaccagctgacccag ggggaccaggagaagatggccaagtgcaaacgcttcctcatgagctacctgaacgaggtc cgcagcacagaggtagcaaatggctacaaggaggacatcgacacagccttgctcaaactg tatgcagaggctgaccacgacagcctgctggacctcctggtcactgagaacttctgtctt ctgacggacagtgctgcctggctagagaagcacaaaaagtattttgcacttggactgctc tatcattataataaccaagatgctgctgcagttcagttgtgggtgaacattgtgaatggc gatgtccaggactccacacgctcagacctgtatgaatacatcgtggattttcttacctac tgcttagacgaggaactagtgtgggcctatgctgattgggtcctgcagaaaagtgaagag gtcggagttcaggttttcaccaagagacctttggatgaacagcagaagaacagttttaat ccagacgacattatcaattgccttaaaaaataccctaaagcccttgtgaagtatctggaa catcttgtgatagacaagagactgcagaaagaagagtatcacacccacttagctgtgctg tacctggaagaggtgctgctgcagagggcctccgccagtggcaagggtgcagaggccacc gagacgcaggccaagctgcggcggctgctccagaaatctgatttataccgagtccacttt cttctcgagaggctgcagggagctggcctgcccatggagagcgccatcctgcacgggaag ctgggcgagcatgagaaggcgctgcatatcctggtgcacgagctgcaggactttgcagcg gccgaggactactgcctgtggtgctccgagggccgagacccaccccaccgccagcaactc tttcacacgctgctggccatctacctgcatgctggccccactgcccacgagctggccgtg gctgccgtggacctgctgaaccgccacgccaccgaatttgatgcagcccaggtgctgcag atgctgcctgacacctggtcagtgcagctcctctgcccattcctgatgggggccatgagg gacagcatccatgccaggaggaccatgcaggtggctctcggcctggccaggtccgaaaac ttaatctacacctacgataaggccctcagtggattggatgatgcccacacatag >gi568815596f:105141859_105342974|GENSCAN_predicted_peptide_8|105_aa VGAYTQAGGVAWDSSLQAIGTQIDKPYFPSIPDILRTSHFWRGGEEQERGDSCLSEEKEL LGQQLRVQQQFRVRAAATAHIPRHGRRQETPVGYPQSPAKEALRK >gi568815596f:105141859_105342974|GENSCAN_predicted_CDS_8|318_bp gtcggggcttacacacaggcagggggtgtcgcctgggactcctctttacaagccatcggg actcagatagataagccctatttcccaagcattcctgatattctacggacgtctcatttt tggagagggggcgaggagcaggagcgaggagacagctgcctgagtgaggagaaggaactc ctgggacagcagctccgggtgcagcagcagttccgagtccgtgcagctgcgaccgcccac atcccccgccatggtcgtcgccaggagaccccagtcggctacccacaaagccccgcgaag gaagcgttgcggaagtga