GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:38:08 Sequence gi568815577f:31559770_31768575 : 208806 bp : 42.81% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 51 161 111 2 0 61 89 95 0.661 7.18 1.02 Intr + 8940 9073 134 2 2 57 77 69 0.054 1.22 1.03 Intr + 27925 28034 110 1 2 21 115 50 0.124 0.01 1.04 Intr + 33220 33493 274 2 1 44 52 183 0.335 5.97 1.05 Intr + 37820 37942 123 2 0 33 51 151 0.765 4.68 1.06 Term + 38000 38168 169 2 1 40 45 192 0.505 6.47 1.07 PlyA + 38370 38375 6 1.05 2.03 PlyA - 38942 38937 6 1.05 2.02 Term - 45802 45361 442 0 1 10 31 245 0.294 4.74 2.01 Init - 54498 54305 194 0 2 59 76 140 0.472 8.44 2.00 Prom - 60527 60488 40 -6.15 3.06 PlyA - 60638 60633 6 1.05 3.05 Term - 68117 67893 225 2 0 26 49 270 0.945 12.70 3.04 Intr - 68903 68529 375 2 0 24 84 277 0.315 15.19 3.03 Intr - 81229 81159 71 2 2 33 105 47 0.006 -1.12 3.02 Intr - 88609 88485 125 0 2 24 74 82 0.093 -0.29 3.01 Init - 94686 94232 455 0 2 88 99 288 0.387 25.28 3.00 Prom - 100640 100601 40 -6.45 4.00 Prom + 104516 104555 40 -8.85 4.01 Init + 105810 105919 110 2 2 21 47 137 0.107 1.16 4.02 Intr + 107489 107606 118 2 1 88 103 184 0.999 19.45 4.03 Term + 108702 108809 108 2 0 83 41 127 0.997 5.03 4.04 PlyA + 109102 109107 6 1.05 5.25 PlyA - 109238 109233 6 1.05 5.24 Term - 112585 111630 956 2 2 88 44 848 0.990 71.63 5.23 Intr - 117486 117393 94 0 1 29 99 113 0.872 5.12 5.22 Intr - 118756 118626 131 2 2 8 44 134 0.714 0.49 5.21 Intr - 125471 125280 192 0 0 44 111 132 0.821 9.64 5.20 Intr - 125715 125629 87 1 0 111 67 71 0.983 6.32 5.19 Intr - 125964 125799 166 0 1 109 86 76 0.999 8.11 5.18 Intr - 128695 128538 158 2 2 82 94 165 0.999 15.31 5.17 Intr - 131184 131028 157 0 1 5 103 151 0.699 6.96 5.16 Intr - 132680 132580 101 0 2 32 82 68 0.684 -0.49 5.15 Intr - 133715 133525 191 1 2 16 38 253 0.895 11.41 5.14 Intr - 134520 134435 86 0 2 82 91 86 0.965 6.00 5.13 Intr - 135211 135044 168 1 0 8 98 142 0.854 6.42 5.12 Intr - 136452 136344 109 2 1 43 87 129 0.997 7.67 5.11 Intr - 136981 136800 182 1 2 14 11 222 0.974 4.84 5.10 Intr - 141402 141226 177 0 0 65 71 186 0.976 13.79 5.09 Intr - 142149 142007 143 1 2 97 97 86 0.995 9.55 5.08 Intr - 142610 142475 136 2 1 63 86 104 0.974 6.92 5.07 Intr - 144157 143996 162 1 0 101 70 39 0.855 2.65 5.06 Intr - 146588 146505 84 2 0 86 90 49 0.755 4.00 5.05 Intr - 169588 169457 132 1 0 -21 78 130 0.387 1.02 5.04 Intr - 172010 171894 117 2 0 64 94 167 0.421 14.64 5.03 Intr - 173302 173151 152 2 2 83 111 47 0.227 5.46 5.02 Intr - 175491 175476 16 0 1 122 116 3 0.208 0.70 5.01 Init - 186319 186143 177 1 0 83 60 91 0.355 5.11 5.00 Prom - 194046 194007 40 -7.05 6.00 Prom + 195596 195635 40 -5.35 6.01 Init + 205835 206323 489 1 0 53 42 394 0.479 26.54 6.02 Term + 206526 206783 258 2 0 30 48 191 0.530 3.87 6.03 PlyA + 208248 208253 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_1|306_aa MVAFTRVQDSQGSLTVPSKRTWNNSVGFGPYAYLLGRTGAEGKEKYRDTLQTHIVIEEGQ QGKQEPTYCQARQLSAVQGLMCWKIKPNNSVVDIKYNDQRPWETRQQQTGCINKMSENGP EKEGVTCFSEDTWITKTTPSSLPKQSLPVALLSLSKLQTSAFFPLTLNSALWELNITISF RSLTVAMGYMNSNLRMEPKQALKYQPSSGSLHYLPFEQESWQKKRPTVTATTSPIHGLFL QGHFNSPPAERRRYSLHPAPLPGTFVLGTCNAVRKLSPVEGPHAGALQTSPAEGAAGGSS SCKAWR >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_1|921_bp atggtggccttcaccagagtccaggactcccagggttcacttaccgtgcccagcaagcga acttggaataacagcgtggggtttgggccttacgcttacctgctgggcagaaccggggct gaaggaaaggagaaatacagagacacactacagacacacatagtgatagaggagggtcag cagggaaagcaagaaccaacatactgccaggccaggcagctcagcgctgtgcaggggctg atgtgttggaaaatcaagcctaacaactctgtagtagacataaagtataatgatcaaaga ccgtgggagacaaggcaacagcagacaggatgtatcaacaaaatgtctgaaaatggtcca gagaaggaaggagtcacatgtttctcagaggatacgtggataacaaaaacaacgccatcc agtttgccaaaacaaagtctccctgtggccctgctgtcactttccaagctacagacaagt gcctttttccccctaactctcaactcagcattgtgggaactaaacatcacaatcagcttc aggagcctcactgttgccatgggttacatgaattcaaatctaaggatggaaccaaaacag gcccttaaatatcagccctcgagtgggagtctgcattaccttccattcgaacaagaatcg tggcagaaaaaacgtccaacggtgaccgcgacaacatctcccatccacgggctcttctta caaggacactttaactctcctcctgcggagagaaggcggtacagcttgcatccggctccc ttgcccgggacattcgttcttggaacttgcaatgctgtgaggaagctcagtcccgtggag gggcctcacgcaggcgcactgcagacatccccggcagagggtgcagccggcggcagcagc agctgcaaggcatggaggtga >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_2|211_aa MPSPSGMKRLLILSGALWITSIPVVVAPSQELSTTNKKIDSTIAPGLDEFVRLIMDFYST LNILLGCKGGAGSSYMAEARQKAKEDAPHTFKRPGLARTHSPSGEQHQGDGAKPLMRNPP PRSHRLPPGPPSTLGITIRQEIWVRSQFQTVSGRKWRLGICTLLLLKPPMCFRSLWKVFE ILSTAPKSTQTLFEGSVSHSTAEKKRLRVVK >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_2|636_bp atgccaagcccatcaggcatgaaacgtttgctgatcttgtcgggggctttgtggatcaca agcattccagtagtggtggcaccgagtcaggaactctcaacaacaaataaaaagattgat tccaccattgctcctggattggatgagtttgtcaggttaataatggatttctactccacg ctcaacatcctccttggatgcaaagggggagcaggctcgtcttacatggcagaagcaaga cagaaagcgaaggaggatgcgccacacacttttaaacgaccaggtctcgcaaggacacac tcaccgtcaggagaacagcaccaaggggatggagctaagccactgatgagaaatccaccc ccaagatcccatcgcctcccaccaggcccaccttcaacactggggattaccattcgacag gagatttgggtgagatcacagttccaaaccgtatcagggagaaagtggaggttgggaatc tgcacactgcttctcttaaaaccacccatgtgctttcgaagcctttggaaagtcttcgag atactctccaccgcccccaagtctacccagaccctgttcgaaggctctgttagtcactct acagcagagaaaaaaaggctaagggtggttaagtaa >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_3|416_aa MAGCRSRAMPRREAAKAGEKSSPVPVGGTALSTGPPQPLALVLSPSLHGAGRACRLLRVR ARQAHAHLELQLARKCPHSPCSRSCLSLHTSLQAEGASSGLGQPRKGLPQCSGGLKGSSS AAKVGAQAEEVPRASEASEDCQHAVTSHWDYSCSLMLLSFTLKDFLWRFLKGKTVVTSTD SAIGVGITCYSKSRSPIPGLQTSTSLWTARNRATWQEPCGTNTSVLEDQDEDKSPKKNTP WQISNGTSFVIISRKRPSEGNYQKEKDLYIQYFNQGSESDQLEFVEHLISRMCHYHHGRI NSYLKPMLQQDFITALPEQGLDHIAENILLYLVTRSLCAAELDERVIVTGSSDSTVRVWE VNTGEVLNTLIHHNEDVLHLRFSSGLMVTCSKDRSIAVWDMASATDITLRGVLVVH >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_3|1251_bp atggcgggctgcaggtcccgagccatgccccgcagggaggcagctaaggccggtgagaaa tcgagcccagtgccggtgggcggcactgctctgagtacagggcctccacagccgctggcc ctggtgctaagcccctcattgcatggggccggcagggcctgccggctgctccgagtgcgg gcccgccaagcccacgcacacctggagctccagctggcccggaagtgcccgcacagcccc tgttcccgctcatgcctctccctccacacctccttgcaagctgagggagccagctccggc cttggccagcccagaaaggggctcccacagtgcagcggtgggctgaagggctcctcaagt gctgccaaagtgggagcccaggcagaggaggtgccgagagcgagtgaggcctctgaggac tgccagcatgctgtcacctctcactgggactacagttgctctctgatgttgctttctttc actctaaaggacttcctatggcgtttcttgaaaggcaaaactgttgtcacttccactgac agtgccataggtgtgggcatcacctgctacagcaaatccaggtccccaatccctgggctg cagaccagtaccagtctgtggactgctaggaaccgggccacatggcaggagccatgtgga actaacacttcagttctggaagatcaagatgaagataagtccccaaagaaaaatactcct tggcagataagtaatggaacatcatttgtgatcatctccagaaagaggccatcagaagga aactaccaaaaagaaaaagacttgtatattcaatattttaaccaggggtctgaatcagat cagttggaatttgtggaacatcttatttcacgaatgtgtcattatcaccatggacgtatc aactcttacctgaagcccatgttgcagcaggactttattactgctttacccgagcaaggc ttagatcacatagcagaaaacattcttttgtacctggttaccaggtctctgtgtgcagca gagctggatgagcgtgtcattgtaactggctcttcggattctacggtgagagtgtgggaa gtgaacacgggtgaagttcttaacacattaatccaccacaatgaggatgtactgcactta cgcttcagcagtggactgatggtgacctgttccaaggaccgctccattgctgtgtgggac atggcttctgcgaccgatatcactttacgtggtgtcctggttgtccactga >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_4|111_aa MKGPPVLSRLCSVNVIPWCTAAPSTQDTVGMPPPRVVHVGDLGNVTADKDGVADVSIEDS VISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_4|336_bp atgaaagggcctcctgtgctgtcgaggttgtgctctgtgaatgtcatcccctggtgcaca gcagcaccttctacacaggatacagttggaatgccgccccctcgagttgtgcatgttgga gacttgggcaatgtgactgctgacaaagatggtgtggccgatgtgtctattgaagattct gtgatctcactctcaggagaccattgcatcattggccgcacactggtggtccatgaaaaa gcagatgacttgggcaaaggtggaaatgaagaaagtacaaagacaggaaacgctggaagt cgtttggcttgtggtgtaattgggatcgcccaataa >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_5|1357_aa MAEGERHISHGSRQEKRACAGKLPFTKLSNLVRLIHYHKNSTGKTCPQDPIISHQVPSTI TAIQAPAGPPSGEVMVAVLIRLCRPLLPPDVAWTSCRRRIFLVSQGLCPNTKHALARLVS QPRSAAARGLCDRRARSSRRRSANMDAVNAFNQEPLEISEGLGRGAGISFRISEDVGSSD RRAVVLKSEYFDSEGYWKLFSLMDMKPPISRAKMILITKAAIKAIKCKPEYKVPGLYVID SIVRQSRHQFGTDKDVFGPRFSKNITATFQYLYLCPSEDKSKIVRVLNLWQKNGVFKIEI IQPLLDMAAGTSNAAPVAENVTNNEGSPPPPVKVSSEPPTQATPNSVPAVPQLPSSDAFA AVAQLFQTTQGQQLQQILQTFQQPPKPQSPALDNAVMAQVQAITAQLKTTPTQPSEQKAA FPPPEQKTAFDKKLLDRFDYDDEPEAVEESKKEDTTAVTTTAPAAAVPPAPTATVPAAAA PAAASPPPPQAPFGFPGDGMQQPAYTQHQNMDQFQPRMMGIQQDPMHHQVPLPPNGQMPG FGLLPTPPFPPMAQPVIPPTPPVQQPFQASFQAQNEPLTQKPHQQEMEVEQPCIQEVKRH MSDNRKSRSRSASRSPKRRRSRSGSRSRRSRHRRSRSRSRDRRRHSPRSRSQERRDREKE RERRQKGLPQVKPETASVCSTTLWVGQLDKRTTQQDVASLLEEFGPIESINIAWALNKGI KADYKQYWDVELGVTYIPWDKVKPEELESFCEGGMLDSDTLNPDWKGIPKKPENEVAQNG GAETSHTEPVSPIPKPLPVPVPPIPVPAPITVPPPQVPPHQPGPPVVGALQPPAFTPPLG IPPPGFGPGVPPPPPPPPFLRPGFNPMHLPPGFLPPGPPPPITPPVSIPPPHTPPISIPN STIAGINEDTTKDLSIGNPIPTVVSGARGNAESGDSVKMYGSAVPPAAPTNLPTPPVTQP VSLLVFKVLLWLLEKQTLEVGKSGKETGSDAVAADEMGASPRWAGAMEVRGDRSRVQITG VTIGARTVTDSRSNVDSGAGTQGVAPGPVIGLQAPSTGLLGARPGLIPLQRPPGMPPPHL QRFPLMPPRPMPPHMMHRGPPPGPGGFAMPPPHGMKGPFPPHGPFVRPGGMPGLGGPGPG PGGPEDRDGRQQPPQQPQQQPQPQAPQQPQQQQQQQPPPSQQPPPTQQQPQQFRNDNRQQ FNSGRDQERFGRRSFGNRVENDRERYGNRNDDRDNSNRDRREWGRRSPDRDRHRDLEERN RRSSGHRDRERDSRDRESRREKEEARGKEKPEVTDRAGGNKTVEPPISQVGNVDTASELE KGVSEAAVLKPSEELPAEATSSVEPEKDSGSAAEAPR >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_5|4074_bp atggcagaaggtgaaaggcacatctcacatggcagcagacaagagaagagagcttgtgca gggaaactcccctttacaaaactatcaaatcttgtgagacttattcactatcacaagaat agcacgggaaagacctgcccccaggatccaattatctcccaccaggtcccttccacaata acagcaatacaagctccagctggtcctccttcgggggaggtgatggtggcagtcctcatc aggctctgcaggcctctcctccctccagatgtagcctggacttcctgtcggaggcgcatc tttctcgtttcccagggcctgtgccctaacactaagcatgctttggcccggctggtttcc cagccccgctccgccgcggcgcgaggtctatgtgaccggcgggcccggagcagccgccgc cgcagcgcgaacatggacgccgtcaacgccttcaaccaggagccattggagataagtgaa ggcttgggaagaggagcaggtatcagttttaggatttcagaagatgtaggaagcagtgat cgaagggcagtagtgctgaaaagtgagtattttgattctgaaggctactggaaactcttt tcgcttatggatatgaaacctcccatctctagagccaagatgattctcatcactaaagct gctattaaagctattaagtgtaaaccagaatacaaggttccgggattatatgtaattgac tcaattgtgcgacagtctcgtcatcagtttggaactgataaagatgtttttgggccaaga ttctctaaaaacataactgccacattccaatatttatatctttgtccatctgaagataag agtaaaatagttcgtgtgctgaacctttggcaaaaaaatggagtgttcaaaattgaaatt attcaacctcttttggacatggcagcgggaaccagtaatgcagccccagtagcagaaaat gttaccaataatgaaggctcacctccacctccagtaaaagtttcttctgaacctcccaca caagccactccaaactccgtcccagctgtaccacagttgcccagctctgatgcttttgct gctgtggctcagctgtttcagacaactcaaggccaacagcttcagcagatccttcagact tttcaacagcctccaaaaccacagtctcctgcccttgacaatgctgtgatggctcaggtt caggctatcacagctcagttaaagacaactcctacacaaccatctgaacaaaaagctgct ttccccccacctgaacagaaaactgcatttgacaagaagttgcttgatagatttgactat gatgatgagccagaagctgtggaagaatcaaagaaagaggataccactgccgtcaccacg acagcacctgctgccgcagtaccccctgcacccaccgccaccgtgcctgctgctgctgca cccgctgctgcctctcctcctcctccacaggcaccatttggctttcctggagatggcatg cagcagccagcatacacacagcatcaaaatatggatcagtttcagccacgaatgatggga atacaacaggatccaatgcaccatcaggttccacttcctcctaatggacaaatgccagga tttggacttcttcctacacctccatttcctcccatggctcagcctgtgattcctccaact ccaccagtgcagcagcctttccaagcttcttttcaggcacaaaatgaaccacttacacag aagccgcatcagcaggaaatggaagtagaacaaccttgtattcaagaggttaagcgacat atgtctgataacagaaagtcaagatctaggtcagcatccaggtcaccaaaaaggaggcga tctagatctggttctagatctcgaaggtctcggcatcgacgttctcgatctcggtccagg gatagacgccgacattctccccgatctcgatctcaagaaagacgggatcgagaaaaagag agagaacgtcgacaaaaaggcctccctcaagtgaaaccggaaactgcaagtgtttgcagt actaccctctgggtggggcagctggacaaaagaactactcagcaggatgttgccagtctc ttggaagagtttggtccaattgaatcaattaatattgcctgggccttaaacaaaggaata aaggcagattataagcagtattgggatgtagaacttggtgttacttatattccatgggac aaagtcaagcctgaggaactggagagtttttgtgaaggaggaatgttggacagtgacaca cttaacccagattggaaaggaattcctaagaagcctgaaaatgaagttgctcaaaatgga ggtgctgaaacctcacacacagaaccagtatcacccatacctaaaccattacctgtgcct gtccctcctattcctgttcctgcacctataacagtgccacctccacaggtcccaccacat caaccgggtccacctgtagttggtgctctccagccgcctgctttcacgcctcctctggga ataccgcctccaggctttggtcctggtgttcctcctccccctcctcctccaccatttttg cgcccaggattcaacccaatgcatttaccaccaggttttctgcctcctggacccccacct cctataactccaccagtatccattcctcctcctcacactccaccaataagcatcccaaac tctactatcgctggtataaatgaagacactacaaaagacttatctattggaaatcccatt ccaacagtggtgtctggggctagaggaaacgccgagtctggtgacagcgtgaaaatgtat ggctctgccgtgccacctgctgcacccacgaatctgcccacccctcctgtaacccagcct gtttcacttcttgtttttaaagtattactgtggctgttagagaaacagactctagaagtt ggtaagagtggaaaggagaccggttctgatgctgttgcagcagatgagatgggagctagc ccacgttgggctggtgcaatggaggtgagaggggacagatccagggtacaaataacaggt gtaaccattggagcacggacagtcacagacagtaggagtaatgtggattcaggtgcaggc actcaaggagttgcccctggtcctgtaattggacttcaggcaccatctactggtcttctt ggcgcccggcccggtctcatcccactccagcgccctccaggaatgcccccacctcactta cagcggttccctttgatgccgccccgtcccatgccaccgcacatgatgcacagaggccca ccgccaggaccagggggctttgcgatgcctccacctcatggaatgaaaggtcccttccca ccgcatggcccctttgttaggcctggtggaatgccagggctcgggggcccagggccaggc ccagggggtcctgaagacagagacggaaggcaacagccgccgcagcagccacagcagcag ccacagccgcaggcgccccagcaaccacagcagcagcagcagcagcagccaccaccatca caacagcctccaccaacacagcagcagccacagcagtttagaaatgataacaggcagcag ttcaattcaggtagagaccaagaaaggtttggaagaagatcttttggaaatagggtggaa aatgaccgggaacggtatgggaaccgtaatgatgatagagataatagtaaccgtgacagg agagagtggggaaggaggagccctgaccgggacaggcacagagacttggaagagagaaat agacgctctagtgggcatcgagacagagagagagattctagagatagagagtctcgtaga gagaaggaagaagcccgaggaaaggaaaagcctgaggtgacagacagggcaggtggtaac aaaaccgttgaacctcccattagccaagtgggaaatgtagacactgcttcagaacttgag aagggggtgtctgaggctgcagtcctaaagccttctgaagagttacctgctgaggctacc tcatccgttgaacccgaaaaggattctggctcagcagcagaggctcctcgttag >gi568815577f:31559770_31768575|GENSCAN_predicted_peptide_6|248_aa MWESLELLRDLLNGFDQKADSDMDNKAQAEVASNGYEELVGNWNKGHSSYAKRLVAFCFC LRDLWNFELERDDLGYLVEEISKGQSVQEKAEHKSLKNLQPDNAIEKKIPFSGERFKPAA EICISKEELNVNHQDNGEHVSGACQRPSWHPGDLRGKNGFVGQISEDVWKCLDVQAEVCC RARALMENLCLGSAEGKCGVGASHRVSAGALPSVAVRRGPPTSRHQNGISTNSLHGAPGK GAGTQRQL >gi568815577f:31559770_31768575|GENSCAN_predicted_CDS_6|747_bp atgtgggaaagtttggaacttcttagagacttgttgaatggctttgaccaaaaagctgat agtgatatggacaataaagcccaggctgaggtggcctcaaatggatatgaagaacttgtt gggaactggaataaaggtcactcttcctatgcaaagagactggtggcattttgcttctgc cttagagatttgtggaactttgaacttgagagagatgatttagggtatctggtagaagaa atttctaaggggcaaagtgttcaagagaaagcagagcataaaagtttgaaaaatttgcaa cctgacaatgcaatagaaaagaaaatcccattttctggggagagattcaagccagctgca gaaatttgcataagtaaggaggagctgaatgttaatcaccaagacaacggggaacatgtc tccggggcatgtcagagaccttcatggcatcccggagacctgagaggaaaaaatggtttt gtgggccagatttcagaggatgtatggaaatgcctggatgtccaggcagaagtttgctgc agagccagggccctcatggagaacctctgcttggggagtgcagaagggaaatgtggggtt ggagcctcacaccgagtctccgctggggcactgcctagtgtagctgtgagaagagggcca ccaacctccaggcaccagaatggtatatccaccaacagcttgcatggtgcacctggaaaa ggcgcaggcactcaacgccagctgtga