GENSCAN 1.0    Date run: 6-Nov-116    Time: 08:30:33

Sequence gi568815584f:54409960_54638380 : 228421 bp : 40.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1574   1747  174  2  0   27   60   154 0.367   4.63
 1.02 Intr +   5940   5971   32  0  2   88   91    26 0.744  -0.24
 1.03 Intr +   7892   7992  101  0  2   41  100    73 0.624   2.71
 1.04 Term +  10587  10757  171  2  0   91   49   192 0.995  12.44
 1.05 PlyA +  11686  11691    6                               1.05

 2.21 PlyA -  11814  11809    6                               1.05
 2.20 Term -  13680  13664   17  1  2   88   41    23 0.059  -4.98
 2.19 Intr -  19472  19269  204  0  0  115   72   136 0.662  12.95
 2.18 Intr -  20445  20366   80  2  2   89  -24    74 0.161  -5.42
 2.17 Intr -  22261  22149  113  1  2  108   75    -2 0.252  -1.24
 2.16 Intr -  26478  26410   69  0  0   48  105    84 0.529   4.56
 2.15 Intr -  31510  31288  223  0  1    9   94   323 0.079  22.21
 2.14 Intr -  33509  33308  202  0  1   42   97    57 0.004  -0.58
 2.13 Intr -  34554  34432  123  1  0   89   89     7 0.007   0.54
 2.12 Intr -  36321  36256   66  1  0   84   54    68 0.118   0.96
 2.11 Intr -  37458  37333  126  1  0   76   47   129 0.125   7.33
 2.10 Intr -  40248  40155   94  0  1   98   11    95 0.121   1.42
 2.09 Intr -  61136  61010  127  1  1   32   72   173 0.079   9.86
 2.08 Intr -  63205  63058  148  2  1  109    5    80 0.068   0.17
 2.07 Intr -  68200  68155   46  1  1   85   46    69 0.088  -0.54
 2.06 Intr -  71499  71407   93  0  0   68   29   117 0.004   3.04
 2.05 Intr -  72243  72194   50  1  2   60   68   101 0.992   2.78
 2.04 Intr -  73808  73712   97  2  1   99   71   156 0.997  13.66
 2.03 Intr -  78449  78364   86  0  2   98   25    70 0.053   0.32
 2.02 Intr -  95368  95233  136  1  1  111   80    96 0.104  10.32
 2.01 Init -  96925  96866   60  1  0   70   34    53 0.036  -0.60
 2.00 Prom -  98732  98693   40                              -6.25

 3.06 PlyA -  99297  99292    6                               1.05
 3.05 Term - 100251  99941  311  1  2   68   52   350 0.632  23.64
 3.04 Intr - 100370 100281   90  0  0   60   61   168 0.932  10.35
 3.03 Intr - 102305 102192  114  0  0   81   22    81 0.296   0.30
 3.02 Intr - 107977 107898   80  0  2   54   89    53 0.255   0.28
 3.01 Init - 110626 110505  122  1  2   69   27   136 0.308   5.31
 3.00 Prom - 112937 112898   40                              -6.35

 4.00 Prom + 118705 118744   40                              -6.65
 4.01 Init + 119169 119264   96  2  0   87    2    91 0.353   0.76
 4.02 Intr + 120915 121091  177  2  0   46   87   102 0.515   5.09
 4.03 Term + 128104 128424  321  0  0   83   49   113 0.350   0.74
 4.04 PlyA + 129304 129309    6                               1.05

 5.09 PlyA - 129343 129338    6                               1.05
 5.08 Term - 156714 156395  320  1  2   42   46   202 0.744   5.46
 5.07 Intr - 157065 156986   80  2  2  132   52   100 0.196   9.08
 5.06 Intr - 157630 157323  308  1  2   76   80   156 0.280   7.92
 5.05 Intr - 157890 157817   74  1  2   53   84   162 0.658  10.51
 5.04 Intr - 158245 157957  289  1  1   21   25   321 0.368  15.00
 5.03 Intr - 158445 158342  104  1  2   97  107   113 0.838  13.07
 5.02 Intr - 174395 174264  132  0  0   54   93    44 0.147   1.20
 5.01 Init - 177540 177090  451  0  1   86   43   154 0.214   6.82
 5.00 Prom - 185467 185428   40                              -5.75

 6.00 Prom + 191420 191459   40                              -5.95
 6.01 Init + 196913 197136  224  1  2  109   47   137 0.947   9.80
 6.02 Intr + 201344 201563  220  2  1   70   32   157 0.109   5.68
 6.03 Intr + 212801 212941  141  1  0   37   74    96 0.179   2.73
 6.04 Term + 218412 218564  153  2  0   39   43   155 0.209   3.04
 6.05 PlyA + 218575 218580    6                               1.05

 7.03 PlyA - 219463 219458    6                               1.05
 7.02 Term - 223667 223566  102  2  0   81   33    78 0.473  -1.10
 7.01 Init - 224620 224462  159  1  0   72   81   112 0.760   8.77

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  71499  71394  106  0  1   68   33   136 0.985   4.97

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_1|159_aa
XGELSKYRVPNLLDLYQQCGIITHHHPIADGGTPDIASCCEIMEELTTCLKNYRKTLIHC
YGGLGRSCLAACLLLYLSDTISPEQAIDSLRDLRGSGAIQTIKTSGIEKDPVSLPLAPSQ
YWISPKGVYSLWAGTFSETRNSLVQTAAPPFVRPRNYTE

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_1|480_bp
nnaggggaactgtcaaaatatagagtcccaaaccttctggatctctaccagcaatgtgga
attatcacccatcatcatccaatcgcagatggagggactcctgacatagccagctgctgt
gaaataatggaagagcttacaacctgccttaaaaattaccgaaaaaccttaatacactgc
tatggaggacttgggagatcttgtcttgctgcttgtctcctactatacctgtctgacaca
atatcaccagagcaagccatagacagcctgcgagacctaagaggatccggggcaatacag
accatcaagacctccggaattgaaaaggatcctgtgagtcttccactcgccccctcccag
tactggatcagccccaagggtgtttattcactatgggctggaacattttcagagacaagg
aattcactggttcagacagctgctccaccttttgtccgccctaggaactacacggaatga

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_2|719_aa
MPNQATSATECHCSSTMVKVLFMAFSRNKAEARTGGYDAAPALALSPLHTPADHKQKTFL
PSYLAASYRGFLLIFAFLWQLETFRPKALALGLKSESLVVCDVAEDLVEKLRKFRFRKET
NNAAIIMKIDKDKRLVVLDEELEGISPDELKDELPERQPRYPFLSALKDLFSLVVFEIRN
TEDLTEEWLRGITELVEPDYQLITGSTLWCFRTQGLQNISSTDLMFYNSDFIPRNNLGRR
HLSGVVEEDPPSVWVSTIQSAGALERTKTEKANMPVYPVELVRTQCFSPSGDAAIRRHLG
RRQQPSPDTNPAGTPSAKGRMDDTATNIEKLHPAGQPEALYLHAAVTQGFGKVHRLQAAD
LCLAVQIGNMASGGSDESNFTKNVIGFYYFKKEIEMRLCLELVGSWSDFKNEATDPRGGA
ACQSRAVHPHSSALGWSMGLGAMEQGVALAGEAWAAQEPMEGVGGSGMAGCRSRALPRGK
AAKARRAERAGAAAVAVAVAAVAGGSEWFYPEGPAPPFSAGNGAAPRSSSPAMAFTFAAF
CYMLALLLTAALIFFAIWHIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFFCVMFLC
AAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTIMNADILAYCQGVSASALLTLLG
CVGISVGSCPVYHGTFSSIPGLYPLVAMSTLLPPHDNQKCLWTLPDVLRGKIALENALK

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_2|2160_bp
atgcccaatcaggctacctctgccactgaatgccattgctctagcaccatggtgaaggtg
ctttttatggccttttcacggaacaaggcggaagccagaacgggtggatatgatgctgca
cctgctctagccctttctcctctccacactccagcagatcacaagcagaagactttccta
ccctcctacttggcagcttcctacagaggatttctccttatctttgctttcctatggcag
ttggaaacctttcgacccaaagctttggcactgggccttaaaagtgagtctttggttgtt
tgtgatgttgccgaagatttagtggaaaagctgagaaagtttcgttttcgcaaagaaacg
aacaacgctgctattataatgaagattgacaaggataaacgcctggtggtactggatgag
gagcttgagggcatttcaccagatgaacttaaagatgaactacctgaacgacaacctcga
tatccttttcttagtgctttaaaagatttattttctttagtggtatttgaaataagaaat
accgaagacctaactgaagaatggttacgtgggatcacagaactggttgagccagattac
cagctcatcacaggctcgaccctgtggtgcttcagaacacagggtctgcagaatatctca
agcactgatcttatgttttacaacagtgattttatccccaggaacaatttggggagacgg
catctgagtggagtagtggaggaagatccaccctcggtgtgggtaagcaccatccaatct
gctggggccctggaaagaacaaaaacagaaaaggcaaatatgccagtctacccggtggag
ctggtgagaacacagtgtttctccccttcaggggatgcagcaataaggcgccatcttgga
aggagacagcagccctcaccagacaccaatcctgctggcactcccagtgcaaagggcaga
atggatgacacagccaccaatattgagaaactacatcctgctgggcagcctgaagctctc
tacttgcatgctgccgtaacccaaggttttggaaaggtacataggctccaagcagctgac
ttgtgtctagctgtccagattgggaacatggctagtgggggcagtgatgaaagtaacttt
acaaaaaatgttattggattttattattttaaaaaagaaatagagatgagattgtgcctg
gaattggtgggttcttggtctgacttcaagaatgaagccacggaccctcgtggtggagct
gcctgccagtcccgcgccgtgcacccgcactcctcagcccttgggtggtcgatgggactg
ggcgccatggagcagggggtggcgctcgctggggaggcttgggctgcacaggagcccatg
gagggggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgcgggaag
gcagctaaggcccggcgcgctgagagggctggcgccgcggcggtagcggtggcggtcgcg
gctgtggccgggggaagtgaatggttttacccagagggccctgcgccgcctttctccgct
ggcaacggcgccgctccccgctcctcctccccagccatggcgttcacgttcgcggccttc
tgctacatgctggcgctgctgctcactgccgcgctcatcttcttcgccatttggcacatt
atagcatttgatgagctgaagactgattacaagaatcctatagaccagtgtaataccctg
aatccccttgtactcccagagtacctcatccacgctttcttctgtgtcatgtttctttgt
gcagcagagtggcttacactgggtctcaatatgcccctcttggcatatcatatttggagg
tatatgagtagaccagtgatgagtggcccaggactctatgaccctacaaccatcatgaat
gcagatattctagcatattgtcagggtgtctcagcttcggcactgttgacgcttctgggc
tgtgtgggaatttctgtgggaagctgtcctgtatatcatgggacgttcagcagcatccct
ggcctttacccactagttgccatgagcaccctcctgccccctcatgacaaccaaaaatgt
ctctggacattgccagatgtcctgaggggcaaaattgccctggaaaacgcccttaagtag

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_3|238_aa
MVASGSTGVGLTGKEHKKIDGNVQYLDRGLSCIDDVFDNTWKMRQAGSHYPQQTNARTEN
QTLHILTLQADMANSGKKQIVKNSEKYQGKNHDRKHMNSRDGIMGETKLSKTPQCGTTPR
CPSDTDQTVMQGSDAEKVGVLGTRGHRIPDVLRPAALRLRCPLLLVATPLWRYPPLRLSP
LGHLPSQYQAGGHDEAGKDHRDVEKRRVFIKRYQKHRSHQGLALGRARLQPRSPAQPG

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_3|717_bp
atggttgcttccggaagtacaggagtaggactgactgggaaagagcataagaaaattgac
ggtaacgttcagtatcttgacagaggcttaagctgtatagatgatgtgtttgataacact
tggaaaatgcgtcaagctggaagccattatcctcagcaaactaatgcaagaacagaaaat
caaacattgcatattctcactttgcaggcagacatggcaaacagtggaaagaagcaaata
gtgaaaaactcagagaagtaccaagggaaaaaccatgacagaaaacacatgaacagtaga
gatggtattatgggagaaacaaagctgagtaaaacaccccaatgcggtaccaccccaaga
tgccccagcgacacggaccaaacggtgatgcagggcagtgacgcagagaaagtcggagtt
ctaggcacccggggacaccgaatccccgacgtcctccggcccgcggccctccggctccgg
tgccccctgctgctcgtcgcgaccccactctggcggtatccgccccttcgtctctcaccc
ctgggacacttaccatcccaataccaggccggtggtcacgatgaagcaggtaaagaccac
cgcgatgtagaaaagcggcgagtattcataaagcgttaccagaaacaccgcagccatcag
ggtcttgctctgggtagagcccggctccagccgcggagcccagcccagcccggctga

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_4|197_aa
MTKTRPFLSSQCISAKEYYTLQFALLQFAPPAWQFTFTFSKSIKKDSKEEIYCQLPRDTK
IEDFGTVPRSRYPLVALLTLADEDDREIYDIQLFMSANNNFTPSNNSSSEEKNTDRSLLE
KVGLSESEVEPSEENSKDCVVCQNGTVNWVLLPCRHTCLCDGCVKYFQQCPMCRQFVQES
FALCSQKEQDKDKPKTL

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_4|594_bp
atgaccaaaaccaggccctttttatctagtcagtgtatcagtgccaaggagtactacacg
ttacagtttgcgctgctacagtttgcacccccagcatggcaatttacctttacattttca
aaaagtattaaaaaggatagcaaagaagaaatatattgccagttaccaagagatactaaa
attgaagactttggtacagtacccagatctcgctatccattggtagcgctattgacctta
gctgatgaggatgaccgggaaatttatgatattcaacttttcatgtctgcaaataataat
ttcactccctccaacaattcctcttcagaagaaaaaaacacagacagaagtttgttggaa
aaggtgggactctctgaaagtgaagttgagccatcggaagagaacagcaaggactgtgtt
gtttgccagaatgggactgtgaactgggtactcttaccatgcagacacacatgcctgtgt
gatggctgtgtgaagtattttcagcagtgcccaatgtgcaggcagtttgttcaggaatct
tttgcactttgcagtcaaaaagagcaagataaagacaaaccgaagactctttga

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_5|585_aa
MTRIPTFTTSIQHSTGSPSWSNQTRERHQGIKISKEEVKLSLFADDMIVYLENPKDSSKK
LLELVNEFSKVSRYKINVHRSVVLLYTNSNQAENQIENSTPFTIAANKIKCLGIYLTKDM
KDPYKENYKTLLKEIIDDANKWKHIPFSWIAAMSYLQSSPTPAHARHSILNPNLNTEMQC
AKNEVSLAHNYNSKGNCMREKSIIHATVATTRKGLTEFTVGKIPKRPAEGLASAWPEAEK
RREERGLGRRDGGGGRRTLTGAVGLAFEDVQLGAVGQRVLQAELEEAGLGLAHALEQRQQ
RNSLLALVPALKPARQHPDLVAKHHVWRSRSPAAGDAFLFQGGSGARTLQLWDGDFRRPG
VSSPLGGNSDPSRPHSRAAITTGRRERAAVRPRLAEERALGGRGHQRVLGPGEGASRGRL
GSPKSSRPLRPLRTRCGRWAIPHRVTARSLLPREAAERLQLHLQQQPASPPPGAPLDLGR
RGRHSAGHTPREGTAGVRSAAPFGQRGGPAGNADRRRRHTPPNDSKESSAAGVPPVPSRF
SWPPPAGRRTQRLSLQGAASFQQHAASGSQNFSGGGGSPRGGDPQ

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_5|1758_bp
atgacgaggatacccactttcaccacttctattcaacatagtactggaagtcccagctgg
agcaatcaaacaagagaaagacatcagggcatcaaaatcagtaaagaggaagtcaaactg
tcactgtttgctgatgatatgattgtatacctagaaaaccctaaagactcatccaaaaag
ctcctagaactagtaaatgaattcagcaaagtttcaagatacaaaattaatgtacacaga
tcagtagttctgctatacaccaacagcaatcaagctgagaatcaaatcgagaactcaacc
cctttcacaatagctgcaaacaaaataaaatgcttaggaatatacctaaccaaggacatg
aaagatccctataaggaaaactacaaaacactgctgaaagaaatcatagatgatgcaaac
aaatggaaacacatcccattctcatggatagctgctatgtcttatctgcaaagctctcct
actcctgcacatgcaagacactccatcctcaacccaaacctcaacactgaaatgcagtgt
gccaaaaatgaagtcagtctggcacataattataattccaaaggaaactgcatgagagaa
aaatcaatcattcatgctaccgtagctaccacacggaaagggctgacggagttcacagtg
ggcaaaatcccaaagcggccggcggagggactggcctcggcctggcccgaggcagagaag
cggagggaggagcgggggttgggcaggcgggacggcggcggcggccgccgcacacttacc
ggggctgttggcctcgcgttcgaggacgtgcagctcggcgcagtcggccagcgagtgctc
caggcagagctggaggaagcgggcctgggtctggctcacgcgcttgagcagcgacagcag
cgcaacagtctgctcgcactcgttccagcccttaaaccagcccgccagcaccccgacctg
gtcgcgaaacatcatgtttggcggagccgcagccccgcggccggagacgcgttcctgttc
caaggtggctctggtgcccggacgctgcagctttgggatggtgatttccggcgtcctggg
gtctcctccccgctgggcggcaactcggacccttcacgcccgcattcccgagcggcgatc
accactggaaggagagaaagagctgcagtgagacctcggctagcggaggagcgcgccctg
gggggacgagggcatcagagggtgctggggccaggggagggggcgtcccgcggaaggttg
ggttcgccgaaatccagccgccccctccgccccctccgcacccgatgtgggagatgggct
attccccaccgggtaacagcgagaagcctgcttcccagagaagcagccgagcggctgcag
ctccatctacagcaacagccagcatctccaccgccaggcgcccctctggatttaggccgc
cgagggcggcacagcgctgggcacacgccccgcgaggggacggccggggtccgcagcgct
gctcccttcggccagcggggcggccccgcggggaatgcggatcggcgccgcaggcacacg
ccccccaacgacagcaaagaaagttcggctgcgggggttcccccagtcccatcccggttc
tcgtggccgccgccggcggggaggagaacccagcgactcagccttcaaggagccgctagc
ttccaacaacacgctgcttccggctcccagaacttctcgggcggagggggaagcccccgg
ggaggggacccccagtga

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_6|245_aa
MVFSAPFGGKIFTGLCPGSWLVRRWSLESRVQPASGDSQGVDSPELGLPLDSEHEALAER
GIKERQEFSAFKVISGLHSLTERFILLNEQHSGHQWPSEWAEIPWAAPDPSKPLNQPLHK
NRSLEKAFEETLPYASLFPSIVASGISKAWGVLLLKGCLCISSTGGITWEVLDTQILRPH
PSPAESEPAFQQEPQRGLHQGRELNNGMDEEYLLQQYNSASDEIADALKVKHETIGEAAM
EIGLQ

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_6|738_bp
atggtcttcagtgctccctttggagggaagatcttcactgggctgtgtcctggctcttgg
ttggtgaggaggtggagtttggagagcagggtgcagcctgccagtggagattctcaggga
gtggactcaccagagctgggattaccactggactcggaacatgaggccttggccgaacgt
ggcataaaggagagacaggagttcagtgctttcaaggttatcagtggccttcacagcttg
acagagcgcttcattctgctgaatgagcaacactcaggacaccagtggccttccgagtgg
gcagaaattccttgggcagctccagatcccagcaaaccacttaatcagcctttacataaa
aatcgtagcctagagaaggcatttgaggaaactcttccttatgctagccttttcccctcc
attgtggcctctggaatcagtaaggcgtggggagttttgctcctcaaagggtgtctgtgc
atcagcagcactggtggcatcacctgggaggtgttagatacacagattctcaggccccac
cctagccctgctgaatcagaacctgcatttcaacaagaaccccagaggggtctgcaccaa
ggaagggaattgaataatggcatggatgaagaatatttactgcagcagtataactctgca
tcagatgaaattgctgatgctctgaaagtcaagcatgagactattggggaagcagctatg
gaaataggccttcagtag

>gi568815584f:54409960_54638380|GENSCAN_predicted_peptide_7|86_aa
MVLDSHRSADPIVNCACKGSRLHTPYDQPPSLSPLPWPVETLSSMKPVPGAKKQNQQAPP
PPSPLPPRHQAPHSLTWLTGMALAPN

>gi568815584f:54409960_54638380|GENSCAN_predicted_CDS_7|261_bp
atggtattagattctcatagaagcgcagaccctattgtgaactgtgcatgcaagggatct
aggttgcacactccttatgaccaacccccatccctgtccccactaccctggcccgtggaa
actttgtcttccatgaaaccagtccctggtgccaaaaagcaaaaccagcaggctccacct
cctccatccccactaccccctaggcaccaggcaccccactctctgacatggctgactggc
atggcattagctcctaactaa