GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:13:57 Sequence gi568815583r:63055556_63256575 : 201020 bp : 42.44% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1430 1563 134 1 2 92 78 239 0.999 21.82 1.02 Intr + 4008 4125 118 0 1 104 55 232 0.983 21.05 1.03 Intr + 5314 5384 71 0 2 104 98 137 0.999 13.36 1.04 Intr + 5643 5718 76 0 1 69 79 87 0.976 4.50 1.05 Intr + 6063 6233 171 2 0 62 102 40 0.782 2.02 1.06 Intr + 6660 6722 63 2 0 69 75 119 0.986 6.70 1.07 Intr + 7021 7090 70 0 1 50 116 42 0.996 1.04 1.08 Term + 8509 8591 83 2 2 69 41 169 0.992 7.18 1.09 PlyA + 8638 8643 6 1.05 2.03 PlyA - 8811 8806 6 1.05 2.02 Term - 12492 12195 298 2 1 20 46 194 0.581 2.15 2.01 Init - 15279 15224 56 0 2 53 83 103 0.978 7.11 2.00 Prom - 17966 17927 40 -6.05 3.00 Prom + 19388 19427 40 -7.25 3.01 Init + 20454 20753 300 2 0 40 115 255 0.961 20.70 3.02 Term + 22458 22502 45 2 0 54 55 69 0.374 -3.57 3.03 PlyA + 23112 23117 6 -0.45 4.02 PlyA - 23191 23186 6 1.05 4.01 Sngl - 25543 25040 504 1 0 27 43 252 0.702 10.39 4.00 Prom - 26326 26287 40 -4.75 5.00 Prom + 27204 27243 40 -3.95 5.01 Sngl + 27493 27816 324 0 0 61 48 196 0.599 8.65 5.02 PlyA + 29929 29934 6 1.05 6.04 PlyA - 29942 29937 6 1.05 6.03 Term - 33894 33727 168 0 0 75 44 96 0.251 0.80 6.02 Intr - 34198 34079 120 1 0 80 63 42 0.332 0.67 6.01 Init - 44044 43967 78 1 0 68 84 49 0.460 3.61 6.00 Prom - 46377 46338 40 -3.45 7.00 Prom + 50485 50524 40 -3.75 7.01 Sngl + 54330 54620 291 2 0 60 42 208 0.631 8.80 7.02 PlyA + 55010 55015 6 1.05 8.00 Prom + 57527 57566 40 -1.25 8.01 Init + 66317 66673 357 1 0 79 86 313 0.981 25.35 8.02 Intr + 67081 67147 67 0 1 95 98 20 0.980 1.26 8.03 Intr + 71304 71494 191 1 2 117 79 204 0.993 20.78 8.04 Intr + 71798 72134 337 1 1 58 105 313 0.978 24.17 8.05 Intr + 73930 74095 166 2 1 35 94 85 0.867 1.90 8.06 Term + 85725 86250 526 0 1 103 44 423 0.980 32.15 8.07 PlyA + 86333 86338 6 1.05 9.04 PlyA - 86411 86406 6 1.05 9.03 Term - 100176 99998 179 1 2 135 48 26 0.482 0.17 9.02 Intr - 100966 100858 109 1 1 54 98 93 0.617 5.84 9.01 Init - 116454 116449 6 0 0 84 87 0 0.108 0.44 9.00 Prom - 124185 124146 40 -3.25 10.00 Prom + 129662 129701 40 -6.45 10.01 Sngl + 132110 132367 258 1 0 76 49 219 0.224 11.70 10.02 PlyA + 132947 132952 6 1.05 11.00 Prom + 132968 133007 40 -11.04 11.01 Init + 134070 134241 172 2 1 92 46 228 0.024 18.66 11.02 Term + 152652 152785 134 1 2 121 41 39 0.187 -0.23 11.03 PlyA + 154030 154035 6 1.05 12.05 PlyA - 154631 154626 6 1.05 12.04 Term - 155115 154953 163 2 1 91 37 109 0.239 2.53 12.03 Intr - 187298 187186 113 2 2 45 51 89 0.080 -0.84 12.02 Intr - 187480 187425 56 2 2 118 97 31 0.850 4.78 12.01 Init - 193966 193813 154 1 1 42 70 115 0.799 5.39 12.00 Prom - 196720 196681 40 -4.55 13.02 PlyA - 197123 197118 6 1.05 13.01 Term - 200196 199946 251 1 2 71 49 153 0.805 4.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_1|261_aa AEADVASLNRRIQLVEEELDRAQERLATALQKLEEAEKAADESERGMKVIESRAQKDEEK MEIQEIQLKEAKHIAEDADRKYEEVARKLVIIESDLERAEERAELSEGQVRQLEEQLRIM DQTLKALMAAEDKDSVFIFSHPSPFSLLLPLACLPPFLPLIENISKCAELEEELKTVTNN LKSLEAQAEKYSQKEDRYEEEIKVLSDKLKEAETRAEFAERSVTKLEKSIDDLEDELYAQ KLKYKAISEELDHALNDMTSM >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_1|786_bp gctgaagccgacgtagcttctctgaacagacgcatccagctggttgaggaagagttggat cgtgcccaggagcgtctggcaacagctttgcagaagctggaggaagctgagaaggcagca gatgagagtgagagaggcatgaaagtcattgagagtcgagcccaaaaagatgaagaaaaa atggaaattcaggagatccaactgaaagaggccaagcacattgctgaagatgccgaccgc aaatatgaagaggtggcccgtaagctggtcatcattgagagcgacctggaacgtgcagag gagcgggctgagctctcagaaggccaagtccgacagctggaagaacaattaagaataatg gatcagaccttgaaagcattaatggctgcagaggataaggactcagttttcattttttcc catccctctcctttttctctcctccttcctttggcttgtctcccaccctttctgcctctg atcgaaaacattagcaaatgtgccgagcttgaagaagaattgaaaactgtgacgaacaac ttgaagtcactggaggctcaggctgagaagtactcgcagaaggaagacagatatgaggaa gagatcaaggtcctttccgacaagctgaaggaggctgagactcgggctgagtttgcggag aggtcagtaactaaattggagaaaagcattgatgacttagaagacgagctgtacgctcag aaactgaagtacaaagccatcagcgaggagctggaccacgctctcaacgatatgacttcc atgtaa >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_2|117_aa MHVGFGEGSLTVCRGNRERLEWKNFGENLENPEGQYEFSRKDSGPSDSLLWICSASAAVE LVCMGGPKAGTRHAHILGSVFPSGSMETPSCLKLQVPKLSDAFGLAHQSSCSYGLAL >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_2|354_bp atgcacgtggggtttggtgaaggttcactgacggtgtgccgtgggaacagggagaggcta gaatggaagaactttggagaaaatctggagaatccagagggtcagtatgaattcagcagg aaggacagtggtccctccgacagcctcctctggatctgctctgcttcagctgctgtggag ctggtgtgcatgggtggcccaaaggcagggacaagacatgctcatatccttggaagtgtt ttcccttccggttctatggagactcccagctgtctgaaacttcaagtgcccaaactcagt gatgcttttggtttggcccaccagagcagctgtagctatggcctggccctttga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_3|114_aa MEANRTLLLGLASDVNEGSSALIYSREATRQILWSRGIHCLQREANDREGERCFVCGREA VVVSPEGPAGHRGAGGAEELAPLGGCCMGMGSEGTAHIMMGGLKEPLVSGKANG >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_3|345_bp atggaagccaacagaacactgctcttaggcctggcatcggatgtaaatgaaggcagttca gcccttatttattctagagaagccacgaggcagatcctgtggagccgtggaatccactgt ctccagagggaagctaatgaccgggaaggtgaaagatgctttgtctgtggcagggaagca gttgttgtgtctccggaggggccggctgggcacaggggggccggaggagcagaagagttg gctcctcttggcggatgctgcatggggatggggtcagaggggacagcacacatcatgatg ggaggtctgaaggagcctttggtttctggtaaagcaaatggatga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_4|167_aa MFAQLVEVAVTPPPGAAMVCTIVREASRLKTHVVTSLRLISGVPAILIPGLSVKNVIEAP GQWSVKGKMFPGVRCPGQPEPKAWSRRLAFIPGVGWVPVSCPIPEILFRQKPKLTRKLQL GPPLASPGPAVTEQLASFCQPPCLKVKHTKAGAASPTEGTAKQPLRC >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_4|504_bp atgtttgctcagctagtggaggtggcagtgactcctccacctggagcagcaatggtgtgc accattgttagggaggcgagtaggctaaaaactcacgtagtcacttctctaaggttaatt tctggagtgcccgctatcctcattcctgggctctcagtcaagaatgtaattgaagcccca ggacaatggagtgtgaaaggaaagatgttccctggtgtcagatgccctggccagccagaa ccaaaagcatggagtagacgcttggcttttattcctggtgttggctgggtccctgtgtcc tgccctattcctgaaatcctctttaggcaaaagccaaagctaactaggaagctccagctt gggcctcctctggcatcgccaggaccagctgtgacagaacagcttgcttccttctgccaa cccccttgtttaaaggttaaacatacaaaggcaggagccgcaagcccaacagagggtact gcaaagcagcccctcagatgctaa >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_5|107_aa MHQVSASEKPIAEESSGNEVIWKRKPPSVSATHAREDRPLPQAPVAAKRCGRVRKGVERH SPHFNERRSPSVILSRCRFCCGGGWLQPLRFLPALGDAGAAGLWATL >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_5|324_bp atgcatcaggtgagtgcgtctgagaagcctatagctgaggagtcctcggggaatgaggtc atctggaaacgcaagcctccctcggtttcagcaactcatgcacgtgaagacaggcccctt ccccaggcccctgttgctgcgaaacggtgtgggagggtgaggaaaggtgtagagaggcac tccccacactttaatgagcgcaggagcccgtcagtgatcttgtccagatgcagattctgt tgtggtgggggctggctccagcctctgcgtttcttgccagctctgggtgatgccggtgct gctggcctgtgggccacactttga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_6|121_aa MTLNEHAAFKHLFNKAHLAPPLIHLTFPLVSKLTAKPSSFTVTWTFHEVGTWTHSEDHGE LSLTYLSNIAPSLHKHYPMTSSLPELPGQELLLDTSQDEDLTVSQDKAFACFWPVLTIKK E >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_6|366_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccatttaacctttccactggtatcaaaactgaccgcaaagcccagctccttc actgttacatggacatttcatgaggttggtacctggacacattctgaggaccatggagaa ctgtcacttacataccttagcaacattgctccatctctacacaagcattatccgatgact tcatcccttccagagcttcctgggcaggagctgctgctggacacttcccaggatgaggat ctcaccgtctctcaggacaaggcttttgcttgcttttggcctgttctcaccattaaaaag gaataa >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_7|96_aa MTCNCIRDGELGVSRAGPDNQHWSPSDKRLQCLQEGKPECQKFPLLVDWMADWGPADLQP ADDLSMEIVHIPVLWFQVPQQPLLIGWLNYDPAVGR >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_7|291_bp atgacttgtaattgtatcagagatggagagttgggagtaagtagagcaggacctgacaat cagcactggagtccgtctgataaaaggctacagtgtctacaagaagggaagcctgagtgt cagaagtttccactccttgttgattggatggcagactggggcccagctgatttgcagcca gctgatgacctgtccatggagattgtacacatccctgtactctggtttcaggttcctcag cagcctttgctcataggctggcttaactatgatcctgctgtgggaagataa >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_8|547_aa MYRLMSAVTARAAAPGGLASSCGRRGVHQRAGLPPLGHGWVGGLGLGLGLALGVKLAGGL RGAAPAQSPAAPDPEASPLAEPPQEQSLAPWSPQTPAPPCSRCFARAIESSRDLLHRIKD EVGAPGIVVGVSVDGKEVWSEGLGYADVENRVPCKPETVMRIASISKSLTMVALAKLWEA GKLDLDIPVQHYVPEFPEKEYEGEKVSVTTRLLISHLSGIRHYEKDIKKVKEEKAYKALK MMKENVAFEQEKEGKSNEKNDFTKFKTEQENEAKCRNSKPGKKKNDFEQGELYLREKFEN SIESLRLFKNDPLFFKPGSQFLYSTFGYTLLAAIVERASGCKYLDYMQKIFHDLDMLTTV QEENEPVIYNRARFYVYNKKKRLVNTPYVDNSYKWAGGGFLSTVGDLLKFGNAMLYGYQV GLFKNSNENLLPGYLKPETMVMMWTPVPNTEMSWDKEGKYAMAWGVVERKQTYGSCRKQR HYASHTGGAVGASSVLLVLPEELDTETINNKVPPRGIIVSIICNMQSVGLNSTALKIALE FDKDRSD >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_8|1644_bp atgtaccggctcatgtcagcagtgactgcccgggctgccgcccccgggggcttggcctca agctgcggacgacgcggggtccatcagcgcgccgggctgccgcctctcggccacggctgg gtcgggggcctcgggctggggctggggctggcgctcggggtgaagctggcaggtgggctg aggggcgcggccccggcgcagtcccccgcggcccccgaccctgaggcgtcgcctctggcc gagccgccacaggagcagtccctcgccccgtggtctccgcagaccccggcgccgccctgc tccaggtgcttcgccagagccatcgagagcagccgcgacctgctgcacaggatcaaggat gaggtgggcgcaccgggcatagtggttggagtttctgtagatggaaaagaagtctggtca gaaggtttaggttatgctgatgttgagaaccgtgtaccatgtaaaccagagacagttatg cgaattgctagcatcagcaaaagtctcaccatggttgctcttgccaaattgtgggaagca gggaaactggatcttgatattccagtacaacattatgttcccgaattcccagaaaaagaa tatgaaggtgaaaaggtttctgtcacaacaagattactgatttcccatttaagtggaatt cgtcattatgaaaaggacataaaaaaggtgaaagaagagaaagcttataaagccttgaag atgatgaaagagaatgttgcatttgagcaagaaaaagaaggcaaaagtaatgaaaagaat gattttactaaatttaaaacagagcaggagaatgaagccaaatgccggaattcaaaacct ggcaagaaaaagaatgattttgaacaaggcgaattatatttgagagaaaagtttgaaaat tcaattgaatccctaagattatttaaaaatgatcctttgttcttcaaacctggtagtcag tttttgtattcaacttttggctataccctactggcagccatagtagagagagcttcagga tgtaaatatttggactatatgcagaaaatattccatgacttggatatgctgacgactgtg caggaagaaaacgagccagtgatttacaatagagcaagattttatgtttacaataaaaag aaacgtcttgtcaacacaccttacgtggataactcctataaatgggctggtggtggattt ctgtctacagtgggtgaccttctgaaatttgggaatgcaatgctttatggttaccaagtt gggctgtttaagaactcaaatgaaaatcttttacctggatacctcaaaccagaaacaatg gttatgatgtggaccccagtccctaacacagagatgtcttgggataaagagggtaaatat gcaatggcgtggggtgttgtggaaaggaaacaaacgtatggttcgtgtagaaagcaacgg cattatgcttcacatactggaggggcagtgggtgccagtagtgtcctgctggtccttcct gaagaactggatacagagactataaataacaaggttcccccaagaggaatcattgtttct atcatatgtaacatgcaatctgttggcctcaatagcaccgctttgaagattgcccttgaa tttgataaagacagatcagactga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_9|97_aa MILARDLLHPSLEEEKKKHKKKRLVQSPNSYFMDVKCPGCYKITTVFSHAQTVVLCVGCS TVLCQPTGGKARLTEGISFGILQPSDEIDDYKCLYLH >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_9|294_bp atgattttggctagagatttactacatccgtccttggaagaggaaaagaaaaaacataaa aagaaacgcctagtacaaagtccaaattcttactttatggatgtaaaatgtccaggttgc tacaagatcaccacggttttcagccatgctcagacagtggttctttgtgtaggttgttca acagtgttgtgccagcctacaggaggaaaggccagactcacagaaggtatatcatttggc attctccaacccagtgatgagattgatgattataaatgtctctatcttcactga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_10|85_aa MAEATFRGLLKDWLGQEGSACARGQNPSSLGADTVTLNSLELCSYLVDSRNETFRKQQQQ QQQPSGYCMINQLMMTHFIFYPHRK >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_10|258_bp atggctgaagccacgttcagaggcttgctcaaggactggcttgggcaagagggctcagcc tgtgccagagggcagaatccttcttcattaggggcagacactgtgactttgaactcatta gagctttgttcctatctagtggactctaggaatgagacattccgtaagcagcagcagcag cagcagcagccctctggctactgtatgatcaatcagttaatgatgacacatttcatcttt tacccgcatagaaaatga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_11|101_aa MAKTYDYLFKLLLIGDSGVGKTCLLFRFSEDAFNTTFISTIGEGGAAARDRVENWGAGIL NSNPGTLNICAGWTVATFSAGQCKGFRGPYMRRNTIFKSLS >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_11|306_bp atggcgaagacgtacgattatctcttcaagctcctgctgatcggcgactcgggggtaggc aagacctgcctcctgttccgcttctcagaggacgccttcaacaccaccttcatctccacc atcggtgagggaggggccgcggcccgggaccgggtagagaattggggggcgggaatccta aactctaacccagggaccctcaacatctgtgctgggtggactgtggccacattttcagct ggccagtgtaagggttttagaggcccttacatgagaagaaatacaatttttaagtctctg agttga >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_12|161_aa MRHRGSPLHICKLEVVVATGACQELTMTWVKNSDLDKSSNRKIHRKILLGTDRTILLQEN MLTAPADSTLSLEEKLSSMKPVPGDKKVGDHCYIEPLKVVSKKFQENTVFHVRNPATFPT SKLSQCFGVKKLKNDPQKVGKLEVEGTWTAPKVPLCRQQEA >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_12|486_bp atgagacacagagggagccctttgcacatctgcaagctggaggtggtggtggccacagga gcctgtcaagagctcacaatgacctgggtgaaaaactcagaccttgataaaagcagcaac agaaaaatacacagaaaaatcctgctaggaactgataggaccatcttgttgcaggaaaac atgctcacagctcccgctgattctacattgtccctggaggaaaaactgtcttccatgaaa ccagtccctggtgacaaaaaggttggggaccactgctatatagaaccccttaaagtggtt tccaagaaatttcaggaaaacacggtcttccatgtcagaaaccctgccacgttccccaca tccaaattatctcagtgctttggtgtgaaaaaattgaaaaatgatccacagaaggttggg aagctggaggtggaaggtacctggacagcaccaaaggttcccttgtgcagacagcaggaa gcctag >gi568815583r:63055556_63256575|GENSCAN_predicted_peptide_13|83_aa XVRLAAPEIYGALQRERAQPAHALLLSSSAALQTEEPFTSTQQDSSPMSPFSFHTYLFNV SDPIFNIVKGFFICDVIDQHNAL >gi568815583r:63055556_63256575|GENSCAN_predicted_CDS_13|252_bp ntagtcagactggcagcaccagagatttatggagctcttcagagagagagggcccaacct gcccatgccctgcttctaagcagctcagctgccttgcagacagaggagccttttacaagc acccagcaggactcttctccaatgtcaccattctctttccacacatacctcttcaatgtt tctgatccaatttttaatattgtcaaaggatttttcatttgtgatgtcatagaccagcat aatgccctataa