GENSCAN 1.0 Date run: 6-Nov-116 Time: 07:57:36 Sequence gi568815586f:130062943_130264685 : 201743 bp : 47.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 310 305 6 1.05 1.06 Term - 1618 1493 126 1 0 90 38 107 0.006 4.18 1.05 Intr - 18276 18149 128 1 2 35 89 101 0.544 5.30 1.04 Intr - 19001 18896 106 2 1 96 115 -57 0.337 -2.41 1.03 Intr - 22318 22250 69 1 0 85 96 62 0.650 6.08 1.02 Intr - 25702 25587 116 2 2 46 85 42 0.102 -0.13 1.01 Init - 34853 34664 190 2 1 38 48 173 0.276 7.50 1.00 Prom - 39308 39269 40 -5.56 2.08 PlyA - 39385 39380 6 1.05 2.07 Term - 41415 41237 179 1 2 107 48 64 0.943 2.15 2.06 Intr - 42996 42827 170 2 2 27 35 174 0.248 5.59 2.05 Intr - 52042 51879 164 1 2 46 69 64 0.035 -0.93 2.04 Intr - 56540 56422 119 0 2 61 78 150 0.520 11.48 2.03 Intr - 57371 57245 127 2 1 86 30 34 0.244 -2.35 2.02 Intr - 58147 58119 29 2 2 135 83 6 0.323 2.63 2.01 Init - 61711 61654 58 1 1 62 94 61 0.323 5.62 2.00 Prom - 70328 70289 40 -5.46 3.03 PlyA - 74126 74121 6 1.05 3.02 Term - 74837 74567 271 1 1 91 44 178 0.529 8.66 3.01 Init - 82061 81922 140 2 2 64 55 83 0.304 2.21 3.00 Prom - 93793 93754 40 -3.36 4.00 Prom + 94990 95029 40 -5.06 4.01 Sngl + 100001 101746 1746 1 0 98 47 3595 0.997 349.18 4.02 PlyA + 102384 102389 6 1.05 5.00 Prom + 120567 120606 40 -5.06 5.01 Init + 120858 120960 103 2 1 99 68 66 0.244 4.14 5.02 Intr + 124084 124145 62 2 2 89 96 -2 0.069 -0.85 5.03 Intr + 131585 131709 125 1 2 88 36 78 0.078 1.98 5.04 Intr + 135148 135281 134 1 2 71 115 69 0.430 8.19 5.05 Term + 135398 135453 56 0 2 119 54 37 0.796 1.12 5.06 PlyA + 137162 137167 6 1.05 6.06 PlyA - 140676 140671 6 1.05 6.05 Term - 140961 140810 152 1 2 96 43 81 0.639 2.47 6.04 Intr - 141862 141821 42 2 0 90 55 77 0.773 2.81 6.03 Intr - 148486 148403 84 2 0 60 60 66 0.252 0.79 6.02 Intr - 153192 153029 164 2 2 55 84 66 0.408 2.52 6.01 Init - 156906 156749 158 0 2 77 107 124 0.975 10.58 6.00 Prom - 175922 175883 40 -4.46 7.08 PlyA - 176225 176220 6 1.05 7.07 Term - 178323 178039 285 0 0 65 49 161 0.848 5.30 7.06 Intr - 179609 179477 133 1 1 39 91 64 0.027 2.45 7.05 Intr - 185207 184703 505 0 1 66 86 540 0.081 43.64 7.04 Intr - 188206 188008 199 1 1 90 105 -1 0.397 0.72 7.03 Intr - 192058 191984 75 1 0 68 94 57 0.344 4.01 7.02 Intr - 196308 196130 179 1 2 30 81 63 0.394 -0.56 7.01 Init - 196833 196701 133 0 1 65 85 86 0.632 6.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 33552 33444 109 2 1 17 45 221 0.855 8.68 S.002 Term - 185207 184699 509 0 2 66 47 539 0.905 42.17 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_1|244_aa MRAQEDAFEIPETALSLPLSGDERAKVLPSFAKGQPGSSRIEFLQGHRRFDAALTPPGPS TVPAAQPVVLFDGSPDDLVQHLRIFHSCQACVSTASPCNWYLNGEFSNVGAESSGLRMET RTAAQKLPLSSYHTRHLPCVGLDCSFSGILNTQTPTSVKKGSSTWVTCDLLQMTPEKLGL PAPQIGQILPNTMNYTKHGTLMQAVVTLDPAELPILGSKGWKLDNIGVVSITAAPLIGSL LLKP >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_1|735_bp atgagagctcaggaagatgcctttgaaattcctgaaacagccctctctcttcccctgtct ggagacgagagggccaaggtgctcccgtcatttgccaagggacagccaggcagctcaagg attgaatttctccagggccaccgtcgctttgatgccgccctgacacccccagggcctagc acagttcctgctgcccagcctgtggtcctttttgacggtagccctgatgacctggtacag cacttacggattttccactcctgccaggcatgtgtgagcacagcctccccgtgcaattgg tacctgaacggggaattctccaatgttggtgctgagtcatcagggctgcggatggagacc cggactgcagctcagaagctccctttgagttcttatcacacaagacacctaccctgtgtg gggctggactgttcattttcgggcattttaaatacacagactccaacttctgtcaaaaaa ggttcatctacctgggttacatgtgacctgctacagatgacacctgagaagctgggtctt ccagctccccaaatcggacaaattcttcctaacacaatgaactacacgaagcatggtact ttgatgcaagctgtggtcaccctggacccagcagagctcccaatcttgggtagcaaggga tggaagctggacaatattggagttgtgtccatcacagctgccccacttataggttcgctg ctgctaaagccatag >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_2|281_aa MHAFHDYTPLAVLSHMESGDFASSGHSVEGLLCAREWCPPAQHTGSIQGTGSGDQGATCT FASPDRGRDTGGPQIQHKGEGAALIFMEGDEAKPAVVTEEGCVGGGADGVESRPCVVGPI VAPTLQTKKLSPERLRNLLRVTELVGGNRVSSPGILPPELSVSIIKSAALMPRTRQKTLS WKVNIQDWGPLRALTFHTWLVPASDPPPSDFDPCQGEEELFQALRDSLLHSSGKGKSGDT WQMPPGRQGLHLVTAPYDVKTRARCFRRSSLKPSLIVTKST >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_2|846_bp atgcatgccttccatgactacacccccctggccgtcctctcccacatggagtcaggagat tttgcctcttctgggcattccgtggagggcttgttgtgcgcgagggaatggtgtcctccg gcccagcacacgggcagcatccaaggcactgggtcaggcgaccaaggagctacctgcact ttcgccagcccagacaggggacgggacaccggagggccccagatccaacacaaaggagag ggagccgcgctcatcttcatggaaggggatgaagccaaacccgccgtcgtcactgaggag ggctgcgtgggaggcggtgccgatggggtggagtccaggccctgcgtggtcggcccaatt gttgcccccacgttgcagacgaagaaactgagccctgagaggctgagaaacctccttcgg gtcacagagcttgtgggtggcaatcgggtttctagccctggcattctgccaccagagctt tctgtatctattataaagagcgcggcgctcatgcccaggacccgtcagaagacgctgtcc tggaaggtgaacattcaagactggggtcccctgcgagcgctgacttttcacacctggctt gttccagccagtgacccacctcctagcgactttgacccctgccagggcgaggaggagctt ttccaagcacttagggactcacttctgcacagctcagggaaagggaaaagtggagacacc tggcagatgccacctggtcgccaaggtttacacctcgtgactgccccatatgatgtgaag acaagggcacgttgcttccggcgttcttccctcaaacccagtctaattgtgacgaaaagc acctga >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_3|136_aa MATTLEMKKEKNDEEKKMHQSSRASPAQSLLQTALFVLAGLCLYWSGQIFGYILTLFSKT QEPPPPFHLAIIPFPDLHPSIQALGLAPGLQAASPLKLHAAKSEPEPTYSPFHATNFWRP GAFESPTENTFLTQFF >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_3|411_bp atggcaacgactttggaaatgaagaaagaaaagaatgatgaagaaaaaaagatgcatcaa tccagtcgggccagccctgctcagagcctgctgcaaacagccctgtttgtgttggctggt ctgtgcctctattggtcaggacaaatctttggttatattctcaccctcttctccaagacc caggagccacctccccctttccacctggctatcatcccattcccggatctgcaccccagc atccaagcactggggctggcccctggcctccaagcagcttcacccctgaagctgcacgcg gccaagtccgagcccgagcctacttattctccatttcatgcaacaaatttctggagacct ggagcctttgaatcacccacagaaaacacctttctgactcaattcttctga >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_4|581_aa MQRPGPRLWLVLQVMGSCAAISSMDMERPGDGKCQPIEIPMCKDIGYNMTRMPNLMGHEN QREAAIQLHEFAPLVEYGCHGHLRFFLCSLYAPMCTEQVSTPIPACRVMCEQARLKCSPI MEQFNFKWPDSLDCRKLPNKNDPNYLCMEAPNNGSDEPTRGSGLFPPLFRPQRPHSAQEH PLKDGGPGRGGCDNPGKFHHVEKSASCAPLCTPGVDVYWSREDKRFAVVWLAIWAVLCFF SSAFTVLTFLIDPARFRYPERPIIFLSMCYCVYSVGYLIRLFAGAESIACDRDSGQLYVI QEGLESTGCTLVFLVLYYFGMASSLWWVVLTLTWFLAAGKKWGHEAIEANSSYFHLAAWA IPAVKTILILVMRRVAGDELTGVCYVGSMDVNALTGFVLIPLACYLVIGTSFILSGFVAL FHIRRVMKTGGENTDKLEKLMVRIGLFSVLYTVPATCVIACYFYERLNMDYWKILAAQHK CKMNNQTKTLDCLMAASIPAVEIFMVKIFMLLVVGITSGMWIWTSKTLQSWQQVCSRRLK KKSRRKPASVITSGGIYKKAQHPQKTHHGKYEIPAQSPTCV >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_4|1746_bp atgcagcgcccgggcccccgcctgtggctggtcctgcaggtgatgggctcgtgcgccgcc atcagctccatggacatggagcgcccgggcgacggcaaatgccagcccatcgagatcccg atgtgcaaggacatcggctacaacatgactcgtatgcccaacctgatgggccacgagaac cagcgcgaggcagccatccagttgcacgagttcgcgccgctggtggagtacggctgccac ggccacctccgcttcttcctgtgctcgctgtacgcgccgatgtgcaccgagcaggtctct acccccatccccgcctgccgggtcatgtgcgagcaggcccggctcaagtgctccccgatt atggagcagttcaacttcaagtggcccgactccctggactgccggaaactccccaacaag aacgaccccaactacctgtgcatggaggcgcccaacaacggctcggacgagcccacccgg ggctcgggcctgttcccgccgctgttccggccgcagcggccccacagcgcgcaggagcac ccgctgaaggacgggggccccgggcgcggcggctgcgacaacccgggcaagttccaccac gtggagaagagcgcgtcgtgcgcgccgctctgcacgcccggcgtggacgtgtactggagc cgcgaggacaagcgcttcgcagtggtctggctggccatctgggcggtgctgtgcttcttc tccagcgccttcaccgtgctcaccttcctcatcgacccggcccgcttccgctaccccgag cgccccatcatcttcctctccatgtgctactgcgtctactccgtgggctacctcatccgc ctcttcgccggcgccgagagcatcgcctgcgaccgggacagcggccagctctatgtcatc caggagggactggagagcaccggctgcacgctggtcttcctggtcctctactacttcggc atggccagctcgctgtggtgggtggtcctcacgctcacctggttcctggccgccggcaag aagtggggccacgaggccatcgaagccaacagcagctacttccacctggcagcctgggcc atcccggcggtgaagaccatcctgatcctggtcatgcgcagggtggcgggggacgagctc accggggtctgctacgtgggcagcatggacgtcaacgcgctcaccggcttcgtgctcatt cccctggcctgctacctggtcatcggcacgtccttcatcctctcgggcttcgtggccctg ttccacatccggagggtgatgaagacgggcggcgagaacacggacaagctggagaagctc atggtgcgtatcgggctcttctctgtgctgtacaccgtgccggccacctgtgtgatcgcc tgctacttttacgaacgcctcaacatggattactggaagatcctggcggcgcagcacaag tgcaaaatgaacaaccagactaaaacgctggactgcctgatggccgcctccatccccgcc gtggagatcttcatggtgaagatctttatgctgctggtggtggggatcaccagcgggatg tggatttggacctccaagactctgcagtcctggcagcaggtgtgcagccgtaggttaaag aagaagagccggagaaaaccggccagcgtgatcaccagcggtgggatttacaaaaaagcc cagcatccccagaaaactcaccacgggaaatatgagatccctgcccagtcgcccacctgc gtgtga >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_5|159_aa MASGLRRVCLSAGLLPGALTHAVEFTHSQGLCSPGAPSMDSWELSGPLSRRGRKRAHTPV TSKSCFPIYVVIVSLTPSMQYSFQLARHRITVDKEFRPEAPRPVLGKPQMQKGLEGERRV RFQERPGTRADARGITAQLDAEHEMQELLGCRGSASGLL >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_5|480_bp atggcctctggtctccgacgggtctgcttgtctgcaggtcttctgcccggagccctgaca cacgcagtggagttcacacattcacagggtctgtgctcccctggtgcaccatccatggat tcctgggagcttagtggacccctcagcaggcgaggcaggaaaagggctcacactcctgtc acctccaaatcatgtttccccatctacgtcgttatcgtcagcttaactccttcaatgcaa tattcatttcagttagcaagacacagaataactgtggacaaggagttcagacctgaggca cctaggcctgtgctgggtaaacctcagatgcagaaaggcctggaaggagaaaggcgtgtt cgttttcaggagagacctggaactcgagcagatgccaggggaatcacagcacagcttgac gcagagcatgagatgcaggagctgctgggatgccgtgggagcgcctcgggcttgctttga >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_6|199_aa MKVLGWSGVGWGVVLQLLSAPVLSSGNPCLEPMRWLVTSTTHKDQILQEPGLRTRAIACV THTRMFSWIPYESVHMLVVYHWGALVGFFLFFYCTPSTQLMAVFMNKWIWMKLEAIILSK LSQGQKTKHRMFSLINKEIRSYRVAELIGGQPTCRRCLRSNIVFGSVDQDLDACGVIISL FPALRVYLKALKWCQHATE >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_6|600_bp atgaaggtcttggggtggtcgggggtggggtggggggtggtgctccagcttctgtcggct cctgtcctgagttcaggaaacccctgtcttgagccaatgcgatggctggtcacctctacc acccacaaagatcagatcctacaggagcctgggctccgcacccgggccattgcctgtgtg acgcataccagaatgttctcctggattccttatgaaagtgttcacatgctcgttgtctat cactggggagctctagtggggtttttcctgttcttttattgcacccctagcacgcagctc atggctgtgttcatgaacaagtggatatggatgaagctggaagccatcattctcagcaaa ctatcacaaggacagaaaaccaaacaccgcatgttctcactcataaacaaggaaatccgc tcctaccgggttgccgagcttattggaggtcagccgacctgccgccgttgcctgcgatct aacatagtcttcggttctgtggatcaggacctggatgcctgcggggtcattatttcatta ttccccgcactacgtgtgtatcttaaagctctcaagtggtgccagcatgcaacagaatag >gi568815586f:130062943_130264685|GENSCAN_predicted_peptide_7|502_aa MERREADALKKVVGDQGTGNLEGHCQLKILYSGGPGKPLEGLSRGVKEIVEYVGLEFREV VQPGEENLGLEFREVVQPGEGNLDGLGSSSHIGGNPGLDNWTCPYKQNCLKEKYSLPERR TLYPPIRKQEFCEPVVNTELLKIKLHKCTFKAIVLEIKVEENSASMHSVMTCAPEAHCVE ETLAGGCRHLFPSPSGVAVFFPPPPPPSFSLVPPPPLPSAIIITIITITTTIIITIITIT IIITTITIITIITTITIITIIITITIITSSPPSPSSSHHYHHHRHHHHHHHHHHHHHHHH ITITIIITTVTITTITIITTIITSPSPSSSPSSPSSPSCSPSSPSSSPSSLPSSLLVSLP LRNPQLRQLPRAEAMMPAWRLHVCPYLQLGLHHTCISIAVTSIVIYIQGLLGPPLPLDQR TLEHPEPRPALSCPPPPPIGGSISSHGFNDHLSGNDSQVDIISLNLSLNPRLASPHGSLT APGGGPMGMQREGFDWRNKDRK >gi568815586f:130062943_130264685|GENSCAN_predicted_CDS_7|1509_bp atggagagaagggaagcagacgccctcaagaaggtagtgggagatcagggcactgggaac cttgaaggccattgtcaactgaaaatcctctactctggtggccctgggaagcctttggaa ggtttgagcagaggtgttaaagaaatagttgaatacgtgggtctggagttcagggaagtg gtccagcctggagaagaaaatttgggtctggagttcagggaagtggtccagcctggagaa ggaaatttggatggattggggagtagctcgcatattggtggtaatccaggtcttgacaat tggacatgtccatataaacagaactgccttaaagaaaaatacagcctcccagagcgcaga actctttatccccccattcgcaaacaggaattttgtgagccagttgtaaacacagaatta ttaaaaattaaattacataaatgtacatttaaagcaattgtattggaaataaaggtagaa gagaactcagcttcaatgcactccgtgatgacctgtgctcctgaggctcactgcgtggag gaaacgctcgccgggggctgcaggcatctcttcccatctccgtctggtgtggcagttttc tttccccctcctcctcctccttctttttctcttgttcctccccctccacttccttctgcc attatcattaccatcatcaccatcactaccaccatcattattaccatcatcaccatcacc atcatcatcaccaccatcaccatcatcaccatcatcaccaccatcaccatcatcactatc atcatcaccatcaccatcatcacatcatcaccaccatcaccatcatcatcacatcactat catcaccaccgtcaccatcaccaccatcaccaccatcaccatcatcaccatcatcatcac atcaccatcaccatcatcatcaccaccgtcaccatcaccaccatcaccatcatcaccacc atcatcacatcaccatcaccatcatcatcaccatcttcaccatcatcaccatcatgctca ccatcatcaccatcatcatcaccatcatcactaccatcatcactactagtttctttaccc ctaaggaacccgcagctgcggcagctaccaagagcagaagcaatgatgccggcctggagg ctgcatgtgtgtccatatctacaactcggtctacaccacacctgcatttccatcgctgtc acttctatcgtcatctatatccagggcctactgggccctcctcttcccctggaccagagg accctggagcacccagagcccaggccagccctctcatgtccacctccacccccaatcggg gggtccatctcttctcacggctttaatgaccacctctctggcaacgactcccaagtggat atcatcagcctgaacctcagcctaaaccccagacttgccagtccccatggctccctgaca gctcctggcggaggtccaatgggcatgcagagggagggctttgactggaggaataaggac aggaagtag