GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:38:37 Sequence gi568815585r:45683386_45951629 : 268244 bp : 43.47% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 2175 2170 6 1.05 1.01 Sngl - 4664 4323 342 2 0 97 34 311 0.956 22.63 1.00 Prom - 14922 14883 40 -3.06 2.00 Prom + 15576 15615 40 -5.56 2.01 Init + 25993 26010 18 0 0 113 75 13 0.627 2.63 2.02 Term + 29797 30987 1191 0 0 113 54 1966 0.989 187.43 2.03 PlyA + 31138 31143 6 1.05 3.03 PlyA - 31614 31609 6 -0.45 3.02 Term - 32017 31818 200 2 2 68 39 140 0.212 4.66 3.01 Init - 47788 47653 136 1 1 57 113 40 0.147 3.71 3.00 Prom - 63091 63052 40 -3.76 4.05 PlyA - 63943 63938 6 1.05 4.04 Term - 71800 71298 503 2 2 43 39 293 0.510 14.54 4.03 Intr - 72756 72470 287 2 2 54 -32 205 0.435 2.09 4.02 Intr - 73501 73374 128 1 2 67 84 72 0.326 4.18 4.01 Init - 83836 83690 147 1 0 57 73 77 0.388 3.19 4.00 Prom - 86320 86281 40 -1.26 5.03 PlyA - 86982 86977 6 1.05 5.02 Term - 100672 99998 675 1 0 130 54 424 0.668 36.92 5.01 Init - 105889 105791 99 1 0 71 43 46 0.248 -1.34 5.00 Prom - 120198 120159 40 -1.56 6.00 Prom + 122224 122263 40 -3.36 6.01 Init + 123152 123285 134 1 2 71 -7 89 0.238 -2.69 6.02 Intr + 126778 126933 156 1 0 94 68 98 0.394 7.53 6.03 Intr + 133357 133414 58 1 1 42 45 108 0.090 0.79 6.04 Intr + 134651 134786 136 1 1 66 38 109 0.212 3.84 6.05 Intr + 155430 155620 191 1 2 127 70 84 0.351 9.80 6.06 Intr + 167848 168087 240 0 0 79 93 106 0.062 7.95 6.07 Intr + 170291 170318 28 1 1 75 100 -5 0.006 -2.91 6.08 Intr + 178256 178387 132 0 0 68 84 72 0.963 5.42 6.09 Term + 181241 181395 155 0 2 136 53 83 0.984 7.68 6.10 PlyA + 182863 182868 6 1.05 7.09 PlyA - 183139 183134 6 1.05 7.08 Term - 185204 185174 31 1 1 117 43 7 0.167 -3.57 7.07 Intr - 189607 189515 93 0 0 62 121 22 0.327 1.98 7.06 Intr - 190873 190744 130 2 1 31 100 78 0.294 3.05 7.05 Intr - 193406 193242 165 0 0 43 119 33 0.101 1.83 7.04 Intr - 213495 213381 115 0 1 84 84 109 0.643 10.12 7.03 Intr - 225235 225146 90 1 0 49 80 72 0.062 2.69 7.02 Intr - 227486 227374 113 0 2 95 79 14 0.070 1.30 7.01 Init - 227687 227636 52 2 1 78 99 17 0.069 2.22 7.00 Prom - 233100 233061 40 -3.16 8.02 PlyA - 233782 233777 6 1.05 8.01 Sngl - 236251 235922 330 1 0 39 36 282 0.982 14.12 8.00 Prom - 237259 237220 40 -6.16 9.03 PlyA - 241363 241358 6 1.05 9.02 Term - 250343 250239 105 2 0 86 49 56 0.271 -0.09 9.01 Init - 258197 258036 162 2 0 70 113 104 0.876 8.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_1|113_aa MEELANAIGISTFNHLQVERILNKPGSKCKLAVNQIQCHPYLTQEKFIQYCQSKGITVTA CSCLGSPDKPRAKPKDPSLLEDPRIEAITANRNKTTAQILIRFPMQRSLVVIP >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_1|342_bp atggaagagctggcgaacgctattggcatctccaccttcaaccatctgcaggtcgagagg atcttaaacaagcctggctcaaagtgtaagctggcagttaaccagattcagtgtcacccg tacctaactcaggagaagtttatccagtactgccagtccaaaggtatcacagtgactgcc tgtagctgccttggctctcctgacaagccccgggccaagcccaaggacccttccctactg gaagatcccaggatcgaagcaatcacagccaaccggaataaaactacagcccagattctg atccggttccccatgcagaggagcttggtggtgattccctag >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_2|402_aa MAKYGQRGTAEPFPRLHNLYSTPRCAQQAALPRLSRRMASQHSYPLNRFSSVPLDPMERP MSQADLELDYNPPRVQLSDEMFVFQDGRWVNENCRLQSPYFSPSASFHHKLHHKRLAKEC MLQEENKSLREENKALREENRMLSKENKILQVFWEEHKASLGREESRAPSPLLHKDSASL EVVKKDHVALQVPRGKEDSTLQLLREENRALQQLLEQKQAYWAQAEDTAAPAEESKPAPS PHEEPCSPGLLQDQGSGLSSRFEEPKGPPARQEDSKELRALRKMVSNMSGPSGEEEAKVG PGLPDGCQPLQLLREMRQALQALLKENRLLQEENRTLQVLRAEHRGFQEENKALWENNKL KLQQKLVIDTVTEVTARMEMLIEELYAFMPARSQDPKKPSRV >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_2|1209_bp atggccaaatatggacagaggggcacagccgaacccttcccgaggctccacaacttgtac agcacccctcgctgcgcgcagcaggccgccctgccccggctgagccgcaggatggcgagc cagcactcctatccactgaaccgcttctcctccgtgcctttagaccccatggagcgcccc atgtcccaggccgacctggagctggactacaacccgccgcgggtgcagctcagcgacgag atgttcgtgttccaggacgggcgctgggtaaatgagaactgccgcctgcagtctccctac ttctccccatccgcctccttccaccacaagctgcaccacaagaggctggccaaggagtgc atgctgcaggaggagaacaagtctctgcgggaggagaacaaggccctgcgcgaggagaac cggatgctcagcaaggagaacaagatcctacaggtcttctgggaggagcacaaggcctcg ctgggccgagaggagagccgggccccctcgccactgctgcacaaagacagcgcgtccctg gaggtggtgaagaaggaccacgtcgccctgcaggtgccccgtggcaaggaggacagcacc ctgcagctcctccgggaggagaatcgcgcgctgcagcagctgctggagcagaaacaggcc tactgggcgcaggcagaggacacggccgcccctgccgaggaaagcaagcccgccccctca ccccacgaggagccctgcagccccgggctgctgcaggaccagggctccggcctctcctcc cgcttcgaggagcccaaagggcctccggcccggcaggaggactccaaggagctgcgcgcc ctgcggaagatggtcagcaacatgtccgggccctccggggaggaggaggccaaggtgggc ccgggcctgcccgacggctgccagcccctgcagctgctgagagagatgaggcaggcgctg caggccctgctcaaggagaaccggctcctgcaggaggagaacaggaccctgcaggtgcta cgggcagagcacaggggcttccaggaggagaacaaggccctgtgggagaacaacaagctg aagctgcagcagaagctggtcattgacaccgtgaccgaggtcaccgcgcgcatggaaatg ctcatcgaggagctctacgccttcatgccggccaggagccaggaccccaagaagcctagc agggtctga >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_3|111_aa MGSDILIMLNSLTSFYHKLYTSSGTIQKQTEVGRKVVTFPTGVQEASEIGDDEGARRIGE REEEEEKTKEKKEELKEKNRDNNEGRGKEETRGKERRSQIHNCLESNQDVL >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_3|336_bp atgggcagtgacatcctgatcatgcttaacagtctaacttccttctatcacaaactctac acttcctctgggacaatccagaaacaaacagaagtgggcaggaaggtggtcacttttccc actggagtccaagaagcctcagaaattggagatgatgagggagcaagaaggataggagaa agagaagaggaggaagaaaaaactaaggaaaaaaaggaagaactgaaggagaaaaacaga gacaataatgaaggcagaggtaaagaagagaccaggggaaaagagagaagaagccagatt cataattgtcttgagagcaaccaagatgttctttag >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_4|354_aa MQAPQGQSLKPAVEALLRQGLREPNTGEVKEEVCYHTPGCKNLLLVLINKTDGSWRMTVD YRKVNQVVIPIAAAVPDVVSLLEQINTSPGTWPYDPADPMVFEVSDRDAVWSLWQALIGE SQQRPLGFWSKALSSSADNYSPLERQLLAYYWALVETERLTMGHQVTMRPELPIMNWVLS DPSSHKAATPVIAQWAHEQSGRGGRDGGYAWAQQHGLPLVKARATTECPICQQQRPTLSP RYGTIPWSDQLATWWQVDYIGPLPSWKWQKFVLTGIDAYSRYGFAYAAHNASAKTTIRGL MECFVHCHGIPHSIASDQGTHFTAKEVQQWARAYGIHWSYHVPHHPEAAGLLEQ >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_4|1065_bp atgcaggccccacaaggacaatcactgaagcctgccgttgaagccttgttgagacaaggc cttagagagcctaacacaggtgaggtcaaggaggaagtctgttaccacactcctggttgt aaaaacctgctgcttgtacttattaataagacagatggatcttggagaatgacagtggat tatcgtaaggtcaaccaagtggtgattccaattgcagctgctgtaccagatgtggtttca ttgcttgagcaaattaacacatctcctggtacctggccatatgacccagcagatccaatg gtgtttgaggtgtcagatagggatgctgtttggagcctttggcaggccctcataggtgaa tcacagcagaggcctctaggattttggagcaaggccctgtcatcttctgcagataactac tctcctcttgagagacagctcttggcctattactgggctttggtggaaactgaacgtttg actatgggtcatcaagtcaccatgcgacctgaactgcccatcatgaactgggtgctttct gacccatctagccataaagccgccacccctgtcattgctcaatgggcccatgaacaaagt ggccgtggtggcagagatggaggttatgcatgggctcagcaacatggacttccacttgtc aaggctagagccactactgagtgcccaatttgccagcagcagagaccaacactgagccct cgatatggcaccattccttggagtgatcagctagctacctggtggcaggttgattatatt ggacctcttccatcatggaaatggcagaagtttgtcctcactggaatagatgcctactcc agatatgggtttgcctatgctgcccacaatgcttctgccaagactaccatccgtggactc atggaatgctttgtccactgtcatggtattccacacagcattgcctctgaccaaggcact cacttcacagctaaagaagtgcagcagtgggctcgtgcttatggaattcactggtcttac catgttccccatcatcctgaagcagctggattgttagaacagtag >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_5|257_aa MKISSIYVARAESDMEEKEHKIHFLIKMFLSQVYVSSRRAVTQSAPEQGSFHPHHLSHHH CHHRHHHHLRHHAHPHHLHHQEAGLHANPVTPCLCMCPLFSCQWEGRLEVVVPHLRQIHR VDILQGAEIVFLATDMHLPAPADWIIMHSCLGHHFLLVLRKQERHEGHPQFFATMMLIGT PTQADCFTYRLELNRNHRRLKWEATPRSVLECVDSVITDGDCLVLNTSLAQLFSDNGSLA IGIAITATEVLPSEAEM >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_5|774_bp atgaagatctcatccatctacgtagcaagagctgaatctgatatggaggagaaggagcat aagatacattttctaattaagatgttcctatcccaggtgtatgtgtccagtcggcgcgcc gtcactcagagcgctccagagcaaggcagcttccaccctcaccatctctcccaccaccac tgccaccaccgccaccaccaccacctccgccaccacgcccacccccaccaccttcaccac caggaggcggggctgcacgccaacccggtgacgccctgcctgtgcatgtgtcccttgttc tcctgccagtgggaaggccgcctggaggtggtggtgccccacctgcggcagatccatagg gttgacatcctccagggagccgagatcgtcttcctggccacggacatgcacctccccgcg ccggctgattggatcatcatgcactcctgccttggccaccactttctgttggtgctgagg aaacaggagaggcatgaagggcacccccagttctttgccaccatgatgctgattgggacc cccacccaggccgactgcttcacctatcgcctggagctcaacagaaaccatcggcgcctc aagtgggaggccacgccccggtctgttcttgagtgcgtggactcggtgattacggacggg gactgcctcgtcctcaacacctcgctggcacagctcttctctgacaacggcagccttgcc attgggattgccatcaccgcgacagaggtcctcccctcagaagctgaaatgtga >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_6|409_aa MAPIDIGDYESGEGGRKTMVEKVSIACYAHYLDDRIIHIPSLSDTRHQPPEDAQSHLDYP KEIRDEWPSPIRHSLGRSIMASQLNGWMKSCSAPVPGQPIQCDSSEYDDHGEKQPTRELL TNDVEKLEENTQMPPLEQDNSEVSYITSQGPQGDCSPPARSGSMWTQSLLSLAFDPDTWL VGIIPLSFQKPTSHPDASFTRFFSQQCLKKKMLEGHKGKRLPETKVDSVGERHQTINPAF PASWGNNTRSLTAPNHLESESEPRQTRRVPDTVPAPETRVCKGLHTFAGGSLPPPRKDLN LSPVKMYCFRHPARKSLTLEEARHHAIRTLLQLCKRPCEGQPRSSVNASINSASTRQPSL AYRSFYLQRKRMKAEECGSKEGARDYERVLKKFHPKERVANRIPSRRQP >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_6|1230_bp atggcaccaatagacattggggactatgagagtggggagggtggaaggaagacaatggtt gaaaaagtatctattgcgtgctatgctcactacctggatgacaggattattcatatacca agccttagtgacacgagacaccagcctcctgaggatgctcagtcccacctggactaccca aaggagatcagagacgagtggcccagtcctatccgtcattcactgggacgctccatcatg gctagtcagctgaatggatggatgaagtcctgttcagcaccagtgcctggacagccaatc cagtgtgacagctcagagtatgatgaccatggggagaaacagcccacgagggagctattg acaaatgatgtggagaaattggaggaaaatacccagatgcctcctctggagcaggacaac tctgaggtatcctacatcacctcccagggtccccagggagactgcagcccacctgcccgc agtgggtcaatgtggacacagtccctcctttctctggcttttgacccggatacatggctt gtaggcatcatcccgctttccttccagaagcccacctctcaccctgatgcctccttcact cggttcttctcacagcagtgtttgaaaaagaaaatgttggaagggcataaggggaagcgt ttaccagaaaccaaggttgacagcgtcggggaaagacatcagaccatcaatccagctttt cctgccagctggggaaacaacactcggtcactaacagcacccaatcacctagagagcgag agtgagccgaggcaaacacgccgggttccagacaccgtcccagcccccgagacccgggtt tgcaaagggctgcacacgttcgccggagggtcgctgccgcctccgagaaaggacttgaac ctgagcccggtgaagatgtattgtttccgtcatcctgctagaaaatcactcactctggag gaagccagacatcatgctataaggacccttttgcagctgtgtaagaggccatgtgaaggg cagccgaggtcttctgtcaatgccagcatcaactcagcatcaactcgccagccatctctt gcctacagatctttctacctgcaaaggaagaggatgaaggcagaagaatgcggcagcaag gagggagcgagagattatgagagggtgctaaagaaatttcatccaaaggagagagtggcc aacaggatcccctccagaaggcagccctga >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_7|262_aa MAGWEGGGLDSGPASQADAAQSADILSCARINRRKMSGQNKGGAGAHSPPTQFEVTLNLI THLGIVHQKQGSPENLTSIFQEGLELLGKHQSEPSIEIHLNRPLQFDLQYITFSGSLTPS FRFYNTFYQSLVVSAPCHLILSIGLINDLNCDSGLLIKSAKHWGFFASSNAKEIKCFAER RFEIFGKAQCSKGSPYSQKGFTSFDPTFAFQQLSFSPGKLVQSQTLLQCVTGGGEALAAS EYFGTHSIQALRSPRTFSMNVG >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_7|789_bp atggcgggctgggaaggagggggcctggattcagggccagcttcacaggcagatgctgct cagagcgcggacattctatcctgtgcccgcatcaacaggagaaagatgtcagggcaaaac aagggaggcgctggggcacacagtcctcccactcagtttgaagtgacattgaatctgatc actcacctaggaattgttcaccagaaacaaggatctcctgaaaacctcaccagtatcttc caggaaggacttgagttattgggtaagcaccaatctgaaccatccatagaaatacacctc aatcggcctcttcagtttgacctgcagtacatcaccttctctggtagtctgacacccagt ttccgcttctataacacattttatcagtctttggtggtcagtgctccttgccacttgatc ttgtccattggccttataaatgatctcaactgtgactctgggctactgattaagtctgct aaacattggggattttttgcctctagcaatgctaaagaaatcaagtgttttgctgaaagg cgatttgaaatatttggtaaagcccaatgtagcaagggcagcccttattctcagaaggga ttcacctcttttgatccgacatttgcatttcaacagctgagcttttcccctgggaagctc gtccagtctcagacccttcttcagtgtgtgactggaggaggggaggctctggctgcttct gagtactttgggacacactctattcaagctttgagaagccccagaaccttctcgatgaat gtaggttag >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_8|109_aa MPIVKQAKIYHNKLKVEGNCEYLTGWLQKFEKRYSIKFLRTHGAKASTDHKAAEKLIDKA AKVIADENLTPEQLSNADEISLFGHYSPQKTLTTADETVPIGIQMPRTE >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_8|330_bp atgccgatcgtgaaacaagcaaagatctatcacaataaactaaaggttgaagggaactgt gagtatttaacaggctggttgcagaaatttgagaaaagatacagcattaagtttttaagg actcatggtgctaaagcatctactgatcacaaagcagcggagaaactcattgacaaggct gccaaggtcattgctgatgaaaacttgacgccagaacaactctccaatgctgatgaaata tcactgtttgggcattacagcccccaaaagacactgactacagctgatgagacagtccct ataggaattcagatgccaaggacagaataa >gi568815585r:45683386_45951629|GENSCAN_predicted_peptide_9|88_aa MVPRGALALLPPSAEWRLWQLPFPSLAAELKVLCSSSDWWSLITWPQLQREAGKMPISFA VGASVTFSTCLVSMCEDEQCSVESRCTG >gi568815585r:45683386_45951629|GENSCAN_predicted_CDS_9|267_bp atggtcccacgaggagccctggcactgctgcctccctcagcagaatggagactgtggcag ctgccttttccttctctggctgctgaattgaaagtcctttgtagcagctctgattggtgg agcctgatcacatggccccagctgcagagggaggctgggaagatgccaatttcctttgct gttggagcatctgtgacattttcgacgtgtttagtttcaatgtgtgaggatgagcaatgc agcgtggagagccggtgcactgggtga