GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:31:06

Sequence gi568815596r:126457433_126658163 : 200731 bp : 39.63% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -    251    246    6                               1.05
 1.02 Term -   3952   3690  263  2  2    8   37   230 0.213   4.60
 1.01 Init -  10071   9627  445  0  1   90   53   177 0.219  10.95
 1.00 Prom -  11264  11225   40                              -5.05

 2.00 Prom +  16672  16711   40                              -5.85
 2.01 Sngl +  25195  25605  411  0  0   45   47   361 0.997  23.74
 2.02 PlyA +  26480  26485    6                               1.05

 3.00 Prom +  31453  31492   40                              -5.85
 3.01 Init +  34185  34259   75  2  0   63   98    66 0.771   6.24
 3.02 Intr +  53259  53362  104  2  2  111   56    85 0.459   5.65
 3.03 Intr +  54389  54493  105  2  0   93  101    17 0.213   1.91
 3.04 Term +  55867  56029  163  1  1   86   36    99 0.125   0.93
 3.05 PlyA +  56279  56284    6                               1.05

 4.07 PlyA -  56828  56823    6                               1.05
 4.06 Term -  60075  59968  108  0  0  119   54    59 0.954   3.13
 4.05 Intr -  61231  61136   96  1  0  134   94    -6 0.898   3.89
 4.04 Intr -  61737  61627  111  0  0   77   30    87 0.492   1.46
 4.03 Intr -  62009  61881  129  2  0   66   91    36 0.470   1.67
 4.02 Intr -  62845  62743  103  0  1   34   97    73 0.250   2.06
 4.01 Init -  63239  63136  104  2  2   81   64   101 0.340   6.76
 4.00 Prom -  64377  64338   40                              -5.55

 5.08 PlyA -  64399  64394    6                               1.05
 5.07 Term -  70093  69905  189  1  0   37   44   185 0.760   5.47
 5.06 Intr -  70659  70526  134  1  2   17   49    54 0.055  -6.16
 5.05 Intr -  71966  71854  113  1  2   69   40   137 0.079   6.10
 5.04 Intr -  76896  76763  134  0  2  108   30    78 0.371   2.52
 5.03 Intr -  77402  77288  115  1  1   69   84   109 0.362   8.13
 5.02 Intr -  84314  84208  107  2  2   74  111    -3 0.038  -1.31
 5.01 Init -  94602  94492  111  0  0   18   75   107 0.133   2.66
 5.00 Prom -  98321  98282   40                              -6.75

 6.04 PlyA -  99021  99016    6                               1.05
 6.03 Term - 100179  99998  182  1  2   55   41   208 0.813   9.59
 6.02 Intr - 100766 100229  538  2  1   77   -7   541 0.790  35.06
 6.01 Init - 101500 101444   57  1  0   16   94     8 0.372  -4.44
 6.00 Prom - 104020 103981   40                              -5.75

 7.00 Prom + 111466 111505   40                              -6.95
 7.01 Init + 111707 111849  143  1  2   69   98    69 0.851   5.65
 7.02 Intr + 113720 113862  143  2  2  130   76    84 0.814  10.48
 7.03 Term + 116154 116290  137  1  2   93   47    57 0.574  -0.70
 7.04 PlyA + 118066 118071    6                               1.05

 8.04 PlyA - 118895 118890    6                               1.05
 8.03 Term - 119702 119695    8  0  2  139   49     0 0.026  -1.75
 8.02 Intr - 126196 126086  111  2  0   94   80    84 0.050   7.63
 8.01 Init - 128150 128096   55  2  1   75   81    31 0.040   2.60
 8.00 Prom - 140284 140245   40                              -3.35

 9.00 Prom + 149565 149604   40                              -0.95
 9.01 Init + 163197 163278   82  2  1   72   69    72 0.503   4.88
 9.02 Term + 163284 163546  263  1  2   16   48   183 0.582   1.80
 9.03 PlyA + 167862 167867    6                               1.05

10.00 Prom + 169599 169638   40                              -2.05
10.01 Init + 181248 181352  105  2  0   37  100    81 0.415   4.55
10.02 Intr + 185187 185319  133  2  1   37   78    96 0.466   2.80
10.03 Term + 186146 186363  218  0  2  117   49   134 0.943   8.72
10.04 PlyA + 186378 186383    6                               1.05

11.06 PlyA - 188585 188580    6                               1.05
11.05 Term - 191532 191337  196  2  1  -49   37   242 0.507   1.30
11.04 Intr - 192477 192108  370  1  1   61   80   254 0.512  15.14
11.03 Intr - 194038 193684  355  1  1  -10   78   200 0.350   3.14
11.02 Intr - 195833 195734  100  1  1   93   80   117 0.669  10.59
11.01 Init - 199188 198824  365  0  2   34  -12   292 0.691   8.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 151645 151752  108  0  0   80   54   110 0.890   4.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_1|235_aa
MPFVGCTPSSILAGLGFTYTPLNVAYAPLSAFQPRPWMPILSKMLSWNAGSGGSWHRAEV
PLPSLCSEGVQPAMWGGHTQVRMLGQPRVRAVSEGVIAEFSQVETSGISSCVQHRSAMTG
EIPSKTRTARQAWGTLRATRDNKTLLFQDQGSGPQRVINELKATQQMMGRIQCLLIIMRL
AALKSLLAVAQDCLFPVPILRCIWGVSTLASVSPNQTMTLASAPLRWSLKLVSPL

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_1|708_bp
atgccttttgtaggctgcaccccttcttcaatcttggctggcttggggttcacttacacc
ccactgaatgtggcatatgctcctctgagtgccttccaacctaggccatggatgcctata
ctttctaaaatgctgtcttggaatgcaggttctggtggcagttggcatcgtgcagaggta
cccctcccttcactgtgctcggaaggagtccagccagccatgtggggaggtcacacgcag
gtgaggatgcttggccagcctagggtcagagctgtgagtgagggagtcatcgcagagttc
agccaggtcgagacctcaggtatctccagctgtgtccagcatcgaagtgcaatgactgga
gagattccaagcaagacgaggactgcccgccaagcctggggaaccctcagagccactaga
gataataagacattgttgtttcaagaccagggaagtgggccacagagagtgattaatgag
ctcaaggctacacagcagatgatgggccgcatccagtgtctgctcatcatcatgcgtctt
gctgccctcaagtcactcctagcagttgcccaggactgtttgttccccgttcctatcctc
aggtgtatctggggtgtttccaccctggcctcagtctcccctaaccaaaccatgacactg
gcctctgctcctctgcgctggtccctgaagcttgtctctcccctgtag

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_2|136_aa
MVKSEAGFFSEPHRQPETSTASDTTHWISFILGSLQKAPCPLSLVGFTWLAVPPRSHASR
RICVQPESGQDVPQPVSALGASTWLKGTQCQNRDASDPEASEGMLQRPNSSFSPVVSPWL
SALMLLPVTCGCPLLA

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_2|411_bp
atggttaaaagtgaggcaggatttttctcggagccgcatcgccagccggagacctccacg
gccagcgacaccacccactggatctcattcatcctgggctccctgcagaaggcaccctgc
ccactcagcctggtgggcttcacttggcttgcagtcccccctagatcccatgctagccga
aggatctgcgttcagcctgaatctgggcaggacgtgccacaacctgtttctgccttgggc
gccagcacctggctgaagggaacgcagtgccaaaacagggatgccagtgaccccgaagcc
tcagaagggatgttacagcgtcctaacagctcttttagtcctgttgtctcaccctggctg
tccgccctgatgctgcttcctgtcacatgtggctgccctctgctggcataa
>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_3|148_aa
MYQICREKDNRAYRGNGNRQTLKLTVKQLIMCGRGTASQDYHEQGIWNPIEEVCVTEISR
KAPANCKEPLSTHVNNTGGLGNVFLLIWEKGLFLMLCSVLDCKDDMPGLLGSLLIYTLAF
STAVTLFGPCWWSMSHSEGKGTPGLPEI

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_3|447_bp
atgtatcagatctgcagagaaaaagacaacagggcttatcgtggcaatggaaacagacaa
acactaaagttaacagtcaaacagctgataatgtgtggaagaggcactgcatcccaagat
taccatgagcaagggatttggaatccaatagaagaagtttgcgtcactgagatatccagg
aaggctcctgctaactgcaaagagcccttaagtacccatgtgaataatacaggaggttta
ggaaacgttttcctcttaatatgggaaaagggactattcttaatgctctgctctgtgctg
gactgtaaggatgacatgcctggtcttcttgggtctctgctcatctataccctggctttt
tccactgccgtgaccctctttggtccctgctggtggtccatgtcacactcagagggtaaa
ggaacccctggacttcctgagatataa

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_4|216_aa
MDIKPMNQDMIKRTSAGDNGYEESKTERSGWSCPGDAHAEMHMLTPRDLGCHSQPRPVGE
WTPGIIYQLDLLSLQLSTPHGRRSTQVSGCRGWDECFWALAGAELCVALGQQEESGDMNV
LKMVNVGDFIANKSGCQWDEELERGWSGKVLLYVSGCGASLINGVVFPCICRHTCTFYPL
QCSFPRQLMAQLYIWVCALTSAFPSPSLISTSRKAA

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_4|651_bp
atggacattaaacccatgaatcaagacatgatcaagagaacttcagctggtgacaatggc
tatgaagaaagtaagacagagagaagtggttggagctgcccgggagacgcacatgctgaa
atgcacatgctgacacccagagacctgggatgtcatagccagccaagaccagtgggagaa
tggaccccaggcatcatctaccagttggacttactcagcctgcagctctcaacccctcat
gggaggaggagcacacaggtgagtgggtgcaggggctgggatgagtgcttctgggcgcta
gcaggagctgaactctgtgtggccttggggcagcaggaagaatcaggtgacatgaacgta
ttgaagatggtaaatgtgggggattttattgccaataaaagtggctgtcagtgggatgaa
gagctggaaaggggatggagtgggaaggtccttttatatgtcagtggatgtggggcatct
ttgattaatggggtggtgtttccctgcatttgcaggcacacctgcaccttctatccactg
cagtgctctttcccaagacagctgatggctcaactctacatctgggtctgtgctctgaca
tcagcttttccaagtccttctctgatctccacatctaggaaagcagcttga

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_5|300_aa
MLPDLEHRTPSSSVLRLGLALPAPQACRQPTVGPCDCNLGILYSLTSPCTVSNNSLKSLA
ELLFLVSSSVRLRFCKCIALRTPQGLPVDPLTGLQDAGLDVTQNATMRALSELAIKKFSD
LLCLKVDHKTLIPEVLPYTQGKGMPHREAKKNLKDRDTDGGYYPQQINAGTENQTLRILT
YKEELYNDNTWTHGMEVMHELSNMDFHSPWLNWLWTLLSVPSASAKTNTKPPMWPHSLPP
VLFYNNTNALRQKIGTEEWGVAIHIPGNVEAALELAKAQKLEEFGGLRRRQEDERKFGTF

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_5|903_bp
atgcttcctgatcttgaacatcggactccaagttcttcagttttgagacttggactggct
ctgcctgctcctcaagcttgcagacagcctactgtgggaccttgtgattgtaacctaggg
attctatattctctcactagcccatgcactgtatcaaacaattcattaaaatcattagct
gaattattatttctggtgtctagcagtgtccgcctcaggttttgtaaatgtattgctctc
agaacacctcagggacttcctgtggacccactcactggtctacaggatgcggggcttgac
gtcacacagaatgctaccatgagggctctttctgagctggctataaagaaattctctgac
ctactttgtttgaaagtagatcataagaccctcattccagaggtcctgccctacacccag
gggaaaggaatgccacacagagaggccaagaagaatctgaaagacagggacacggatgga
ggctattatcctcagcaaattaatgcaggaacagaaaaccaaacactgcgtattctcaca
tataaggaggagctgtacaatgacaacacatggacacatgggatggaggttatgcatgag
ctcagcaatatggacttccactcaccatggctgaactggctgtggacattgctgagtgtg
ccatctgccagcgcaaagaccaatactaagcctccaatgtggccccattccctgcctccg
gtactattttataacaacacaaatgcgctaagacagaaaattggcactgaggagtggggt
gttgctatacacatacctggaaatgtggaagcagctttggaactggctaaggcacagaag
ttggaagagtttggagggctcagaagaagacaggaagatgaaagaaagtttggaactttt
tag

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_6|258_aa
MFFFSSLFTFKTVDLKRCLPPPTPDTEHPVTDKNELVQKAKLAEQAEQYDDMAACMKSVT
KQGAELSNEERNLLSVAYKNVVGARKSSWRVVSSIEQKTEGAEKKQQMAREHREKIETEL
RDICNDVLSLLEKFLIPNASQAESKVFYLKMKGDYYRYLTEVTAGDDKIGIVDQSQQAYQ
EAFEISKKEMQPTHPVRLEKACCLAKTAFDEAIAELDTLSEESYKDSMLIMQLLRDNLTL
WTSDTQGDEAEAGEGGEN

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_6|777_bp
atgtttttctttagctctctgttcacatttaagactgttgatttaaagcgttgtctacca
ccacccactccggacacagaacatccagtcacggataaaaatgagctggttcagaaggcc
aaactggccgagcaggctgagcaatatgatgacatggcagcctgcatgaagtctgtaact
aagcaaggagctgaattatccaatgaggagaggaatcttctctcagttgcttataaaaat
gttgtaggagcccgtaagtcatcttggagggtcgtctcaagtattgaacaaaaaacggaa
ggtgctgagaaaaaacagcagatggctcgagaacacagagagaaaattgagacggagcta
agagatatctgtaatgatgtattgtctcttttggaaaagttcttgatccccaatgcttca
caagcagagagcaaagtcttctatttgaaaatgaaaggagattactaccgttacttgact
gaggttactgctggtgatgacaagatagggattgtggatcagtcacaacaagcataccaa
gaagcttttgaaatcagcaaaaaagaaatgcaaccaacacatcctgtcagattggagaaa
gcctgctgtcttgcaaagaccgcttttgatgaagccattgctgaacttgatacattaagt
gaagagtcatacaaagacagcatgctaataatgcaattactgagagacaacttgacattg
tggacatcggatacccaaggagacgaagctgaagcaggagaaggaggggaaaattaa

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_7|140_aa
MSPHKDFRINFHSSIAHNSQKVGTTDEWIMVNGGTSNNRILAQQQKPQPALNILCHCPKR
VCNMQLAELRGCQRMLSGCQLQCEQCLWKAMRGSRLQCLDWTCRNDPNCGYAFNTVMVLL
LNMGASPLEYMSPMHIRQET

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_7|423_bp
atgtctccacacaaagactttagaataaattttcatagcagcattgctcataatagccaa
aaagtaggaacaactgatgaatggataatggtgaatggtggtacatccaataatagaata
ttggctcagcaacagaaaccacagccagcccttaatatcctttgccactgcccaaagagg
gtctgcaatatgcagctagcagagctgcggggatgccagcggatgctgagcggatgccag
ctacaatgcgagcagtgcctctggaaggccatgaggggctccagattgcagtgtctggat
tggacttgtagaaatgacccaaattgtggttatgctttcaatacagtcatggtgctgctt
ttgaacatgggggcgtctccactcgaatacatgagtccaatgcatatacgtcaagagact
tga

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_8|57_aa
MGQAQDRHRRIGKKKGVTGPIVSIGYCLAGEFTELALALLDWPGRGPFSQGMLKAEF

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_8|174_bp
atgggacaggcacaggacagacataggagaataggcaagaagaaaggagtaacaggtccc
attgtaagcattggctattgcttggctggtgagtttacggaattagcattggcattgctt
gactggccaggtcgtggacctttttctcaagggatgctcaaggcagaattctga

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_9|114_aa
MELWVQQAVTSPLHAGHCSEHIAHIHSSPRQLGGGGNPVIDEDGALEVRNLLGSTQLASR
AEGLDLEAGLLHCGTTDIWGQIVPHYGAVLCIVGYLAASLAPTHWTSVTPLSQL

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_9|345_bp
atggagctgtgggtccagcaagcagtcaccagccccttgcatgctggacactgttctgag
cacattgcacacattcactcctccccacgacagctgggaggaggtgggaacccagttata
gatgaggatggagccttagaggtgagaaacctgctggggagcacccagctagcaagcaga
gcggagggacttgaccttgaagcagggcttctccactgtggcactactgacatttggggc
cagattgtccctcattatggggctgtcctgtgcattgtaggatatttggcagcatccctg
gcccccacccactggacatcagtaacacccctctcccagctgtga

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_10|151_aa
MKGLKKRLGNLELWVVTPVSRDIDNGIGLMVYWEKRYEAFVFIIINEAANLPGEPASTLQ
STSDTDVINLRGGKVTKQSGVQHESPPCSCAVAPGVHGDMGMWEVGREPKESELAALPWQ
GFRGRQPWLEFQLRHRAAVRESLLWSSASVK

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_10|456_bp
atgaagggccttaagaagagactgggaaacttggaactttgggtggtgactccagtaagc
agggatattgacaatggtattggcttgatggtgtactgggagaagcgttatgaagccttt
gtctttatcatcattaatgaagctgcaaacctcccaggagagcctgcctcaaccctccag
agcacgtcagacacagatgtcattaacctacgtgggggcaaagtcactaagcaaagcggt
gtccagcatgaatcacccccatgcagctgtgccgtggcaccaggagtccatggggacatg
gggatgtgggaggtgggaagggagcccaaggagagtgagctggcagcactgccatggcag
ggctttagagggcgacagccctggttggaattccagctcaggcaccgagccgctgtgcga
gagtcactcctctggtcctcagcttctgtcaaatga

>gi568815596r:126457433_126658163|GENSCAN_predicted_peptide_11|461_aa
MPRTKKQKAGDTPWEGGLAASPARSAWRLRVGVRDAGQGLRAAQAPGNRGWSRGSEPSGR
LLRLEARHGPGHQGVRGRGSAPGCCGPRPPVGPQDPSPRRVLTEAERPRRAVGASRRPHS
WAEETAHGKLPSKSSVLVPYWEWSWRSPAAVAVSWPHQVSTVRMRALIGKEWDPATWNGD
MWEDPDEAGDTELINSDEHFWPEVTASPSPVVATSSPPPMLLSAFPPLSEEINPALPEAV
ARQNNVDSPQKPPPTPLFASRPITRLKAWWAPRVSECIIGIDIFSSWQNPCIGSLTGRVR
ATVVGKAKWKPLELPSPRKIANQKEYHIPGGIVEITATIKDLKDAGVAIPTTSSFSSPIW
PVQKTDGAWRMTVDYCKLNQVVTPITAAVPDVVSLLDFEWGPEQQKALQQVQAAVQAALP
LGPYDPSDPVVLEVLAFGKLHRRITVEASRILDQGPVISCR

>gi568815596r:126457433_126658163|GENSCAN_predicted_CDS_11|1386_bp
atgccccgcacaaagaaacagaaagcgggcgacacgccctgggagggcgggctggcggct
tctccagctcggagcgcgtggaggctgcgcgtaggggtccgggacgcgggacagggactc
cgcgcagcccaggcgccggggaaccggggctggagtcggggatctgagcccagcggacgc
ctcctccgcctggaggcccggcacggaccgggacaccagggcgtccgtggccgtggctcg
gcccctggctgctgcgggccgcggcctccagtgggtccccaggacccttccccacggcgg
gtactcaccgaggctgagaggccacgccgtgctgttggggcttctcgtcgaccacattcc
tgggcggaagaaactgctcatgggaaactgccttccaaatcctcagtgctagttccttat
tgggagtggagttggcggtcccctgctgctgttgctgtctcctggcctcaccaggtgtct
actgttagaatgagggcattgattggaaaagaatgggaccctgcaacttggaatggggac
atgtgggaggaccctgatgaagctggggacactgagttaataaactctgatgaacatttt
tggccagaagtaacagcttccccatccccagtagtggcaacgtcctctcccccacccatg
ctgctatcagcctttccacctttgtctgaagagataaaccctgcgctgcctgaggcagtt
gccaggcaaaataatgttgattctcctcagaagccacccccaacacctctgtttgcttct
aggcctataactagactaaaggcttggtgggcccctagagtgtcagaatgcataattggt
atagacatatttagcagctggcagaacccctgcattggctccctgactggtagggtgagg
gctactgttgtgggaaaggccaaatggaagccattggagctgccatcacctagaaaaatt
gcaaatcaaaaagaataccacatccctggagggattgtggagattactgccaccatcaag
gacttaaaagatgcaggggtggcgattcccaccacatcctcattcagctctcccatttgg
cctgtgcagaagacagatggagcttggagaatgacagtggattattgtaagcttaaccaa
gtggtgactccaattacagctgctgtaccagatgtggtttcattgcttgattttgagtgg
ggtccagaacagcagaaggctctgcaacaggtccaggctgctgtgcaagctgctctgcca
cttgggccatatgacccatcagatccagtggtgcttgaggtgttagcctttgggaagctc
cataggcgaatcacagtggaagcctctaggattttggatcaaggccctgtcatctcctgc
agatag