GENSCAN 1.0	Date run: 5-Nov-116	Time: 05:11:48

Sequence gi568815580r:63231785_63467118 : 235334 bp : 41.78% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1860   1993  134  2  2   72   91    70 0.048   5.63
 1.02 Intr +   6449   6520   72  2  0   62   72    92 0.027   2.60
 1.03 Intr +   7111   7217  107  1  2   25   53    96 0.013  -1.26
 1.04 Intr +  18098  18261  164  0  2   56   72   101 0.248   4.07
 1.05 Intr +  23598  23717  120  2  0   63   37   106 0.016   2.77
 1.06 Intr +  27475  27666  192  0  0  -33   94   165 0.004   3.77
 1.07 Term +  33069  33518  450  2  0   61   42   210 0.250   7.90
 1.08 PlyA +  33863  33868    6                               1.05

 2.09 PlyA -  34129  34124    6                               1.05
 2.08 Term -  37060  37014   47  2  2  120   47    37 0.427  -0.81
 2.07 Intr -  38468  38432   37  2  1   82  106    51 0.548   3.22
 2.06 Intr -  53509  53315  195  1  0   95   99    71 0.423   7.59
 2.05 Intr -  59366  59238  129  2  0   64   97    38 0.301   2.27
 2.04 Intr -  60944  60723  222  2  0   87   88   163 0.427  13.60
 2.03 Intr -  77653  77557   97  0  1   55   74    37 0.072  -1.91
 2.02 Intr -  85950  85746  205  1  1   27  101   160 0.350   8.54
 2.01 Init -  86882  86294  589  2  1   92   23   571 0.284  46.43
 2.00 Prom -  97015  96976   40                              -4.55

 3.11 PlyA -  97869  97864    6                               1.05
 3.10 Term - 100117  99998  120  1  0  119   33   190 0.998  14.09
 3.09 Intr - 103574 103473  102  2  0  117  115     6 0.974   5.65
 3.08 Intr - 107099 107016   84  2  0   72   92    91 0.979   7.00
 3.07 Intr - 112709 112626   84  2  0   36   86    77 0.133   1.40
 3.06 Intr - 119295 119104  192  0  0  122   83    48 0.683   6.47
 3.05 Intr - 123515 123420   96  2  0   93  100    84 0.937   9.39
 3.04 Intr - 123779 123714   66  2  0  125   71    13 0.728   1.48
 3.03 Intr - 128008 127952   57  1  0   96   99    58 0.978   5.86
 3.02 Intr - 131084 130995   90  2  0   44   94    79 0.332   3.37
 3.01 Init - 135334 135227  108  1  0   80   89   219 0.997  19.72
 3.00 Prom - 142987 142948   40                              -7.35

 4.00 Prom + 144329 144368   40                              -4.85
 4.01 Init + 144586 144680   95  0  2   70   34    99 0.536   2.60
 4.02 Term + 151006 151453  448  1  1   26   38   217 0.164   4.20
 4.03 PlyA + 153086 153091    6                               1.05

 5.10 PlyA - 153221 153216    6                               1.05
 5.09 Term - 161765 161619  147  2  0   61   44   175 0.855   7.32
 5.08 Intr - 165469 165250  220  0  1   72  115   170 0.979  15.48
 5.07 Intr - 167539 167458   82  2  1   76   97    72 0.947   4.68
 5.06 Intr - 172042 171923  120  2  0   84   61   107 0.952   7.15
 5.05 Intr - 175715 175648   68  1  2   56    4    77 0.320  -6.27
 5.04 Intr - 178662 178506  157  1  1   69   71   128 0.759   7.35
 5.03 Intr - 179794 179683  112  1  1   75   54   111 0.873   5.43
 5.02 Intr - 183946 183832  115  0  1   36   23   170 0.527   4.83
 5.01 Init - 184237 184122  116  1  2   45   21   124 0.347   1.13
 5.00 Prom - 188821 188782   40                              -6.55

 6.00 Prom + 189474 189513   40                              -9.55
 6.01 Init + 190442 190552  111  1  0   45   44    96 0.762   1.18
 6.02 Intr + 191042 191187  146  1  2  102   77   100 0.876   8.46
 6.03 Term + 192011 192119  109  2  1  111   45    65 0.887   1.50
 6.04 PlyA + 193197 193202    6                               1.05

 7.07 PlyA - 193357 193352    6                               1.05
 7.06 Term - 199746 199640  107  1  2   73   48   107 0.960   2.79
 7.05 Intr - 203964 203796  169  0  1  -81   87   203 0.044   1.90
 7.04 Intr - 208771 208421  351  1  0   56   13   184 0.006   2.29
 7.03 Intr - 214798 214627  172  0  1   43  100   108 0.273   6.52
 7.02 Intr - 224701 224540  162  0  0   38   60   134 0.407   3.77
 7.01 Intr - 232066 231908  159  0  0   24   62   126 0.181   1.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   2850   2726  125  1  2   75   44    92 0.902   1.07
S.002 Init -   5146   4888  259  1  1   59   75   167 0.891   9.85
S.003 Intr - 203966 203796  171  0  0  -41   87   202 0.908   6.19
S.004 Intr - 208771 208465  307  1  1   56   52   170 0.909   5.30

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_1|412_aa
MLALLQLTFCGAARFLTGHGPVPVHGLGAGDPWLMTHNHIAKCNRRLSDDLAKACERHPR
NAVMWAETETRKRWETGWHVLLGNERAEPLESCGMANCGSPATPGIRGAGPQCGRKGKPV
LWSTHRMPHRTLAHTSSHFNSARTLQGGYFTDQEAQDHTGLKSITTFIYPYNETLRDTTV
YTSNWKAVSDSFKCSDLQKAPAPITERKRAACKCPKHGPERKAECKRQTQFACTQGFAEV
YLYPAPHGSWGAPCRWLALKQCQIFTGHPCGEYAVSSPLGMRPLWQGAAGGPKERTRTLT
GDPWKETKCQDLERRQGPQGTGRTAAWQVEKHLWHSRWLQPELQASLSPIKINSEAIKVS
TRVPSAEHHAKVLLAAEFGKGKGHNGKGKAKEREERALVLVQLLLNLEKITQ

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_1|1239_bp
atgctggctctcctgcagctcaccttctgcggtgctgccaggttcctaacaggccacgga
cctgtaccggtccatggcctgggggctggggacccctggcttatgacacataatcacata
gcaaaatgcaacaggcggctgtcagatgatcttgctaaggcctgtgaacggcatcccagg
aatgcagtgatgtgggcagaaaccgagaccaggaaacgctgggagacgggctggcatgtg
cttcttggtaacgagagagcagagcccctggagagctgtgggatggctaactgcgggagc
cctgctactccgggaattagaggagcaggcccccagtgtggaaggaaagggaagccagtg
ctgtggagtacccaccgaatgccgcaccgcacacttgcacatacatcctctcacttcaac
tcagcaagaactctgcagggcgggtattttacagaccaggaagcccaagaccacacaggg
ctcaaaagcatcaccaccttcatttatccttacaatgaaaccttgagggacacgactgtc
tacacttctaattggaaagccgtatctgattcattcaaatgttcagatcttcaaaaagca
ccagcccccatcacagaaagaaagcgggcagcgtgtaagtgtcccaagcatggccccgag
agaaaagcagaatgcaaaaggcaaacgcagtttgcatgcacgcagggcttcgctgaggtc
tatttgtacccggctccccatggcagctggggagcaccttgccgttggttggcactgaag
cagtgccagatatttacagggcacccatgtggggagtacgctgtctcaagtcctctcggg
atgaggcctctgtggcaaggggctgcagggggcccaaaggaaagaacacggactctcacc
ggtgacccatggaaagaaaccaagtgccaagatctggagaggcgccagggtcctcaaggg
actgggagaacagctgcgtggcaagtggagaagcacctgtggcactccaggtggctccag
ccagaactccaggcctctctgtcgccaatcaaaataaacagtgaagcaatcaaagtgagc
actcgagtcccatcagcagagcaccacgctaaggtcttgctggcagcagagtttgggaag
gggaagggccacaatggcaaggggaaagcaaaagaaagagaagaaagggcccttgttctt
gtgcaattattattaaacttggagaaaataacacaataa

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_2|506_aa
MAHAGRTGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAPAPGIFSSQPGHTPHPA
ASRDPVARTSPLQTPAAPGAAAGPALSPVPPVVHLTLRQAGDDFSRRYRRDFAEMSSQLH
LTPFTARGRFATVVEELFRDGVNWGRIVAFFEFGGVMCVESVNREMSPLVDNIALWMTEY
LNRHLHTWIQDNGGWVACWNRREVSGTAYEPDTRLGLDSLHPKQAPLKQARCTLKLENNP
IRRIDSTEDQGHHQSVKFSKRGKPIGKGKITTVTALWRTCDEFHFGHSSRISKWKCPNQG
ELLSHSSVRCVMAEEPFIQDIGEEGSFIAFLQRRFICCPFAEHLGGANMNPFPSVRKPPL
SITRVQIPASMHKLGHRKRMAYLQRASWVGCVLSIRGKQLWVPGVEVRLLFHAKGNGLQQ
GTTGPHSLPGELSTHKLLDGREVPPVIFIFPFRKNPYSSPFSPAAACEPVSVSWVVQLQE
KDPRNGIVAAESHKKKACLANMKFEN

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_2|1521_bp
atggcgcacgctgggagaacagggtacgataaccgggagatagtgatgaagtacatccat
tataagctgtcgcagaggggctacgagtgggatgcgggagatgtgggcgccgcgcccccg
ggggccgcccccgcaccgggcatcttctcctcccagcccgggcacacgccccatccagcc
gcatcccgggacccggtcgccaggacctcgccgctgcagaccccggctgcccccggcgcc
gccgcggggcctgcgctcagcccggtgccacctgtggtccacctgaccctccgccaggcc
ggcgacgacttctcccgccgctaccgccgcgacttcgccgagatgtccagccagctgcac
ctgacgcccttcaccgcgcggggacgctttgccacggtggtggaggagctcttcagggac
ggggtgaactgggggaggattgtggccttctttgagttcggtggggtcatgtgtgtggag
agcgtcaaccgggagatgtcgcccctggtggacaacatcgccctgtggatgactgagtac
ctgaaccggcacctgcacacctggatccaggataacggaggctgggtagcttgttggaac
cggagggaggtttcagggactgcttatgaaccagataccaggttgggtcttgacagtctg
catcccaaacaagctcccctgaaacaagcccgttgtacactaaaattggagaacaacccg
attagaagaatagactccacagaagaccaagggcatcatcaatcggtaaaattttccaaa
agaggaaaaccaatagggaaaggaaagataacaacagttactgctttgtggcgtacatgt
gatgaatttcattttggacattccagtaggatatccaagtggaaatgcccaaatcaagga
gaactgttaagccattcctcagtacgatgtgtcatggctgaagaaccattcattcaagac
attggtgaagagggttctttcattgcctttcttcaacgacggtttatttgctgtccattt
gctgagcacctgggtggtgccaacatgaacccatttccttctgtgcgcaagccccccctt
tctattaccagagtgcagatcccagctagcatgcataaattaggccacaggaaaaggatg
gcgtaccttcaaagagccagttgggttggctgtgttctttctattcgtggaaaacagctg
tgggtgccgggtgttgaagtcaggcttttgttccatgctaagggcaatggtcttcagcag
ggaacaacgggccctcattccctgccaggggagctgagtacacacaaactgctggatggg
cgagaagttcccccagtgatctttatttttccgtttagaaagaatccttattccagcccc
ttctctccagctgctgcgtgtgagcctgtctctgtctcttgggtggtgcagctgcaggaa
aaagatcctagaaatggaattgttgcagctgaaagccataaaaagaaagcttgtctggca
aatatgaaatttgaaaactga

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_3|332_aa
MLLLAAAFLVAFVLLLYMVSPLISPKPLALPGAHVVVTGGSSGIGKCIAIECYKQGAFIT
LVARNEDKLLQAKKEIEMHSINDKQVVLCISVDVSQDYNQVENVIKQAQEKLGPVDMLVN
CAGMAVSGKFEDLEVSTFERLMSINYLGSVYPSRAVITTMKERRVGRIVFVSSQAGQLGL
FGFTAYSASKFAIRGLAEALQMEVKPYNVYITVAYPPDTDTPGFAEENRTKPLETRLISE
TTSVCKPEQVAKQIVKDAIQGNFNSSLGSDGYMLSALTCGMAPVTSITEGLQQVVTMGLF
RTIALFYLGSFDSIVRRCMMQREKSENADKTA

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_3|999_bp
atgctgctgctggctgccgccttcctcgtggccttcgtgctgctgctgtacatggtgtct
ccgctcatcagccccaagcccctcgccctgcccggggcgcatgtggtggttacaggaggt
tccagtggcatcgggaagtgcattgctatcgagtgctataaacaaggagcttttataact
ctggttgcacgaaatgaggataagctgctgcaggcaaagaaagaaattgaaatgcactct
attaatgacaaacaggtggtgctttgcatatcagttgatgtatctcaagactataaccaa
gtagagaatgtcataaaacaagcacaggagaaactgggtccagtggacatgctggtaaat
tgtgcaggaatggcagtgtcaggaaaatttgaagatcttgaagttagtacctttgaaagg
ttaatgagcatcaattacctgggcagcgtgtaccccagccgggccgtgatcaccaccatg
aaggagcgccgggtgggcaggatcgtgtttgtgtcctcccaggcaggacagttgggatta
ttcggtttcacagcctactctgcatccaagtttgccataaggggattggcagaagctttg
cagatggaggtgaagccatataatgtctacatcacagttgcttacccaccagacacagac
acacctggctttgccgaagaaaacagaacaaagcctttggagactcgacttatttcagag
accacatctgtgtgcaaaccagaacaggtggccaaacaaattgttaaagatgccatacaa
ggaaatttcaacagttcccttggctcagatgggtacatgctctcggccctgacctgtggg
atggctccagtaacttctattactgaggggctccagcaggtggtcaccatgggccttttc
cgcactattgctttgttttaccttggaagttttgacagcatagttcgtcgctgcatgatg
cagagagaaaaatctgaaaatgcagacaaaactgcctaa

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_4|180_aa
MQEEAGEILSTTRTQPTIAALKVEEGATSQGVGHNNICQLNRSSYNVTLTFLPGMGESMS
PPLKIGLASVNIVIHGIWQERHYVTSEARLLKVVQLPFWSLEYLLLKLQSSVTMSAVRLP
RGSHAVWKPKHPTQSSHLEQPQVSMKRAIQPAPSCISCPAAVGLHLCRALLARSTWPSPS

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_4|543_bp
atgcaagaagaggcaggagagatcttaagcacaactaggactcaacccaccattgctgct
ttgaaagtggaggagggggccacaagccaaggagttggccacaacaacatctgccaactc
aaccgctcctcttacaatgtcaccttgacattccttccagggatgggggaatccatgtcc
cctcccctcaaaattgggttggcttctgtgaatattgtgatccatggaatatggcaggag
agacactatgtgacttcagaggctagattgctaaaggtggtacagcttccattttggtcc
ctggaatacttgctcctgaagttgcagtcatcagtcaccatgtcagcagtcagactgccc
cgtggcagccatgctgtgtggaagcccaaacatcctacgcagagcagccacttggagcaa
ccccaggtctccatgaagagggctatccagccagcgcccagttgcatcagctgtccagca
gctgtgggactgcacctgtgcagagctctcctagccagaagcacctggccaagcccttcc
tga

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_5|378_aa
MIGGLDKQIKDITEVIEQPVKHSKLFEALGITQPRVVLLSISSSQLEGGSGGDSEVLHIV
LELLNQLDSFEAPKKIKKAIDLASKAAQEDKAGNYEEALQLYQHAVQYFLHVVKYEAQGD
KAKQSIRAKCTEYLDRAEKLKEYLKNKEKKAQKPVKEGQPSPADEKGNDSDGEGESDDPE
KKKLQNQLQGAIVIERPNVKWSDVAGLEGAKEALKEAVILPIKFPHLFTGVGVDNDGILV
LGATNIPWVLDSAIRRRFEKRIYIPLPEPHARAAMFKLHLGTTQNSLTEADFRELGRKTD
GYSGADISIIVRDALMQPVRKVQSATHFKKVRGPSRADPNHLVDDLLTPCSPGDPGAIEM
TWMDVPGDKLLEPVVSMV

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_5|1137_bp
atgattggtggactggacaagcagattaaggacatcacagaagtgatcgagcagcctgtt
aagcattccaagctctttgaagcactgggtatcacacaacccagggtggtactgctctcc
atcagctcctcgcagctggaggggggttctggaggggacagtgaagtgctgcacatcgtg
ctggaactgctcaaccaactggacagctttgaggcccccaagaagatcaagaaagcgata
gatctggctagcaaagcagcgcaagaagacaaggctgggaactacgaagaagcccttcag
ctctatcagcatgctgtgcagtattttcttcatgtcgttaaatatgaagcacagggtgat
aaagccaagcaaagtatcagggcaaagtgtacagaatatcttgatagagcagaaaaacta
aaggagtacctgaaaaataaagagaaaaaagcacagaagccagtgaaagaaggacagccg
agtccagcagatgagaaggggaatgacagtgatggggaaggagaatctgatgatcctgaa
aaaaagaaactacagaatcaacttcaaggtgccattgttatagaacgaccaaatgtgaaa
tggagtgacgttgctggacttgaaggagccaaagaagcactgaaagaggctgtgatactg
cctattaaatttcctcatctttttacaggggttggtgtagacaatgatggaattttggtt
ctgggagctacaaatataccctgggttctggattctgccattaggcgaagatttgagaaa
cgaatttatattcccttgccggaaccccatgcccgagcagcaatgtttaaactgcaccta
gggaccactcagaacagtctcacggaagcagactttcgggaacttgggaggaaaacagat
ggttattcaggggcagatataagtatcattgtacgtgatgcccttatgcagcctgttagg
aaagtacagtcagctactcattttaaaaaggttcgcggaccttcccgagctgatcctaac
catcttgtagatgatctgctaacaccttgctctccaggtgaccctggtgccattgaaatg
acatggatggatgtccctggagataaacttttggagccagttgtttccatggtttga

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_6|121_aa
MIPGGWAKWMTWRSSQAVPKGTRGEESQQQQRRSAHGLLSEKVVHFQDPPTGGFGPVTAP
SLRVLCGMGAITVLVSVVNNSGGSARIVLPNLYHPGSKHRALSLPCIYPSFLPTCEFAFA
K

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_6|366_bp
atgatacctggaggttgggcgaagtggatgacatggcggagttcccaggcggttcccaag
ggaacgaggggcgaggagagccaacagcagcaacgtcgaagcgcgcacgggcttttgagt
gaaaaagtcgtccatttccaagacccgccgactgggggctttgggcctgtgactgcgcct
tcactccgtgtcctctgtggaatgggggcgatcaccgtcctcgtctcggtggtgaataat
tcaggagggagtgcgcgaatagtattgccaaatctgtatcatccaggctcgaagcacaga
gccctctctctgccctgcatctacccgtcctttcttcccacttgtgaatttgccttcgca
aaatga

>gi568815580r:63231785_63467118|GENSCAN_predicted_peptide_7|373_aa
XGNQCKLNCSNPEEKENNCHLPTACCTVIEKSESSEEVPVLVTGQRYTVANCVWRSSRGK
QLSSSLLLKVRSLEKQDPHHLRADDEMQNLGPTPGLMNRNLLFNKTPRQQETYSFINTYP
MAREHKALSGQLQAGRPQGSMLIGQTLVTCPSPDQLVLSKSVTNETPRERERKASQKCDQ
QSEGVSVFLGPGQSRPVRDDWTCAAEGYSHPESPVLPELGAEMQSSPAKRHSPTQQTGTW
PPKPELCSYFTATISINITADLEAVNVCKLWIHNVNPPSRVLEPEKKDGEKHQAEEQLYS
IGRISLTWSLSELTRLHGEALGIKCQNSLQDLSERPYKDLRVDLVTAILALQILPIQKTT
GMSYSGEEALSGQ

>gi568815580r:63231785_63467118|GENSCAN_predicted_CDS_7|1122_bp
nnaggtaatcagtgtaaactaaattgttctaacccggaggaaaaagaaaacaactgtcat
cttccaactgcctgctgtacagtaattgagaagtctgaatctagtgaggaggttcctgtc
cttgtcacaggacagagatataccgttgcaaattgtgtatggaggtcttctcgtgggaaa
cagttatctagctccttgttactaaaagtgaggtccttagagaaacaggatccacatcac
ctgagagctgacgatgaaatgcagaaccttggacccaccccaggcctaatgaacaggaac
ctgctttttaacaagacccccaggcaacaggagacctacagcttcatcaacacctacccc
atggccagagagcacaaagctctatccggccagctccaggcaggcagacctcaaggcagc
atgctgattggccaaactctggtcacatgtccatctccagaccaattagtattgtccaag
tctgttaccaatgaaaccccaagggaaagggaaagaaaagcaagccaaaagtgtgaccag
cagtccgaaggagtgtcagtgttcctgggcccaggccaaagcaggcctgtcagggatgac
tggacatgtgctgctgagggttacagccatcctgagagcccagtcttgccagagttggga
gcagaaatgcagtccagcccagcaaagagacacagcccaacccagcaaacaggaacttgg
cccccaaagccagagctgtgctcatatttcactgccacaatctccatcaatataactgct
gatctggaagctgtgaatgtttgtaagctctggatacataatgtcaatccaccttccaga
gttctggagcccgaaaagaaggatggagaaaaacaccaagcagaggagcagctttactca
atcggtagaatatccctaacttggagcctatcagagctgacccgactacatggagaagcc
ttgggcattaagtgtcagaattcactgcaagatctgagtgagagaccttacaaagatctg
cgtgttgatttagtgacagcaatcttggcactccaaattttgccaatccaaaagacaact
ggaatgagttactctggagaagaggcactcagtggacaataa