GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:53:31 Sequence gi568815582r:25139819_25357127 : 217309 bp : 43.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 21284 21386 103 0 1 81 42 105 0.380 4.43 1.02 Intr + 24780 24900 121 0 1 81 98 79 0.676 8.70 1.03 Intr + 29294 29395 102 1 0 129 71 141 0.995 16.87 1.04 Intr + 30896 30987 92 1 2 61 80 102 0.957 5.49 1.05 Intr + 35119 35216 98 1 2 103 70 106 0.446 9.95 1.06 Term + 52379 52530 152 0 2 69 37 133 0.521 4.37 1.07 PlyA + 53722 53727 6 1.05 2.00 Prom + 72015 72054 40 -2.66 2.01 Init + 73405 73464 60 0 0 78 47 68 0.687 2.95 2.02 Intr + 77380 77627 248 0 2 90 94 257 0.948 22.76 2.03 Intr + 81639 81765 127 0 1 102 98 138 0.997 16.88 2.04 Intr + 84544 84758 215 0 2 60 92 273 0.874 22.41 2.05 Intr + 87250 87384 135 1 0 91 63 117 0.988 9.18 2.06 Term + 88626 88674 49 0 1 100 48 78 0.622 1.78 2.07 PlyA + 89089 89094 6 1.05 3.08 PlyA - 91167 91162 6 1.05 3.07 Term - 100920 99998 923 1 2 89 45 422 0.941 30.36 3.06 Intr - 104458 103967 492 2 0 41 91 393 0.199 27.87 3.05 Intr - 107572 106889 684 2 0 67 83 516 0.320 40.54 3.04 Intr - 112217 112091 127 2 1 14 82 92 0.650 1.45 3.03 Intr - 113219 113128 92 0 2 95 88 19 0.849 2.31 3.02 Intr - 115574 115388 187 2 1 80 95 148 0.976 13.96 3.01 Init - 117309 116911 399 0 0 86 94 347 0.876 31.77 3.00 Prom - 131973 131934 40 -1.26 4.07 PlyA - 133759 133754 6 1.05 4.06 Term - 137054 136939 116 0 2 67 43 66 0.295 -1.27 4.05 Intr - 140121 140012 110 2 2 72 38 94 0.051 2.73 4.04 Intr - 158617 158546 72 0 0 71 80 41 0.006 0.12 4.03 Intr - 175417 175313 105 0 0 38 110 60 0.501 2.53 4.02 Intr - 184154 184087 68 2 2 104 67 64 0.504 3.60 4.01 Init - 188034 187987 48 0 0 97 110 -1 0.576 3.95 4.00 Prom - 197588 197549 40 -2.46 5.00 Prom + 200977 201016 40 -4.36 5.01 Init + 206294 206432 139 1 1 83 78 80 0.540 6.80 5.02 Term + 213168 213259 92 1 2 33 48 118 0.685 0.18 5.03 PlyA + 214722 214727 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 3784 3617 168 2 0 42 86 93 0.804 4.42 S.002 Intr - 12459 12298 162 1 0 84 26 116 0.884 4.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:25139819_25357127|GENSCAN_predicted_peptide_1|222_aa XGHILDSKRYAVIGADLRDLSELEEKLKKCNMNTQLPTLLIAECVLVYMTPEQSANLLKW AANSFERAMFINYEQVNMGDRFGQIMIENLRRRQCDLAGVETCKSLESQKERLLSNGWET ASAVDMMELYNRLPRAEVSRIESLEFLDEMELLEQLMRHYCLCWATKGGNELGLKHDSVS IKLQPATCSREAQLFLQETLEKLHPETLSEIPVIRMFVDLGL >gi568815582r:25139819_25357127|GENSCAN_predicted_CDS_1|669_bp natggacacatactggattcaaagagatatgccgttattggagcagatctccgagacctg tctgaactggaagagaagctaaagaaatgtaacatgaatacacaattgccaacactcctg atagctgaatgtgtgctggtttacatgactccagagcagtccgcaaacctcctgaagtgg gcagccaacagttttgagagagccatgttcataaactacgaacaggtgaacatgggtgat cggtttgggcagatcatgattgaaaacctgcggagacgccagtgtgacctggcgggagtg gagacctgcaagtcattagagtcacagaaagaacggctcctgtcgaatgggtgggaaaca gcatcggccgtcgacatgatggagttgtacaacaggttacctcgagctgaagtgagcagg atagaatcacttgaattcctggatgaaatggagctgctggagcagctcatgcggcattac tgcctttgctgggcaaccaaaggaggaaatgagcttggattgaaacatgattcagtttcc ataaaattacagccagccacatgctcaagagaggcacagctgttcctgcaggagaccctg gagaagcttcacccagaaaccctcagtgagatccctgtgattcgcatgttcgtggacctg ggtttgtag >gi568815582r:25139819_25357127|GENSCAN_predicted_peptide_2|277_aa MEVAWIPESPFGDEPVMDYYIAMCEPEFGNDKAREPSVGGRWRVSWYERFVQPCLVELLG SALFIFIGCLSVIENGTDTGLLQPALAHGLALGLVIATLGNISGGHFNPAVSLAAMLIGG LNLVMLLPYWVSQLLGGMLGAALAKAVSPEERFWNASGAAFVTVQEQGQVAGALVAEIIL TTLLALAVCMGAINEKTKGPLAPFSIGFAVTVDILAGGPVSGGCMNPARAFGPAVVANHW NFHWIYWLGPLLAGLLVGLLIRCFIGDGKTRLILKAR >gi568815582r:25139819_25357127|GENSCAN_predicted_CDS_2|834_bp atggaagtggcctggatccctgagtcaccatttggagacgagcctgtcatggactattat atagccatgtgtgagcctgaatttggcaatgacaaggccagggagccgagcgtgggtggc aggtggcgagtgtcctggtacgaacggtttgtgcagccatgtctggtcgaactgctgggc tctgctctcttcatcttcatcgggtgcctgtcggtcattgagaatgggacggacactggg ctgctgcagccggccctggcccacgggctggctttggggctcgtgattgccacgctgggg aatatcagtggtggacacttcaaccctgcggtgtccctggcagccatgctgatcggaggc ctcaacctggtgatgctcctcccgtactgggtctcacagctgctcggggggatgctcggg gctgccttggccaaggcggtgagtcctgaggagaggttctggaatgcatctggggcggcc tttgtgacagtccaggagcaggggcaggtggcaggggcgttggtggcagagatcatcctg acgacgctgctggccctggctgtatgcatgggtgccatcaatgagaagacaaagggccct ctggccccgttctccatcggctttgccgtcaccgtggatatcctggctgggggccctgtg tctggaggctgcatgaatcccgcccgtgcttttggacctgcggtggtggccaaccactgg aacttccactggatctactggctgggcccactcctggctggcctgcttgttggactgctc attaggtgcttcattggagatgggaagacccgcctcatcctgaaggctcggtga >gi568815582r:25139819_25357127|GENSCAN_predicted_peptide_3|967_aa MAVALDSQIDAPLEVEGCLIMKVEKDPEWASEPILEGSDSSETFRKCFRQFCYEDVTGPH EAFSKLWELCCRWLKPEMRSKEQILELLVIEQFLTILPEKIQAWAQKQCPQSGEEAVALV VHLEKETGRLRQQVSSPVHREKHSPLGAAWEVADFQPEQVETQPRAVSREEPGSLHSGHQ EQLNRKRERRPLPKNARPSPWVPALADEWNTLDQEVTTTRLPAGSQEPVKDVHVARGFSY RKSVHQIPAQRDLYRDFRKENVGNVVSLGSAVSTSNKITRLEQRKEPWTLGLHSSNKRSI LRSNYVKEKSVHAIQVPARSAGKTWREQQQWGLEDEKIAGVHWSYEETKTFLAILKESRF YETLQACPRNSQVYGAVAEWLRECGFLRTPEQCRTKFKSLQKSYRKVRNGHMLEPCAFFE DMDALLNPAARAPSTDKPKEMIPVPRLKRIAISAKEHISLVEEEEAAEDSDDDEIGIEFI RKSEIHGAPVLFQNLSGVHWGYEETKTFLDILRETRFYEALQACHRKSKLYGAVAEQLRE CGFLRTPEQCRTKFKSLQKSYRKVKNGHVLESCAFYKEMDALINSRASAPSPSTPEEVPS PSRQERGGIEVEPQEPTGWEPEETSQEAVIEDSCSERMSEEEIVQEPEFQGPPGLLQSPN DFEIGSSIKEDPTQIVYKDMEQHRALIEKSKRVVSQSTDPSKYRKRECISGRQWENLQGI RQGKPMSQPRDLGKAVVHQRPFVGKRPYRLLKYGESFGRSTRLMCRMTHHKENPYKCGVC GKCFGRSRSLIRHQRIHTGEKPFKCLDCGKSFNDSSNFGAHQRIHTGEKPYRCGECGKCF SQSSSLIIHQRTHTGEKPYQCGECGKSFTNSSHFSAHRRVHTGENPYKCVDCEKSFNNCT RFREHRRIHTGEKPYGCAQCGKRFSKSSVLTKHREVHVREKPLPHPPSLYCPENPHKGKT DEFRKTF >gi568815582r:25139819_25357127|GENSCAN_predicted_CDS_3|2904_bp atggctgtcgccctcgactctcagatcgacgcgcccctggaggttgagggatgcctaata atgaaggtggaaaaggaccctgagtgggcatcagagcccattctggaaggatcggatagc tctgagaccttccgcaaatgcttcaggcaattctgttatgaggatgtgactggaccccat gaagctttcagtaaactctgggaactttgctgccggtggctgaagccagaaatgcgttcc aaggagcaaatacttgagctgctggtgattgagcagtttctcaccattttacccgagaag attcaggcttgggcacagaagcagtgtccgcaaagtggagaggaagcggtggccctggta gtgcatttggagaaagagactggaagactaagacagcaggtcagcagtcccgtgcaccgg gagaagcactccccacttggagcagcgtgggaggtggcagacttccagccagagcaggtg gagacccaacccagggcggtgtctcgggaggaacctggaagcctccactcaggacaccag gaacagctgaaccgaaagcgagaacgtcggcccttacccaagaatgctcggccttctccc tgggttcctgcccttgctgatgaatggaataccctagatcaggaagtgacaaccacacgg cttcctgctgggtcccaggaaccagtgaaagatgtccacgtggccagaggcttttcctac agaaagagtgtgcatcagattcctgcccaaagggacctctaccgggatttcaggaaggag aatgttgggaacgtggtctccctgggaagtgcagtgtctacatctaacaagataacccgg ttggaacagagaaaggagccatggactctaggtctgcattcctctaacaagagaagtatc ctacgaagcaactacgtcaaggaaaagtcagttcatgctattcaggtccctgcaaggagt gcaggaaaaacatggagagagcagcagcagtggggtttagaagatgaaaagatagcaggt gtgcattggagctatgaggaaacaaagactttcctggcaattctcaaagagtctcgcttt tatgaaacacttcaggcctgtccccgaaatagccaagtgtatggtgctgtggctgaatgg ttgcgagaatgtggcttccttagaaccccagaacagtgtcgaaccaagttcaaaagtctc cagaaaagctatcgaaaggtgagaaatggccacatgctagaaccctgcgccttctttgag gacatggatgctttgttgaaccctgcagcccgtgctccgtccactgataaaccaaaggag atgatacctgtccccagactgaagagaattgccatcagtgctaaggaacacatcagcttg gtggaggaggaggaagctgcagaagattctgatgatgatgaaataggcatcgaatttatc cgcaagtctgaaatccatggtgcccctgtcttgtttcagaatctcagtggcgtgcactgg ggctatgaagaaaccaagacttttcttgatatcctccgtgagactcggttttatgaagcg cttcaagcctgtcatcggaagagcaaattgtatggggctgtagctgaacagcttcgagag tgcggcttcctccggacaccagaacagtgccgaaccaagttcaaaagccttcagaagagt taccgcaaggtgaaaaatggccacgtgctagagtcctgcgcgttctacaaggagatggat gccctgattaactctcgggcatctgctccttcccccagcaccccagaggaagtcccatca ccttcaaggcaagaaagagggggtattgaggttgaaccccaggaacctacaggctgggaa cctgaagagacctcacaggaggcagtaatagaagactcttgcagtgagagaatgagcgag gaggaaattgtgcaagagccagagttccagggacctccaggtctactgcagagcccaaat gattttgaaatcggaagtagcatcaaggaggatccaacacagatagtatataaggacatg gaacagcatagggcattaatagaaaagtctaaaagagttgtttcccagagtaccgacccc agcaaatatcgcaaaagggaatgcatctcaggaagacaatgggaaaatcttcaaggaatt agacagggaaagccgatgtctcagcctagagatttagggaaagccgttgtgcatcagagg ccttttgtggggaagagaccctacagacttctcaaatatggagaaagctttggaaggagc actcgtctgatgtgccggatgacccaccacaaggagaatccttacaagtgtggtgtctgt gggaagtgctttggtagaagcaggagcctgatcagacaccaaagaatccacacaggcgaa aaaccttttaaatgtcttgactgtggaaaaagctttaatgactcctcaaattttggtgcc caccagagaatccacacaggagagaaaccctacagatgcggagagtgtggaaaatgcttt agtcagagctctagtcttattatacatcagagaacgcacaccggtgagaagccctatcag tgtggagagtgtgggaaaagtttcaccaacagttctcatttcagcgcccaccggagagtt cacactggggagaatccctacaaatgtgtggactgtgaaaaaagtttcaataactgtacg agatttcgagaacatcggagaatacacactggagagaagccctatggatgtgcccagtgt ggcaaacgtttcagtaagagttctgttcttaccaaacatcgggaagttcatgtgagagaa aagcctctgccacaccctccatctctgtattgccctgagaacccacataagggaaagact gatgaatttaggaaaactttttga >gi568815582r:25139819_25357127|GENSCAN_predicted_peptide_4|172_aa MVEGEGGAKSHLTWQQTHTQSISKPGPLCLPNPAKNDESKIAEEEFLTFISSQKHPLKQP PTHKNTFTRAKKFSYECHLHLNPELLLWELQMAHVRMRSSGETSEMKAINAQILGQLMVS QVALLILAGLLSCPELAGDSKSDVIFGHKLPSAVHFWINSHHYHLKKQTLSW >gi568815582r:25139819_25357127|GENSCAN_predicted_CDS_4|519_bp atggtggaaggtgaaggaggagcaaagtcacatcttacatggcagcagacccatacccaa tccatcagcaagcctggtcccctctgcctgccaaatcctgccaaaaatgatgaaagcaag atagcagaagaagagtttctgacattcatctcctcacagaaacatccacttaaacaacca ccaacacataaaaatacctttacaagagctaagaaattcagctatgaatgccaccttcac ctcaaccctgaactcctcttatgggagctgcagatggcccacgtaagaatgaggtcctct ggggagaccagtgagatgaaagcaatcaatgcacagatcttggggcagctgatggtcagc caggtggctctgctgatcctggctggcttgctctcgtgtccagagcttgctggagacagc aagagtgatgtcatttttgggcacaaactgccctctgctgttcacttttggattaacagc caccattatcacctgaagaagcaaacgctgtcctggtag >gi568815582r:25139819_25357127|GENSCAN_predicted_peptide_5|76_aa MAEGEANMSFFTWQQQGEVQSEVGEKPLIKPSDLVTTHSPSREQHGAISSECFDGEHEDM WNLATYMEGTINPRLP >gi568815582r:25139819_25357127|GENSCAN_predicted_CDS_5|231_bp atggcagaaggggaagcaaacatgtccttcttcacatggcagcaacaaggagaagtgcag agcgaagtaggggaaaagccccttataaaaccatcagatcttgtgacaactcactcacca tcacgagaacagcatggagctatatcatcagaatgctttgatggagaacatgaggacatg tggaacctggccacttacatggaaggtaccatcaacccacgtcttccctga