GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:26:33 Sequence gi568815585r:36204368_36435830 : 231463 bp : 38.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 52 47 6 1.05 1.07 Term - 10511 10292 220 1 1 113 49 181 0.777 12.23 1.06 Intr - 11442 11324 119 0 2 76 -26 123 0.672 -1.96 1.05 Intr - 27354 27037 318 0 0 72 66 87 0.056 0.13 1.04 Intr - 30286 30227 60 1 0 108 70 66 0.674 4.81 1.03 Intr - 49489 49436 54 1 0 96 93 82 0.991 7.76 1.02 Intr - 49776 49678 99 0 0 61 76 121 0.992 7.59 1.01 Init - 51705 51496 210 0 0 73 23 201 0.427 10.93 1.00 Prom - 51944 51905 40 -8.05 2.00 Prom + 52073 52112 40 -9.55 2.01 Init + 52123 52209 87 0 0 84 35 49 0.108 -0.11 2.02 Term + 54707 55018 312 1 0 41 47 286 0.497 13.82 2.03 PlyA + 58124 58129 6 1.05 3.00 Prom + 59004 59043 40 -6.95 3.01 Sngl + 65310 65660 351 2 0 46 43 256 0.989 12.70 3.02 PlyA + 66241 66246 6 1.05 4.08 PlyA - 68073 68068 6 -0.45 4.07 Term - 68596 68469 128 2 2 46 49 141 0.337 3.46 4.06 Intr - 79335 79225 111 1 0 96 87 88 0.991 8.93 4.05 Intr - 81733 81658 76 1 1 6 87 92 0.789 -0.93 4.04 Intr - 82440 82369 72 0 0 111 92 -5 0.517 0.88 4.03 Intr - 84995 84861 135 2 0 44 36 106 0.539 0.84 4.02 Intr - 86498 86414 85 1 1 63 66 93 0.846 3.50 4.01 Init - 93352 93270 83 1 2 73 81 173 0.969 15.59 4.00 Prom - 99184 99145 40 -4.95 5.11 PlyA - 99373 99368 6 1.05 5.10 Term - 100265 99998 268 1 1 73 45 348 0.998 22.98 5.09 Intr - 107868 107778 91 1 1 56 108 53 0.866 2.23 5.08 Intr - 108110 107952 159 0 0 34 87 157 0.768 9.24 5.07 Intr - 110054 109860 195 0 0 35 38 246 0.770 12.76 5.06 Intr - 112671 112590 82 0 1 117 45 64 0.309 3.29 5.05 Intr - 112994 112847 148 1 1 40 34 35 0.324 -7.38 5.04 Intr - 114579 114461 119 0 2 10 109 160 0.479 8.64 5.03 Intr - 125150 124995 156 2 0 66 61 184 0.905 12.79 5.02 Intr - 127229 127032 198 2 0 77 93 164 0.961 14.43 5.01 Init - 131463 130654 810 0 0 92 115 536 0.945 51.45 5.00 Prom - 145975 145936 40 -4.65 6.00 Prom + 152308 152347 40 -4.55 6.01 Init + 155952 156002 51 2 0 74 94 35 0.197 3.91 6.02 Intr + 161133 161237 105 2 0 18 50 116 0.106 0.29 6.03 Intr + 161363 161476 114 1 0 102 94 11 0.169 2.82 6.04 Intr + 178721 178864 144 1 0 100 61 31 0.022 1.16 6.05 Intr + 200727 200794 68 2 2 54 92 57 0.149 -0.52 6.06 Term + 203786 204092 307 2 1 25 43 747 0.200 57.50 6.07 PlyA + 204781 204786 6 1.05 7.00 Prom + 221639 221678 40 -5.05 7.01 Init + 222283 222360 78 0 0 62 101 39 0.732 3.71 7.02 Intr + 226243 226337 95 0 2 30 75 109 0.550 1.64 7.03 Term + 226470 226704 235 0 1 77 39 227 0.812 11.51 7.04 PlyA + 226722 226727 6 -1.95 8.00 Prom + 226857 226896 40 -9.45 8.01 Init + 227814 227915 102 2 0 116 51 73 0.769 6.34 8.02 Intr + 228666 228854 189 2 0 77 95 244 0.486 22.86 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 153004 153274 271 1 1 55 38 198 0.813 5.47 S.002 Sngl + 219017 219181 165 1 0 46 41 172 0.822 2.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_1|359_aa MEATLAGKEGEGKAGAGQDRNETAHMNIGVGQLTSLAHTVLSALHLVKGGKRLRGGRGLG SEWGLGKQDIESLNTLLKQLEEEKKTLESQVKYYALKLEQESKAYQKINNERRTYLAEMS QAFIVSDEKSVVTPIEDPFYRKVFRKLKWPLTLSSQSVRKVGFLVPHKSLMEKGSHGNLR KGIHGSVLSSLDRTFHIRAPDMLLPAFELPALKLYASVAVEMMCYNRRLPKICTVSRNYS QEGFRVKNCRVSQLYFCMPPGSKSDPQDTAPAAQEPGDLPRDRLQRWRRSGSEYAGLSVV RAALGRGLGVRSQLVTRSPKGPQDSLSSPRGQAPDPGSALTPQSGLLLPHAPELATGPL >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_1|1080_bp atggaagcaacgcttgcaggcaaagaaggagaaggaaaggctggagcaggccaagacagg aatgaaacagcccacatgaatattggagtgggacaactgacctccctggcccacaccgtg ctgagcgccctgcatctggtcaagggtggtaagagactacgtggaggacgaggacttggc agtgaatggggattggggaagcaggacatagaatccttaaacacattacttaaacagcta gaagaagaaaagaagactcttgaaagtcaagtgaaatactatgcacttaaactggaacaa gaatcaaaggcttaccagaagatcaacaatgaacgccgtacatacctagctgaaatgtct caggccttcatagtttctgatgagaaatcagttgttactcctattgaagatcccttttac agaaaagtttttaggaaattgaagtggcctttaactctctcaagtcagagtgtaaggaag gttggctttttagtgccacataagtcattaatggaaaaaggatctcatggaaatctaaga aaaggaattcacggatctgtcttatcctcattagacagaactttccacattagggcacct gacatgctgcttccagcctttgaattacctgcccttaagctgtatgcctctgtagcggtg gagatgatgtgctataacagaaggctgcctaagatttgcactgtaagcagaaactattca caggaaggtttccgtgtgaagaactgtagagtatcacagctgtatttttgcatgcccccg ggatctaaatcggacccccaggacacagcaccagctgctcaggaaccaggagacttgccc agggaccgcctgcagagatggcggcgcagtgggagtgagtacgcggggcttagtgtcgtg agagctgccctgggaaggggacttggagtgcgcagtcagctagtgacgcggtctcctaag ggcccccaggacagcttgagtagcccccgcgggcaggcccccgacccgggttccgcgttg acccctcagagcggcctgctgctcccccacgcgccagaactggcgacagggccactttaa >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_2|132_aa MEETAPIINDLHLVLPLTCGNITIQDDIWKAGSRDSQDPAPSDTTVYVGSRTKVMRAAMI VRAESRMAELEGSVLTPGDELSQLVPVSPSEPPSCRGFPQEHYELSPRKRFLVSHCYRME SLDCQLSKFLVY >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_2|399_bp atggaggaaacagcccccataatcaatgacctccatctggtcttgcccttgacatgtggg aatattacaattcaagatgacatttggaaggcaggctccagggactcccaggatcctgct ccctcggataccacagtctatgtaggcagcaggactaaagtgatgagagccgccatgata gtgagggccgagagcaggatggcggaattagaaggcagcgtcctgacccctggcgatgag ctgagtcaattggtccctgtctccccctcagagccaccatcttgtcgtggcttccctcag gaacattatgagctctcccccaggaagcgcttcttggtttcccattgttaccggatggaa agtcttgactgccagttgtccaagttcttggtgtattga >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_3|116_aa MIISTDAEKALDKIQHHFVIKTLNKLDIEGTYFKIIKTTHDRPTANIILNGEKLKAFSLR TGTRQGCRHAPLLLNIVLEVLARAFGQERNKGHPNWKRGIRTIAVFRRHDRIPRKS >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_3|351_bp atgatcatctcaacagatgcagaaaaagcactggacaaaatccagcatcactttgtgata aaaactctcaacaaactagacatagaagggacttacttcaaaataataaaaaccacacat gacagacccacagccaacatcatactgaacggggagaagttgaaagcattctccctgaga acaggaacaagacaaggctgccgacatgcaccactcctattgaatatagttctggaagtc ctagccagagcattcgggcaagagagaaataaagggcatccaaattggaaaagaggaatc cgaactattgctgttttcagacgacatgatcgtatacctagaaaatcctaa >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_4|229_aa MKEERNYNFDGVSTNRLKQQLLEEVRKNGTPIAKELRVKDLNPIKRFRPDGSGGEQTFKI KHFSIRPQQWNQRKKVTEANLEAEGESCHLRPSQGEKKLKAGEPQHPRTEGCGMNPSHFL KSPVKKPDWQAQALEHTALTIRDTKIVTSNCSEWKTRYETQLELNDELEKQIVYLKEKVE KIHGNSSGMEEGGGEKGAENWETVEDEHMEQLVTGTPLPSHAAGILSFS >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_4|690_bp atgaaggaagagagaaactacaacttcgacggtgtgagcaccaaccgcctgaaacagcag ttgctggaagaagtccgcaagaatgggacccctattgccaaggaacttagagtcaaagac ttaaatccaattaaacgttttaggccagatgggagtggaggtgagcagacctttaagata aaacattttagcatcaggccacaacagtggaaccagaggaaaaaagttacagaagctaat ttagaagctgaaggagagagctgtcatctcagaccttctcagggggagaaaaagctgaaa gcaggagagcctcagcaccctagaactgaagggtgtgggatgaatccctcccatttcctc aagtcaccagtcaagaaaccagactggcaggctcaggccctggagcatacagcactcacc atacgggacactaaaattgtaaccagcaattgtagtgaatggaaaacccgttatgagaca caacttgaattaaatgatgaactagaaaagcaaattgtttatctcaaggagaaagtggaa aaaatccatggaaactcttcaggtatggaagaaggtgggggtgagaaaggagctgaaaac tgggagacagttgaagatgaacacatggaacagttggtcactgggacccctctgccatcc cacgcagctgggatactctccttcagttag >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_5|741_aa MEQEPQNGEPAEIKIIREAYKKAFLFVNKGLNTDELGQKEEAKNYYKQGIGHLLRGISIS SKESEHTGPGWESARQMQQKMKETLQNVRTRLEILEKGLATSLQNDLQEVPKLYPEFPPK DMCEKLPEPQSFSSAPQHAEVNGNTSTPSAGAVAAPASLSLPSQSCPAEAPPAYTPQAAE GHYTVSYGTDSGEFSSVGEEFYRNHSQPPPLETLGLDADELILIPNGVQIFFVNPAGEVS APSYPGYLRIVRFLDNSLDTVLNRPPGFLQVCDWLYPLVPDRSPVLKCTAGAYMFPDTML QAAGCFVGVVLSSELPEDDRELFEDLLRQMSDLRLQANWNRAEEENEFQIPGRTRPSSDQ LKEASGTDVKQLDQGNKDVRHKGKRGKRGASEAIGQCQSSAAKPRRSGKESVREPWARVP GALGVAARKAGLAAKSEGEGVEGYLPLPQKSREGVETRRGGVGVLAPSPEKRDLPLRKMK GIEIKRRERLKCGTKIERRKRSRDSASWVSWGLVKGAEITGKAIQKGASKLRERIQPEEK PVEVSPAVTKGLYIAKQATGGAAKVSQFLVDGVCTVANCVGKELAPHVKKHGSKLVPESL KKDKDGKSPLDGAMVVAASSVQGFSTVWQGLECAAKCIVNNVSAETVQTVRYKYGYNAGE ATHHAVDSAVNVGVTAYNINNIGIKAMVKKTATQTGHTLLEDYQIVDNSQRENQEGAANV NVRGEKDEQTKEVKEAKKKDK >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_5|2226_bp atggagcaagagccacaaaatggagaacctgctgaaattaagatcatcagagaagcatat aagaaggcctttttatttgttaacaaaggtctgaatacagatgaattaggtcagaaggaa gaagcaaagaactactataagcaaggaataggacacctgctcagagggatcagcatttca tcaaaagagtctgaacacacaggtcctgggtgggaatctgctagacagatgcaacagaaa atgaaagaaactctacagaatgtacgcaccaggctggaaattctagagaagggtcttgcc acttctctgcagaatgatcttcaggaggtgcccaagttatatccagaatttccacctaaa gacatgtgtgaaaaattaccagagcctcagtcttttagttcagctcctcagcatgctgaa gtaaatggaaacacctcaactccaagtgcaggggcagttgctgcacctgcttctctgtct ttaccatcacaaagttgtccagcagaagctcctcctgcttatactcctcaagctgctgaa ggtcactacactgtatcctatggaacagattctggggagttttcatcagttggagaggag ttttataggaatcattctcagccaccgcctcttgagaccttagggctggatgcagatgaa ttgattttgataccaaatggagtacagattttttttgtaaatcctgcaggggaggttagt gcaccttcgtatcctgggtaccttcgaattgtgaggtttttggataattctctcgatacg gttctaaaccgtcctcccgggtttcttcaggtttgtgactggttatatcctctagttcct gatagatctccggttctgaaatgtactgcgggagcctacatgtttcctgatacaatgcta caagcagcaggatgctttgtgggggtcgtcctgtcctctgagttaccagaggatgataga gagctctttgaggatctgttaaggcaaatgtctgaccttcggctccaggccaactggaac agagcagaagaagaaaatgaattccaaatccctggaagaactagaccctcctctgaccaa ctaaaagaagcctctggcactgatgtgaaacagttggaccaaggcaataaggatgtacgt cataaaggaaaacgtggaaaaaggggggcttccgaggcgatcgggcagtgtcagtcttca gccgctaagccgagaagatctgggaaggagtcagtcagagagccttgggccagagttcca ggggctctgggagtggctgccagaaaagcaggacttgctgctaagagtgaaggagaaggg gttgaggggtacttgcccctcccccagaaaagcagagaaggggtagagacaaggagagga ggggttggggtacttgccccttccccagaaaagcgggacttgccgctaaggaaaatgaaa ggaattgaaattaagagaagggagagattgaagtgtggcaccaagattgaaaggagaaag aggtcgagggatagtgcttcctgggtgagttggggtttagtcaaaggtgctgagattact ggtaaggcaatccagaaaggtgcttctaaactccgagagcggattcaaccagaagaaaaa cccgtggaagttagtccagctgtcaccaagggactttatatagcgaagcaagctacagga ggagcagcaaaagtcagtcagttcctggttgatggagtttgcactgtagcaaattgcgtt ggaaaagaactagctccacatgtcaagaagcatggaagcaaacttgttccagaatctctt aaaaaagacaaagatgggaaatctcctctggatggtgctatggttgtagcagcaagtagt gttcaaggattttcaactgtctggcaaggattggaatgtgcagctaaatgcatcgttaac aatgtttcagcagaaactgtacaaactgtcagatacaaatacggatataatgcaggagaa gctacccaccatgcggtggattctgcggtcaatgttggcgtaactgcctacaatattaac aacattggtatcaaagcaatggtgaagaaaactgcaacacaaacaggacacactctcctt gaggactatcagatagttgataattctcagagggaaaatcaagaaggagcagcaaatgtc aacgtgagaggggagaaggatgagcagacgaaggaagtaaaggaggcaaagaagaaagat aaatga >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_6|262_aa MSNEISSSRMMNVRATLITRKAKSLFLDFFSYKPPIDTYPLLYGAQMKDIGHLQHLAQRL QWNRRAVNICGMHGWTKDLSPHSRMPSMLEKQCCSKGRLSSIDSIYLDIIPSSSRIFSDD TELLARDWCSHSWTTEFQACCNDSLATAYVHWRLWGSTISRVLSDKSVQKKKKEEKEKEG KGKKKRKKKKRKKKKRKKKKRNKKEEEKKKKEEEEEVEEEEGEGEGEEEEEEEEEEEEEE EEEEEEEEEEEEELKKQSSSFC >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_6|789_bp atgtcaaatgaaatcagcagttctagaatgatgaatgttagagctaccctaattactagg aaagccaagtctctcttcctggatttcttctcctataaaccgcctattgatacctaccca cttctctatggagcacagatgaaggatattggtcatctccagcatctagcacaacgtctg caatggaacaggcgagctgtgaatatttgtggaatgcatgggtggactaaagacctatca cctcactctagaatgcccagcatgttggagaaacagtgttgttcaaaagggagactttct agtatagatagcatctatttagatatcattcctagcagcagcaggattttctctgatgat acagagctgttagctagagattggtgtagtcactcatggaccactgagtttcaggcctgc tgtaatgactccctggctactgcctatgttcactggaggctctggggctctacaatcagc agggtgcttagtgataaatcagttcagaagaagaagaaggaggagaaggagaaggagggg aaggggaagaagaagaggaagaagaagaagaggaagaaaaagaagaggaagaagaagaag aggaataagaaggaggaggagaagaagaagaaagaagaggaagaagaggtagaagaagaa gaaggagaaggagaaggagaggaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaggaagaagaagaagaagaattaaagaaacaaagctctagc ttttgctga >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_7|135_aa MEYYSAIKKNEILSFATGWMELEIIMRLRLLAGLAVESQTELATESFSDPGNNNTVGRCY QNSRVSFIFKGTGNLGCKRTAVSQGIEVCDKRIEMSFMLNKIRFCVPPSSSLDSPDAYQL LNIYLNCKAFNRRFC >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_7|408_bp atggaatactattcagctataaaaaagaatgagatcctgtcatttgcaacaggatggatg gaactggagatcattatgcgactcaggctcttggcaggacttgctgtagagtcccagacg gagttagccactgagtccttcagcgacccaggaaataacaacactgttggcagatgttac cagaattcccgtgtttccttcatattcaaaggcacaggaaatctggggtgcaagcggact gctgtctctcaaggaatagaagtctgtgacaaacgtatagaaatgagtttcatgttgaac aagatccgtttctgcgttcctccctcatccagccttgattctccagatgcttaccaactt cttaatatttacctcaactgcaaggctttcaaccggcgcttttgttag >gi568815585r:36204368_36435830|GENSCAN_predicted_peptide_8|97_aa MAMRPRRAHACRGRHGNAPARSGGAADWPIQQTRQQPVESEAMHCSNPKSGVVLATVARG PDACQILTRAPLGQDPPQRTVLGLLTANGQYRRTCGQ >gi568815585r:36204368_36435830|GENSCAN_predicted_CDS_8|291_bp atggcgatgcggccccggagagcgcacgcctgccgcggtcggcatggaaacgctcccgct aggtccgggggcgccgctgattggccgattcaacagacgcggcagcagcccgtggagtct gaagcaatgcactgcagcaaccccaagagtggagttgtgctggctacagtggcccgaggt cccgatgcttgtcagatactcaccagagccccgctgggccaggatcccccgcagaggaca gtgctagggctgctaactgcaaatgggcagtacaggaggacctgtggccag