GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:26:47 Sequence gi568815588r:69304883_69516323 : 211441 bp : 49.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13765 14128 364 1 1 77 105 247 0.481 20.49 1.02 Intr + 38945 39107 163 1 1 130 119 151 0.897 21.95 1.03 Intr + 55015 55163 149 2 2 97 94 204 0.998 21.85 1.04 Intr + 59901 60020 120 2 0 113 94 91 0.981 12.89 1.05 Intr + 63654 63749 96 2 0 119 43 72 0.983 6.01 1.06 Intr + 64355 64502 148 1 1 106 39 205 0.917 17.21 1.07 Intr + 64559 64742 184 0 1 75 68 219 0.717 17.45 1.08 Intr + 69526 69611 86 1 2 96 38 57 0.451 0.96 1.09 Intr + 71994 72207 214 1 1 73 56 211 0.394 14.17 1.10 Intr + 74980 75213 234 1 0 72 79 223 0.999 16.60 1.11 Intr + 77605 77909 305 1 2 45 93 526 0.998 45.03 1.12 Intr + 79451 79599 149 0 2 37 78 257 0.998 19.55 1.13 Intr + 79914 80033 120 2 0 91 101 110 0.998 13.29 1.14 Intr + 81441 81536 96 2 0 104 78 41 0.967 4.91 1.15 Intr + 84315 84414 100 2 1 65 97 227 0.989 20.98 1.16 Intr + 87243 87426 184 1 1 102 86 242 0.999 24.25 1.17 Intr + 90068 90223 156 2 0 91 105 251 0.999 26.23 1.18 Intr + 93713 93946 234 2 0 118 79 229 0.939 21.80 1.19 Term + 96109 96253 145 1 1 105 48 209 0.758 15.98 1.20 PlyA + 97798 97803 6 -1.75 2.07 PlyA - 98025 98020 6 1.05 2.06 Term - 98122 98110 13 0 1 103 38 7 0.213 -5.13 2.05 Intr - 100202 100092 111 1 0 115 38 114 0.561 8.59 2.04 Intr - 102398 102202 197 2 2 16 99 322 0.990 24.31 2.03 Intr - 104193 104040 154 2 1 114 86 233 0.999 25.77 2.02 Intr - 110216 110063 154 0 1 56 67 219 0.975 15.73 2.01 Init - 111441 111042 400 0 1 110 -8 825 0.942 71.93 2.00 Prom - 114548 114509 40 -5.76 3.05 PlyA - 116510 116505 6 1.05 3.04 Term - 117161 117124 38 0 2 93 53 55 0.230 0.00 3.03 Intr - 117339 117225 115 0 1 80 31 66 0.251 0.12 3.02 Intr - 118138 117868 271 0 1 45 -19 154 0.180 -2.06 3.01 Init - 120369 120290 80 0 2 64 54 161 0.748 10.84 3.00 Prom - 124917 124878 40 -7.56 4.03 PlyA - 125794 125789 6 1.05 4.02 Term - 128400 128219 182 1 2 73 36 173 0.691 8.37 4.01 Init - 134469 134433 37 0 1 66 86 35 0.434 1.27 4.00 Prom - 136371 136332 40 -4.46 5.00 Prom + 140681 140720 40 -6.56 5.01 Init + 146713 146808 96 0 0 84 92 93 0.938 9.62 5.02 Intr + 172125 172193 69 2 0 100 91 11 0.642 1.98 5.03 Intr + 178809 178994 186 2 0 123 73 249 0.711 26.79 5.04 Intr + 180259 180333 75 0 0 78 80 122 0.538 10.11 5.05 Intr + 181741 181857 117 0 0 71 75 35 0.531 1.16 5.06 Intr + 190712 190807 96 1 0 48 105 230 0.999 20.91 5.07 Intr + 193398 193560 163 2 1 96 36 130 0.702 8.15 5.08 Intr + 197666 197751 86 0 2 115 97 37 0.946 6.84 5.09 Intr + 199556 199603 48 1 0 62 80 70 0.832 2.48 5.10 Intr + 201242 201358 117 1 0 88 80 238 0.969 23.66 5.11 Intr + 201947 202037 91 1 1 88 7 160 0.025 7.47 5.12 Intr + 206909 206988 80 0 2 109 99 87 0.043 11.17 5.13 Term + 210025 210036 12 0 0 124 48 -9 0.028 -3.10 5.14 PlyA + 211117 211122 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 189778 189939 162 0 0 108 53 91 0.838 5.88 S.002 Term + 201947 202096 150 1 0 88 49 207 0.951 14.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:69304883_69516323|GENSCAN_predicted_peptide_1|1082_aa XRERPPRPRSPAGTPPPGVAEARRNQWAWRRWVGWRLSPSRGRERGDRERASCRRAPGRG GGAGGGGGGGAAEQPPEDHGSPGLRRTDRPHACRPATPTASMIAAQLLAYYFTELKDDQV KKIDKYLYAMRLSDETLIDIMTRFRKEMKNGLSRDFNPTATVKMLPTFVRSIPDGSEKGD FIALDLGGSSFRILRVQVNHEKNQNVHMESEVYDTPENIVHGSGSQLFDHVAECLGDFME KRKIKDKKLPVGFTFSFPCQQSKIDEAILITWTKRFKASGVEGADVVKLLNKAIKKRGDY DANIVAVVNDTVGTMMTCGYDDQHCEVGLIIGNAFPFAHPFVRPIFPGTGTNACYMEELR HIDLVEGDEGRMCINTEWGAFGDDGSLEDIRTEFDREIDRGSLNPGKQLCLTAETPGHYL FSYHKEKGPGVTRSVITALVCGAQRKADKCRCAFLHRFEKMVSGMYLGELVRLILVKMAK EGLLFEGRITPELLTRGKFNTSDVSAIEKNKEGLHNAKEILTRLGVEPSDDDCVSVQHVC TIVSFRSANLVAATLGAILNRLRDNKGTPRLRTTVGVDGSLYKTHPQYSRRFHKTLRRLV PDSDVRFLLSESGSGKGAAMVTAVAYRLAEQHRQIEETLAHFHLTKDMLLEVKKRMRAEM ELGLRKQTHNNAVVKMLPSFVRRTPDGTENGDFLALDLGGTNFRVLLVKIRSGKKRTVEM HNKIYAIPIEIMQGTGEELFDHIVSCISDFLDYMGIKGPRMPLGFTFSFPCQQTSLDAGI LITWTKGFKATDCVGHDVVTLLRDAIKRREEFDLDVVAVVNDTVGTMMTCAYEEPTCEVG LIVGTGSNACYMEEMKNVEMVEGDQGQMCINMEWGAFGDNGCLDDIRTHYDRLVDEYSLN AGKQRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETLKTRGIFETKFLSQIESDRL ALLQVRAILQQLGLNSTCDDSILVKTVCGVVSRRAAQLCGAGMAAVVDKIRENRGLDRLN VTVGVDGTLYKLHPHFSRIMHQTVKELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTEA SS >gi568815588r:69304883_69516323|GENSCAN_predicted_CDS_1|3249_bp nngagggagcggccgccgcgtccccgctccccggccgggacgccaccgccgggcgttgca gaggcgcgccgcaaccaatgggcgtggaggaggtgggtcggctggcggctgtcaccctcc aggggacgggagcgcggagaccgggagcgcgcgagctgtcgccgcgccccgggccgaggg ggaggagccgggggaggaggaggaggaggagccgccgagcagccgccggaggaccacggc tcgccagggctgcggaggaccgaccgtccccacgcctgccgccccgcgaccccgaccgcc agcatgatcgccgcgcagctcctggcctattacttcacggagctgaaggatgaccaggtc aaaaagattgacaagtatctctatgccatgcggctctccgatgaaactctcatagatatc atgactcgcttcaggaaggagatgaagaatggcctctcccgggattttaatccaacagcc acagtcaagatgttgccaacattcgtaaggtccattcctgatggctctgaaaagggagat ttcattgccctggatcttggtgggtcttcctttcgaattctgcgggtgcaagtgaatcat gagaaaaaccagaatgttcacatggagtccgaggtttatgacaccccagagaacatcgtg cacggcagtggaagccagctttttgatcatgttgctgagtgcctgggagatttcatggag aaaaggaagatcaaggacaagaagttacctgtgggattcacgttttcttttccttgccaa caatccaaaatagatgaggccatcctgatcacctggacaaagcgatttaaagcgagcgga gtggaaggagcagatgtggtcaaactgcttaacaaagccatcaaaaagcgaggggactat gatgccaacatcgtagctgtggtgaatgacacagtgggcaccatgatgacctgtggctat gacgaccagcactgtgaagtcggcctgatcatcggtaatgcattcccctttgcccatcca tttgttcggcccatctttccaggcactggcaccaatgcttgctacatggaggaactgagg cacattgatctggtggaaggagacgaggggaggatgtgtatcaatacagaatggggagcc tttggagacgatggatcattagaagacatccggacagagtttgacagggagatagaccgg ggatccctcaaccctggaaaacagctgtgtctgactgccgagacacctgggcattacctg ttcagttatcacaaggaaaagggcccgggcgttacccgttcagttatcacagccctcgtg tgtggggcgcagaggaaggctgacaagtgccggtgtgcctttctccacaggtttgagaag atggtcagtggcatgtacttgggagagctggttcgactgatcctagtcaagatggccaag gagggcctcttatttgaagggcggatcaccccggagctgctcacccgagggaagtttaac accagtgatgtgtcagccatcgaaaagaataaggaaggcctccacaatgccaaagaaatc ctgacccgcctgggagtggagccgtccgatgatgactgtgtctcagtccagcacgtttgc accattgtctcatttcgctcagccaacttggtggctgccacactgggcgccatcttgaac cgcctgcgtgataacaagggcacacccaggctgcggaccacggttggtgtcgacggatct ctttacaagacgcacccacagtattcccggcgtttccacaagactctaaggcgcttggtg ccagactccgatgtgcgcttcctcctctcggagagtggcagcggcaagggggctgccatg gtgacggcggtggcctaccgcttggccgagcagcaccggcagatagaggagaccctggct catttccacctcaccaaggacatgctgctggaggtgaagaagaggatgcgggccgagatg gagctggggctgaggaagcagacgcacaacaatgccgtggttaagatgctgccctccttc gtccggagaactcccgacgggaccgagaatggtgacttcttggccctggatcttggagga accaatttccgtgtgctgctggtgaaaatccgtagtgggaaaaagagaacggtggaaatg cacaacaagatctacgccattcctattgaaatcatgcagggcactggggaagagctgttt gatcacattgtctcctgcatctctgacttcttggactacatggggatcaaaggccccagg atgcctctgggcttcacgttctcatttccctgccagcagacgagtctggacgcgggaatc ttgatcacgtggacaaagggttttaaggcaacagactgcgtgggccacgatgtagtcacc ttactaagggatgcgataaaaaggagagaggaatttgacctggacgtggtggctgtggtc aacgacacagtgggcaccatgatgacctgtgcttatgaggagcccacctgtgaggttgga ctcattgttgggaccggcagcaatgcctgctacatggaggagatgaagaacgtggagatg gtggagggggaccaggggcagatgtgcatcaacatggagtggggggcctttggggacaac gggtgtctggatgatatcaggacacactacgacagactggtggacgaatattccctaaat gctgggaaacaaaggtatgagaagatgatcagtggtatgtacctgggtgaaatcgtccgc aacatcttaatcgacttcaccaagaagggattcctcttccgagggcagatctctgagacg ctgaagacccggggcatctttgagaccaagtttctctctcagatcgagagtgaccgatta gcactgctccaggtccgggctatcctccagcagctaggtctgaatagcacctgcgatgac agtatcctcgtcaagacagtgtgcggggtggtgtccaggagggccgcacagctgtgtggc gcaggcatggctgcggttgtggataagatccgcgagaacagaggactggaccgtctgaat gtgactgtgggagtggacgggacactctacaagcttcatccacacttctccagaatcatg caccagacggtgaaggaactgtcaccaaaatgtaacgtgtccttcctcctgtctgaggat ggcagcggcaagggggccgccctcatcacggccgtgggcgtgcggttacgcacagaggca agcagctaa >gi568815588r:69304883_69516323|GENSCAN_predicted_peptide_2|342_aa MGTCDIVTEANISSGPESNTTGITAFSMPSWQLALWATAYLALVLVAVTGNAIVIWIILA HRRMRTVTNYFIVNLALADLCMAAFNAAFNFVYASHNIWYFGRAFCYFQNLFPITAMFVS IYSMTAIAADRQEAPSTKAVIAGIWLVALALASPQCFYSTVTMDQGATKCVVAWPEDSGG KTLLLYHLVVIALIYFLPLAVMFVAYSVIGLTLWRRAVPGHQAHGANLRHLQAMKKFVKT MVLVVLTFAICWLPYHLYFILGSFQEDIYCHKFIQQVYLALFWLAMSSTMYNPIIYCCLN HRFRSGFRLAFRCCPWVTPTKEDKLELTPTTSLSTRVNRRAM >gi568815588r:69304883_69516323|GENSCAN_predicted_CDS_2|1029_bp atggggacctgtgacattgtgactgaagccaatatctcatctggccctgagagcaacacc acgggcatcacagccttctccatgcccagctggcaactggcactgtgggccacagcctac ctggccctggtgctggtggccgtgacgggtaatgccatcgtcatctggatcatcctggcc catcggaggatgcgcacagtcaccaactacttcatcgtcaatctggcgctggctgacctc tgcatggctgccttcaatgccgccttcaactttgtctatgccagccacaacatctggtac tttggccgtgccttctgctacttccagaacctcttccccatcacagccatgtttgtcagc atctactccatgaccgccattgctgccgacaggcaagaggctcccagcaccaaggcggtt attgctggcatctggctggtggctctcgccctggcctcccctcagtgcttctactccacc gtcaccatggaccagggtgccaccaagtgcgtggtggcctggcccgaagacagcgggggc aagacgctcctcctgtaccacctcgtggtgatcgccctcatctacttcctgccgctcgcg gtgatgtttgtagcctacagcgtcatcggcctcacgctctggaggcgcgcagtgcccgga catcaggcgcacggtgccaacctgcgccacctgcaggccatgaagaagtttgtgaagacc atggtgctggtggtgctgacgtttgccatctgctggctgccctaccacctctacttcatc ctgggcagcttccaggaggacatctactgccacaagttcatccagcaagtctacctggca ctcttctggttggccatgagctctaccatgtacaatcccatcatctactgctgtctcaac cacaggtttcgctctggattccggcttgccttccgctgctgcccatgggtcacacccacc aaggaagataagctcgagctgactcccacgacctccctctccacgagagtcaacagaaga gccatgtaa >gi568815588r:69304883_69516323|GENSCAN_predicted_peptide_3|167_aa MAVIKAVIKAVIKAVIIIPGFVDFLVCNALRGMAKMGEGEELWEVGPPGPNAQLSAKLPG PRIVSETTAPTPFLKNPAEMDSRHLQSDLCRLMTSKGTPQMPWHYGRHQEGDSGSPGPCH STCCHIRQHPEQTPSSFCLQYPGWLWDDSPVTQPANGQILLEPDVEK >gi568815588r:69304883_69516323|GENSCAN_predicted_CDS_3|504_bp atggctgtcatcaaggctgtcatcaaggctgtcatcaaggctgtcatcatcattcctggc tttgtggatttcctagtatgcaatgccctgagaggaatggccaagatgggagaaggagag gagctctgggaagttgggcctccaggacccaatgcccaactaagcgccaaactcccaggg cccaggattgtctcagaaacaacagcccccacccccttcctcaagaaccctgcagaaatg gacagcaggcatctccaaagtgacctgtgtagattaatgaccagtaaaggcacaccccag atgccctggcactatggacggcaccaggagggggatagtgggagcccaggaccatgccac agcacgtgttgccacatccgccagcacccagagcagacgccaagttccttctgtctgcaa tatccaggctggctctgggatgactcccccgtgacccaaccagccaatggacagattctg ctggagcccgatgttgagaaatga >gi568815588r:69304883_69516323|GENSCAN_predicted_peptide_4|72_aa MGSGFEPVAIEEAQRIHLNSFLQQLPTPGARQKFQTSVVSGDIDTAAKLIGAGAATVGVA GSGADIGTVFGS >gi568815588r:69304883_69516323|GENSCAN_predicted_CDS_4|219_bp atggggtctggatttgagcctgtggctatagaggaagcccagagaattcatctaaacagc ttcctacagcaactccctactccaggggccagacagaagttccagaccagtgttgtctct ggggacattgacacagcagccaagttgattggtgctggggcagccacagttggtgtggct ggttcaggggctgacattggaactgtgtttggcagctag >gi568815588r:69304883_69516323|GENSCAN_predicted_peptide_5|411_aa MPRGDSEQVRYCARFSYLWLKFSLIIYSTVFWERGVGQQEPSCPGDVGEKTVFSRLIGAL VLSVGIYAEVERQKYKTLESAFLAPAIILILLGVVMFMVSFIGVLASLRDNLYLLQAFMY ILGICLIMELIGGVVALTFRNQQLLLRQQVLSHTLGCADLSDGPGSGPVKMFMGVPVIPA QTIDFLNDNIRRGIENYYDDLDFKNIMDFVQKKFKCCGGEDYRDWSKNQYHDCSAPGPLA CGVPYTCCIRNTVDTAPVGTGGLSGTPGWKRLLDPGLGACELQQSLLQNIDSMVRQTEVV NTMCGYKTIDKERFSVQDVIYVRGCTNAVIIWFMDNYTIMAGILLGILLPQFLGVLLTLL YITRVEDIIMEHSVTDGLLGPGTWMKLEAIILGKLTQEQKTKHRMFSLQLL >gi568815588r:69304883_69516323|GENSCAN_predicted_CDS_5|1236_bp atgccgcgcggggactcggagcaggtgcgctactgcgcgcgcttctcctacctctggctc aagttttcacttatcatctattccaccgtgttctgggaaaggggagtgggacagcaggaa cccagctgccctggggatgtgggagagaagactgttttctcccgactgattggggccctg gtcctgtctgtgggcatctatgcagaggttgagcggcagaaatataaaacccttgaaagt gccttcctggctccagccatcatcctcatcctcctgggcgtcgtcatgttcatggtctcc ttcattggtgtgctggcgtccctccgtgacaacctgtaccttctccaagcattcatgtac atccttgggatctgcctcatcatggagctcattggtggcgtggtggccttgaccttccgg aaccagcagttgctcttaaggcagcaggtgctgtcccacaccctgggttgtgcagacctc tcagatggcccaggatctggccctgtgaaaatgttcatgggagtgcctgtgattccagcc cagaccattgacttcctgaacgacaacattcgaagaggaattgagaactactatgatgat ctggacttcaaaaacatcatggactttgttcagaaaaagttcaagtgctgtggcggggag gactaccgagattggagcaagaatcagtaccacgactgcagtgcccctggacccctggcc tgtggggtgccctacacctgctgcatcaggaacacggtagacactgctcctgtggggact ggggggctgtcggggaccccaggatggaagaggctactggaccctggtcttggggcctgt gaacttcagcagagcctcttgcagaatatcgattctatggtcaggcagacagaagttgtc aacaccatgtgtggctacaaaactatcgacaaggagcgtttcagtgtgcaggatgtcatc tacgtgcggggctgcaccaacgccgtgatcatctggttcatggacaactacaccatcatg gcgggcatcctcctgggcatcctgcttccccagttcctgggggtgctgctgacgctgctg tacatcacccgggtggaggacatcatcatggagcactctgtcactgatgggctcctgggg cccgggacatggatgaagctggaagccatcattctcggcaaactgacacaggaacagaaa accaaacaccgcatgttctcactccaactgctttga