GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:41:22 Sequence gi568815586r:54544716_54748303 : 203588 bp : 44.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5145 5270 126 0 0 96 84 156 0.815 15.79 1.02 Intr + 22259 22372 114 2 0 115 93 187 0.995 21.46 1.03 Intr + 24469 24651 183 1 0 75 89 358 0.999 33.50 1.04 Intr + 24831 24897 67 0 1 106 84 30 0.948 3.31 1.05 Intr + 25526 25642 117 1 0 86 53 102 0.960 7.16 1.06 Intr + 27886 28026 141 0 0 106 66 170 0.995 17.15 1.07 Intr + 28433 28533 101 1 2 118 61 208 0.999 20.01 1.08 Intr + 28640 28765 126 2 0 76 30 205 0.998 13.29 1.09 Intr + 28893 28994 102 0 0 86 23 122 0.652 4.79 1.10 Intr + 30383 30503 121 2 1 72 74 148 0.989 12.30 1.11 Intr + 30717 30785 69 2 0 79 82 27 0.599 0.58 1.12 Intr + 30836 30917 82 1 1 117 95 24 0.999 5.21 1.13 Intr + 31277 31385 109 0 1 70 94 190 0.981 17.14 1.14 Intr + 31856 31986 131 2 2 59 52 146 0.995 8.44 1.15 Term + 32510 32613 104 0 2 117 41 205 0.999 17.14 1.16 PlyA + 32835 32840 6 -3.64 2.18 PlyA - 33037 33032 6 -0.45 2.17 Term - 33246 33112 135 0 0 92 49 159 0.995 10.42 2.16 Intr - 36335 36229 107 0 2 102 103 34 0.978 6.23 2.15 Intr - 37416 37261 156 1 0 104 53 73 0.972 5.38 2.14 Intr - 38080 38017 64 1 1 99 47 34 0.976 -1.31 2.13 Intr - 38533 38496 38 2 2 129 105 50 0.994 8.78 2.12 Intr - 39605 39545 61 2 1 98 115 21 0.881 4.11 2.11 Intr - 43801 43690 112 0 1 98 80 187 0.589 19.28 2.10 Intr - 45898 45846 53 1 2 84 78 12 0.167 -2.49 2.09 Intr - 59566 59475 92 2 2 109 74 62 0.604 6.61 2.08 Intr - 64962 64918 45 1 0 103 95 47 0.596 5.28 2.07 Intr - 87124 87023 102 2 0 56 87 61 0.612 2.95 2.06 Intr - 87666 87436 231 1 0 66 85 145 0.602 9.74 2.05 Intr - 90095 90069 27 0 0 82 105 33 0.052 2.49 2.04 Intr - 100547 100458 90 0 0 87 113 17 0.152 4.07 2.03 Intr - 100992 100891 102 1 0 95 94 14 0.202 2.85 2.02 Intr - 102444 102406 39 1 0 123 84 27 0.331 4.00 2.01 Init - 103588 103531 58 1 1 74 76 136 0.754 10.47 2.00 Prom - 107623 107584 40 -6.36 3.00 Prom + 110788 110827 40 -5.26 3.01 Init + 114744 114824 81 2 0 79 100 63 0.424 7.63 3.02 Intr + 121508 121572 65 1 2 119 78 45 0.515 4.12 3.03 Term + 123765 123849 85 0 1 58 36 98 0.230 -1.27 3.04 PlyA + 123909 123914 6 1.05 4.04 PlyA - 125457 125452 6 1.05 4.03 Term - 144865 144683 183 1 0 54 34 102 0.022 -1.06 4.02 Intr - 176931 176716 216 0 0 44 28 250 0.170 13.30 4.01 Init - 183154 183065 90 1 0 61 43 85 0.277 1.79 4.00 Prom - 187171 187132 40 -1.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 90126 90069 58 0 1 37 105 83 0.815 5.18 S.002 Sngl - 98601 98233 369 0 0 58 43 190 0.907 7.61 S.003 Term - 100041 99931 111 0 0 85 42 116 0.822 5.26 S.004 Init + 149987 150119 133 1 1 78 47 91 0.807 4.30 S.005 Term - 161601 161522 80 1 2 112 54 36 0.845 0.53 S.006 Init - 161743 161689 55 1 1 70 56 74 0.922 3.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:54544716_54748303|GENSCAN_predicted_peptide_1|564_aa XPWLSMELSPRSPPEMLEESDCPSPLELKSAPSKKMWIKLRSLLRYMVKQLENGEINIEE LKKNLEYTASLLEAVYIDETRQILDTEDELQELRSDAVPSEVRDWLASTFTQQARAKGRR AEEKPKFRSIVHAVQAGIFVERMFRRTYTSVGPTYSTAVLNCLKNLDLWCFDVFSLNQAA DDHALRTIVFELLTRHNLISRFKIPTVFLMSFLDALETGYGKYKNPYHNQIHAADVTQTV HCFLLRTGMVHCLSEIELLAIIFAAAIHDYEHTGTTNSFHIQTKSECAIVYNDRSVLENH HISSVFRLMQDDEMNIFINLTKDEFVELRALVIEMVLATDMSCHFQQVKTMKTALQQLER IDKPKALSLLLHAADISHPTKQWLVHSRWTKALMEEFFRQVNSAEISHNNPPPPPLATCT PDLGDKEAELGLPFSPLCDRTSTLVAQSQIGFIDFIVEPTFSVLTDVAEKSVQPLADEDS KSKNQPSFQWRQPSLDVEVGDPNPDVVSFRSTWVKRIQENKQKWKERAASGITNQMSIDE LSPCEEEAPPSPAEDEHNQNGNLD >gi568815586r:54544716_54748303|GENSCAN_predicted_CDS_1|1695_bp nnaccgtggctgagcatggagctgtccccccgcagtcctccggagatgctggaggagtcg gattgcccgtcacccctggagctgaagtcagcccccagcaagaagatgtggattaagctt cggtctctgctgcgctacatggtgaagcagttggagaatggggagataaacattgaggag ctgaagaaaaatctggagtacacagcttctctgctggaagccgtctacatagatgagaca cggcaaatcttggacacggaggacgagctgcaggagctgcggtcagatgccgtgccttcg gaggtgcgggactggctggcctccaccttcacccagcaggcccgggccaaaggccgccga gcagaggagaagcccaagttccgaagcattgtgcacgctgtgcaggctgggatcttcgtg gaacggatgttccggagaacatacacctctgtgggccccacttactctactgcggttctc aactgtctcaagaacctggatctctggtgctttgatgtcttttccttgaaccaggcagca gatgaccatgccctgaggaccattgtttttgagttgctgactcggcataacctcatcagc cgcttcaagattcccactgtgtttttgatgagtttcctggatgccttggagacaggctat gggaagtacaagaatccttaccacaaccagatccacgcagccgatgttacccagacagtc cattgcttcttgctccgcacagggatggtgcactgcctgtcggagattgagctcctggcc atcatctttgctgcagctatccatgattatgagcacacgggcactaccaacagcttccac atccagaccaagtcagaatgtgccatcgtgtacaatgatcgttcagtgctggagaatcac cacatcagctctgttttccgattgatgcaggatgatgagatgaacattttcatcaacctc accaaggatgagtttgtagaactccgagccctggtcattgagatggtgttggccacagac atgtcctgccatttccagcaagtgaagaccatgaagacagccttgcaacagctggagagg attgacaagcccaaggccctgtctctactgctccatgctgctgacatcagccacccaacc aagcagtggttggtccacagccgttggaccaaggccctcatggaggaattcttccgtcag gtcaacagtgcagaaatttcacacaacaaccccccacccccaccacttgccacctgtacc ccagatctgggtgacaaggaggcagagttgggcctgcccttttctccactctgtgaccgc acttccactctagtggcacagtctcagatagggttcatcgacttcattgtggagcccaca ttctctgtgctgactgacgtggcagagaagagtgttcagcccctggcggatgaggactcc aagtctaaaaaccagcccagctttcagtggcgccagccctctctggatgtggaagtggga gaccccaaccctgatgtggtcagctttcgttccacctgggtcaagcgcattcaggagaat aagcagaaatggaaggaacgggcagcaagtggcatcaccaaccagatgtccattgacgag ctgtccccctgtgaagaagaggcccccccatcccctgccgaagatgaacacaaccagaat gggaatctggattag >gi568815586r:54544716_54748303|GENSCAN_predicted_peptide_2|503_aa MRFMTLLFLTALAGALVCAYDPEAASAPGSGNPCHEASAAQKENAGEDPGLARQAPKPRK QRSSLLEKGLDGAKKAVGGLGKLGKDAVEDLESVGKAVAGALVYAAKPNEEISGPAEPAS PPETTTTAQETSAAAVQGTAKVTSSRQELNPLSKSLSLCQINNLEKSLAAGPHHTSTHRD KPESIVEKSILLTEQALAKAGKGMHGGVPGGKQFIEKPEGEIHSETQPADAWVEGGRKKP STQVQGAERDVMEGQRSAAVPQEITIKPAATINRDLLCTSAGPPAAAPAMEQDNSPRKIQ FTVPLLEPHLDPEAAEQIRRRRPTPATLVLTSDQSSPEIDEDRIPNPHLKSTLAMSPRQR KKMTRITPTMKELQMMVEHHLGQQQQGEEPEGAAESTGTQESRPPGIPDTEVESRLGTSG TAKKTAECIPKTHERGSKEPSTKEPSTHIPPLDSKGANSVGCFFHDHAPLEQMGEPVLMD GADDIKHFGLNEDLGKEGETSVP >gi568815586r:54544716_54748303|GENSCAN_predicted_CDS_2|1512_bp atgaggttcatgactctcctcttcctgacagctctggcaggagccctggtctgtgcctat gatccagaggccgcctctgccccaggatcggggaacccttgccatgaagcatcagcagct caaaaggaaaatgcaggtgaagacccagggttagccagacaggcaccaaagccaaggaag cagagatccagccttctggaaaaaggcctagacggagcaaaaaaagctgtggggggactc ggaaaactaggaaaagatgcagtcgaagatctagaaagcgtgggtaaagctgtagcaggg gccctggtctatgctgctaagcctaatgaagagatctcaggtccagcagaaccagcttca cccccagagacaaccacaacagcccaggagacttcggcggcagcagttcaggggacagcc aaggtcacctcaagcaggcaggaactaaaccccctgagtaagtctctgtctctatgccag atcaacaacctagaaaagtctctggctgcaggcccacatcacacctccacgcacagagat aagcctgaatccatagtggagaaaagtatcttactaacagaacaagcccttgcaaaagca ggaaaaggaatgcacggaggcgtgccaggtggaaaacaattcatcgaaaagccagaaggt gaaattcactcggagactcagcctgcagatgcctgggtagaaggtggaagaaaaaaacct tctacacaggttcaaggtgcagagagagatgtcatggagggccagcgctcagccgcagtg cctcaggaaataacaataaaaccagcagctacaattaaccgtgaccttctatgcaccagc gcgggcccaccggccgccgccccagccatggagcaagacaacagcccccgaaagatccag ttcacggtcccgctgctggagccgcaccttgaccccgaggcggcggagcagattcggagg cgccgccccacccctgccaccctcgtgctgaccagtgaccagtcatccccagagatagat gaagaccggatccccaacccacatctcaagtccactttggcaatgtctccacggcaacgg aagaagatgacaaggatcacacccacaatgaaagagctccagatgatggttgaacatcac ctggggcaacagcagcaaggagaggaacctgagggggccgctgagagcacaggaacccag gagtcccgcccacctgggatcccagacacagaagtggagtcaaggctgggcacctctggg acagcaaaaaaaactgcagaatgcatccctaaaactcacgagagaggcagtaaggaaccc agcacaaaagaaccctcaacccatataccaccactggattccaagggagccaactcggtg ggttgtttcttccacgaccacgctcccttggagcagatgggggagccagtcctgatggat ggtgctgatgacatcaaacactttggactcaatgaagacctggggaaagagggagaaact tcagtgccctga >gi568815586r:54544716_54748303|GENSCAN_predicted_peptide_3|76_aa MDVGSMPPLPPVEVVLVLSIVVRPEAKSDWLKDVPWSEQMEEGVEQDLRVEKSNNNKELD QSSKDSEGKKEQEKNK >gi568815586r:54544716_54748303|GENSCAN_predicted_CDS_3|231_bp atggacgtgggcagcatgccacctcttcctcctgtagaggtggttctggtgctcagcatt gtggtgaggcctgaggcgaagagtgactggttgaaggatgtgccctggagtgagcaaatg gaggaaggagtcgagcaggacctgagagtagaaaagagcaacaacaacaaagaacttgat caatccagcaaagattcagaaggaaagaaagaacaagaaaagaataaataa >gi568815586r:54544716_54748303|GENSCAN_predicted_peptide_4|162_aa MREGLEVPRDLSTACDQDADSDMDNEVQAESTEQGSVSFRKDYKMKEKEEEEEEKKKRKK RRRKGGVGEGEGGEGEKKKEEAAEEEEEKKKAATEEEEGRRSKNRYQVSSTLYYSSLLAV IDSVGRGRGRKKGRLSCQHAIIHTITMSGKLCGVPDTSSTFC >gi568815586r:54544716_54748303|GENSCAN_predicted_CDS_4|489_bp atgagggaaggtttggaagttcctagagacttgtcaactgcttgtgaccaagatgctgat agtgatatggacaatgaagtccaggctgagagcacagaacagggctctgtcagtttcaga aaggattataagatgaaggagaaggaggaggaggaggaggagaagaagaagaggaagaag aggaggaggaaaggaggagtgggagaaggagagggaggggaaggagagaagaagaaggag gaggcggcggaggaggaggaagagaagaagaaggcagcgactgaggaggaggaggggagg aggagtaaaaaccgctaccaggtcagcagcaccctgtactactccagtcttttggctgtt atagactcagtgggcagaggaagaggtagaaagaaggggagactatcctgtcagcatgcc atcatccacactatcaccatgtcaggaaaattgtgtggtgttcctgatacatcttccaca ttctgttaa