GENSCAN 1.0    Date run: 3-Nov-116    Time: 15:47:00

Sequence gi568815584r:68689536_68892938 : 203403 bp : 47.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3555   3702  148  2  1   51    1   145 0.437   2.35
 1.02 Term +   3789   3907  119  1  2   83   39   106 0.656   3.90
 1.03 PlyA +   4071   4076    6                               1.05

 2.04 PlyA -   9709   9704    6                               1.05
 2.03 Term -  16287  15971  317  1  2   75   48   145 0.522   4.30
 2.02 Intr -  21547  21443  105  2  0   72   44   110 0.633   5.19
 2.01 Init -  27544  27445  100  1  1   75   94    53 0.604   4.18
 2.00 Prom -  34298  34259   40                              -6.56

 3.00 Prom +  42771  42810   40                              -5.46
 3.01 Init +  43768  43809   42  0  0   88   96    44 0.934   5.64
 3.02 Intr +  44411  44604  194  1  2   91   59    92 0.875   4.89
 3.03 Intr +  47337  47468  132  0  0   70  100    25 0.577   1.66
 3.04 Intr +  62287  62313   27  1  0   74   99    55 0.020   2.43
 3.05 Intr +  73816  73852   37  1  1  102   75    27 0.004   1.06
 3.06 Intr +  83551  83671  121  0  1   57   99    41 0.011   2.17
 3.07 Intr +  86590  86673   84  2  0   92   70    49 0.474   3.29
 3.08 Term +  91596  91780  185  1  2   47   43   110 0.310   0.11
 3.09 PlyA +  92836  92841    6                               1.05

 4.00 Prom +  93670  93709   40                              -0.26
 4.01 Init +  96013  96069   57  0  0   61   86    57 0.942   4.15
 4.02 Term +  98048  98134   87  1  0  105   48    79 0.903   3.26
 4.03 PlyA +  98199  98204    6                               1.05

 5.04 PlyA -  98401  98396    6                              -0.45
 5.03 Term - 100957  99998  960  1  0  136   46  1190 0.999 111.66
 5.02 Intr - 103535 103347  189  2  0   48   98    87 0.352   5.48
 5.01 Init - 106614 106540   75  0  0   37   94   111 0.478   5.69
 5.00 Prom - 118412 118373   40                              -4.36

 6.00 Prom + 119848 119887   40                              -7.76
 6.01 Init + 120169 120232   64  0  1   81   80    20 0.096   1.81
 6.02 Intr + 127087 127419  333  2  0   55   81   175 0.282   9.04
 6.03 Intr + 127453 127648  196  2  1   95   -6    52 0.457  -5.03
 6.04 Intr + 130839 131009  171  0  0   88  116    72 0.790   9.06
 6.05 Intr + 132687 132820  134  0  2   89   47    46 0.175   0.89
 6.06 Intr + 161025 161085   61  1  1   86  100    39 0.292   2.69
 6.07 Intr + 161767 161833   67  1  1  101   50    45 0.313   0.91
 6.08 Term + 164860 164967  108  0  0   90   45   104 0.426   4.81
 6.09 PlyA + 166460 166465    6                               1.05

 7.00 Prom + 170700 170739   40                              -1.76
 7.01 Init + 172227 172455  229  2  1   70   14   255 0.774  14.83
 7.02 Intr + 174732 174860  129  1  0   85   96    63 0.980   7.47
 7.03 Term + 176388 176503  116  1  2   96   52    42 0.955   0.13
 7.04 PlyA + 177274 177279    6                               1.05

 8.14 PlyA - 177538 177533    6                               1.05
 8.13 Term - 185605 185324  282  1  0   31   36   481 0.737  32.63
 8.12 Intr - 187705 187547  159  1  0   90   98   254 0.986  26.78
 8.11 Intr - 188988 188923   66  0  0   80   52    91 0.929   3.80
 8.10 Intr - 189534 189454   81  0  0   45   78   122 0.990   6.73
 8.09 Intr - 190573 190427  147  1  0   99   80   329 0.992  33.63
 8.08 Intr - 191454 191275  180  0  0   20   90   422 0.980  35.66
 8.07 Intr - 193057 192923  135  1  0  101   90   242 0.998  26.46
 8.06 Intr - 193520 193338  183  2  0   83   94   328 0.992  32.88
 8.05 Intr - 194773 194633  141  1  0   69   99   312 0.998  30.95
 8.04 Intr - 195348 195240  109  2  1  129   78   105 0.999  13.89
 8.03 Intr - 196040 195890  151  0  1   91   58   364 0.797  32.92
 8.02 Intr - 200751 200604  148  0  1  138   89   266 0.999  31.51
 8.01 Intr - 202748 202518  231  2  0  110   97   430 0.984  44.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_1|88_aa
MSPKFNPNKIKVVYLRCTSGEVSTTSALVPKISPLGLSPKKANEDITKTMIMKALEEPPR
DRNKQKNIKYSGNIICGEIINIAQQMQH

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_1|267_bp
atgtcacccaagttcaaccccaacaagatcaaagtcgtatacctgaggtgcacgagtggg
gaagtcagtactacatctgctctggttcctaagatcagccccctgggtctgtctccaaaa
aaggccaatgaggacatcaccaagacaatgatcatgaaagcccttgaggaaccaccaaga
gacagaaacaagcagaaaaacattaagtacagtggaaatatcatttgtggtgagatcatc
aacattgcccaacagatgcagcactga

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_2|173_aa
MNSLNPIALLWAAAGEAMRCQPIDRVPGIYQAPGSKHQIQEISKAKTSAAQDGDTGAYFS
SGGCGEGRSLGPSDCYQVTLCRQGSFGVFPEKLTKAKAYLGLGFLQELWLASSPRHTLTL
HQGDLNFIDGENGEGRHYVNWPFILTLPASHFPMGNRPTSPSLPVTQGLNQGF

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_2|522_bp
atgaactcgctcaatcccattgctttactctgggcagcagctggggaagccatgagatgt
cagccaattgacagagtacccggcatttatcaggccccaggttcaaaacaccagatccag
gagataagtaaagccaagaccagcgctgcccaggatggagacacaggtgcctacttctcc
agtggtggctgcggcgaaggcagaagtcttgggccaagcgactgctaccaagtgacactg
tgcaggcaggggagcttcggtgtcttccctgagaaactgaccaaggccaaggcatacttg
ggcttaggtttcctgcaggaactatggctagctagctcaccccggcacacgctgaccttg
catcaaggtgacctgaacttcattgatggggagaatggggaagggagacattacgtgaac
tggccatttattctcacgctgccagccagccactttcccatgggaaacagacccacttca
ccctcactccccgtcactcaagggctgaaccagggtttctga

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_3|273_aa
MQTKALTVTAEAGRMRKLKQGHKPDDLKGSHYCLLKNPNQESQINLPSILFGNHMSTLQL
YLACVPQGGRCRGLRKLRRTQYLRISALLQPRAAWKPSGKDVVSRVGPESRWNGVHVAGN
KGRNSIMIGCCRDGVTDVFILLEELPKFPDSTLGTHHPESVVGGEEDAKTLALCGEGRAV
WAGGVPWLSHFMLWRMEKEKLGKEEDPDLEQKGINDFLEWRGTARMQFNHNGQYLSENKA
PKEGTPMPATMLVPSTASPKVTSFKQTIKLLTD

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_3|822_bp
atgcagacaaaggcactgacagttacagcagaagctggaaggatgagaaaactgaaacaa
ggtcacaaaccagatgatctaaagggtagccactactgcctcctaaagaaccccaatcag
gaatcccagattaatttgcccagcatcctgtttggaaaccacatgtccaccctgcaactg
tacctggcttgtgtcccacaaggaggaagatgcaggggactccggaagctaaggaggacc
cagtatctcaggatttctgccctcctccagccccgagcagcctggaagccaagtggcaag
gatgtagtcagccgggtgggccctgagagccggtggaatggtgtccatgttgcaggaaac
aagggaagaaacagcatcatgatcggctgctgcagggatggggtcacggatgtgtttatc
ttgttggaggagcttcccaagttccctgactcaacactgggcacccaccacccagagtcg
gtggttggcggggaggaagatgctaagacactggccctatgtggggaggggagggctgta
tgggctggaggagtcccgtggctgtctcatttcatgctctggaggatggagaaggaaaaa
cttggaaaagaggaagacccagacttggaacaaaaaggaataaatgacttccttgaatgg
agaggaactgctagaatgcagtttaatcataatggacagtatctctcagaaaacaaggca
ccaaaagagggaacaccgatgccagcaaccatgctggtgccttctacagccagccccaag
gtcacaagtttcaagcagactataaagctgctaactgattag

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_4|47_aa
MADLVQAPKPSLQDTLASRDLQGIQGENLTGKVKPKPSKTSSTINVS

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_4|144_bp
atggcagacctggtgcaggcccctaaacccagccttcaagacaccctggcctccagggac
ttgcaaggtatacaaggggagaaccttactggcaaagtcaaaccaaaaccaagtaaaacc
agctccaccatcaatgtttcttga

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_5|407_aa
MGRARKSGLGQRSRPTASRSEAAVQPGVRKARGAGNWRVGLQTGEAAPSPHRDLRDTPDP
RPWLARTHRMTTTLVSATIFDLSEVLCKGNKMLNYSAPSAGGCLLDRKAVGTPAGGGFPR
RHSVTLPSSKFHQNQLLSSLKGEPAPALSSRDSRFRDRSFSEGGERLLPTQKQPGGGQVN
SSRYKTELCRPFEENGACKYGDKCQFAHGIHELRSLTRHPKYKTELCRTFHTIGFCPYGP
RCHFIHNAEERRALAGARDLSADRPRLQHSFSFAGFPSAAATAAATGLLDSPTSITPPPI
LSADDLLGSPTLPDGTNNPFAFSSQELASLFAPSMGLPGGGSPTTFLFRPMSESPHMFDS
PPSPQDSLSDQEGYLSSSSSSHSGSDSPTLDNSRRLPIFSRLSISDD

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_5|1224_bp
atggggagggcccggaagtcgggcctgggacagaggagccggcccaccgcctcccggtcg
gaagcggcagtgcagcccggagtcagaaaggcgaggggcgccgggaactggcgtgtggga
ctccagacaggagaggctgcgccttccccgcaccgggaccttcgcgacacaccagatcct
cgcccctggctcgcgcgaacgcacaggatgaccaccaccctcgtgtctgccaccatcttc
gacttgagcgaagttttatgcaagggtaacaagatgctcaactatagtgctcccagtgca
gggggttgcctgctggacagaaaggcagtgggcacccctgctggtgggggcttccctcgg
aggcactcagtcaccctgcccagctccaagttccaccagaaccagctcctcagcagcctc
aagggtgagccagcccccgctctgagctcgcgagacagccgcttccgagaccgctccttc
tcggaagggggcgagcggctgctgcccacccagaagcagcccgggggcggccaggtcaac
tccagccgctacaagacggagctgtgccgcccctttgaggaaaacggtgcctgtaagtac
ggggacaagtgccagttcgcacacggcatccacgagctccgcagcctgacccgccacccc
aagtacaagacggagctgtgccgcaccttccacaccatcggcttttgcccctacgggccc
cgctgccacttcatccacaacgctgaagagcgccgtgccctggccggggcccgggacctc
tccgctgaccgtccccgcctccagcatagctttagctttgctgggtttcccagtgccgct
gccaccgccgctgccaccgggctgctggacagccccacgtccatcaccccaccccctatt
ctgagcgccgatgacctcctgggctcacctaccctgcccgatggcaccaataaccctttt
gccttctccagccaggagctggcaagcctctttgcccctagcatggggctgcccgggggt
ggctccccgaccaccttcctcttccggcccatgtccgagtcccctcacatgtttgactct
ccccccagccctcaggattctctctcggaccaggagggctacctgagcagctccagcagc
agccacagtggctcagactccccgaccttggacaactcaagacgcctgcccatcttcagc
agactttccatctcagatgactaa

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_6|377_aa
MIPYGWLTGEFEQEPCARRTSGPRPSRRAHQARPARGDTGAGAAAAELPGRLGTQGAGEP
TLLQEAAAWGEVLATASFPSSRGRSPPEALGGSTMDVATLPSSGQLAEQHVWAPFGGLGD
VSTLSLPFATFEGHQSCHSAHSPNSRGFAAFGGSQTSVCTGIPPWTVLSNPDCWVPQPSQ
RICISNKSSGDAELLVRCLESSKARELTQPPGSRWLPFFDIHQCHWLSSGYTAIFSLKPP
NQLVKSVAPLCFLSRMKEALSSKDMATESATKDRPRGGEGLPTALPLLAELNGQGLTQAG
PGVEKIHWESCQHVTGLVARDRSPRSTQTTNQKKKNYCISSIPRTCFPDSSSPTTIVAFT
YGSTSISTHDEWTLRQC

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_6|1134_bp
atgataccctatggctggctcactggagaatttgagcaggagccctgtgccaggagaaca
tcaggcccgaggccatccagacgtgcacaccaggcccggcccgcgaggggagacacaggg
gccggggctgcggcggctgagctccctgggcggctcggaacgcaaggggccggcgagccg
acgctgctgcaggaggctgcggcgtggggcgaggttctcgctacggcttcgttcccgtcc
tcccgcggccgctctcccccggaagccctcggaggcagcacgatggacgtggccacactg
ccctccagtggccagctcgccgaacaacacgtctgggccccttttgggggtctgggtgac
gtgtccaccctctctctgccctttgctacatttgaaggtcaccaaagctgccacagtgca
cactcacccaacagcaggggatttgccgccttcggtggttctcaaacttcagtgtgcaca
ggaatccccccttggacagttctttcaaacccggattgctgggtcccccaaccaagtcag
agaatttgcatttctaacaaatcctcaggcgatgctgagctgctggtccgctgtctggag
tccagcaaggcaagggaactgacacagcccccagggtcacgatggcttccattctttgac
attcaccagtgtcactggctgagctcaggctacacggccatcttctctttgaagcctcca
aaccagctggtgaagtcagtagcacccctgtgttttttatcaaggatgaaagaagcatta
tccagcaaggacatggccactgaaagtgccaccaaggaccggccaaggggaggggaaggg
ctccctacagcccttcccctgctggctgagttgaatggccagggacttacccaggctggt
cctggggtggagaaaattcactgggaaagctgccagcatgtaacaggcctcgtggcaaga
gacagaagccccagaagcacacaaaccaccaatcagaagaaaaagaactactgcatttca
tcgattcccaggacatgcttccctgacagcagctctccaaccaccattgtggcttttact
tatggctccacgagtattagtacccatgatgagtggacactgcgacagtgctaa

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_7|157_aa
MVMASDFYLRYYVGHKGKFGHEFLESEFQLDGKLRYANSSNYKNDVMIRKEAYVHKSLME
ELKRIIDDNEITKEDDEAGMVSLPVGLSVPVLFEDLVWGSFPAHLKGLERDFQVKSLSPD
LQSQGGEAGAAGSHFAALERDTFENGVQPEKLGPAGI

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_7|474_bp
atggttatggctagcgatttctacctgcgctactacgtagggcacaagggcaagtttggg
catgagtttttggagtccgaatttcagctggatggaaagcttagatatgccaacagtagc
aattacaaaaatgacgtcatgatcagaaaagaggcttacgtgcacaagagtttaatggaa
gaactgaagagaattattgatgacaatgaaatcacaaaagaagatgatgaagcaggaatg
gtttctttacctgtgggtctaagtgttccagtgctgtttgaggacctggtgtggggctcc
ttcccagcccatctcaaaggtctagagcgggacttccaggtgaagtcactgtccccagac
ttgcaatcacaaggtggggaagctggagctgctggaagtcattttgctgccttggagaga
gacacttttgagaatggggtccagccagagaagctaggccctgctggcatttga

>gi568815584r:68689536_68892938|GENSCAN_predicted_peptide_8|670_aa
LLEWIRRTIPWLENRVPENTMHAMQQKLEDFRDYRRLHKPPKVQEKCQLEINFNTLQTKL
RLSNRPAFMPSEGRMVSDINNAWGCLEQVEKGYEEWLLNEIRRLERLDHLAEKFRQKASI
HEAWTDGKEAMLRQKDYETATLSEIKALLKKHEAFESDLAAHQDRVEQIAAIAQELNELD
YYDSPSVNARCQKICDQWDNLGALTQKRREALERTEKLLETIDQLYLEYAKRAAPFNNWM
EGAMEDLQDTFIVHTIEEIQGLTTAHEQFKATLPDADKERLAILGIHNEVSKIVQTYHVN
MAGTNPYTTITPQEINGKWDHVRQLVPRRDQALTEEHARQQHNERLRKQFGAQANVIGPW
IQTKMEEIGRISIEMHGTLEDQLSHLRQYEKSIVNYKPKIDQLEGDHQLIQEALIFDNKH
TNYTMEHIRVGWEQLLTTIARTINEVENQILTRDAKGISQEQMNEFRASFNHFDRDHSGT
LGPEEFKACLISLGYDIGNDPQKKTGMMDTDDFRACLISMGYNMGEAEFARIMSIVDPNR
LGVVTFQAFIDFMSRETADTDTADQVMASFKILAGDKEGGKMQTAHAAFTPPGFAAVSGR
AALRLLDFAAFLTTLSSQNYITMDELRRELPPDQAEYCIARMAPYTGPDSVPGALDYMSF
STALYGESDL

>gi568815584r:68689536_68892938|GENSCAN_predicted_CDS_8|2013_bp
ctgttggagtggatccgccgcacaatcccgtggctggagaaccgggtgcccgagaacacc
atgcatgccatgcaacagaagctggaggacttccgggactaccggcgcctgcacaagccg
cccaaggtgcaggagaagtgccagctggagatcaacttcaacacgctgcagaccaagctg
cggctcagcaaccggcctgccttcatgccctctgagggcaggatggtctcggacatcaac
aatgcctggggctgcctggagcaggtggagaagggctatgaggagtggttgctgaatgag
atccggaggctggagcgactggaccacctggcagagaagttccggcagaaggcctccatc
cacgaggcctggactgacggcaaagaggccatgctgcgacagaaggactatgagaccgcc
accctctcggagatcaaggccctgctcaagaagcatgaggccttcgagagtgacctggct
gcccaccaggaccgtgtggagcagattgccgccatcgcacaggagctcaatgagctggac
tattatgactcacccagtgtcaacgcccgttgccaaaagatctgtgaccagtgggacaat
ctgggggccctaactcagaagcgaagggaagctctggagcggaccgagaaactgctggag
accattgaccagctgtacttggagtatgccaagcgggctgcacccttcaacaactggatg
gagggggccatggaggacctgcaggacaccttcattgtgcacaccattgaggagatccag
ggactgaccacagcccatgagcagttcaaggccaccctccctgatgccgacaaggagcgc
ctggccatcctgggcatccacaatgaggtgtccaagattgtccagacctaccacgtcaat
atggcgggcaccaacccctacacaaccatcacgcctcaggagatcaatggcaaatgggac
cacgtgcggcagctggtgcctcggagggaccaagctctgacggaggagcatgcccgacag
cagcacaatgagaggctacgcaagcagtttggagcccaggccaatgtcatcgggccctgg
atccagaccaagatggaggagatcgggaggatctccattgagatgcatgggaccctggag
gaccagctcagccacctgcggcagtatgagaagagcatcgtcaactacaagccaaagatt
gatcagctggagggcgaccaccagctcatccaggaggcgctcatcttcgacaacaagcac
accaactacaccatggagcacatccgtgtgggctgggagcagctgctcaccaccatcgcc
aggaccatcaatgaggtagagaaccagatcctgacccgggatgccaagggcatcagccag
gagcagatgaatgagttccgggcctccttcaaccactttgaccgggatcactccggcaca
ctgggtcccgaggagttcaaagcctgcctcatcagcttgggttatgatattggcaacgac
ccccagaagaagacaggcatgatggacacggatgatttccgcgcctgcctgatctccatg
ggttacaacatgggagaagcagaatttgcccgcatcatgagcattgtggaccccaaccgc
ctgggggtagtgacattccaggccttcattgacttcatgtcccgcgagacagccgacaca
gatacagcagaccaagtcatggcttccttcaagatcctggctggggacaaggagggaggc
aaaatgcaaacggcacatgctgccttcacgccgccaggctttgcggctgtgtcgggccgc
gccgctttacggctgctggactttgcggccttcctgaccactctctcctcgcagaactac
attaccatggacgagctgcgccgcgagctgccacccgaccaggctgagtactgcatcgcg
cggatggccccctacaccggccccgactccgtgccaggtgctctggactacatgtccttc
tccacggcgctgtacggcgagagtgacctctaa