GENSCAN 1.0 Date run: 3-Nov-116 Time: 09:29:27 Sequence gi568815582r:27998542_28281032 : 282491 bp : 47.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 19823 19898 76 1 1 83 81 40 0.917 4.05 1.02 Intr + 21559 21707 149 2 2 78 -7 155 0.762 4.95 1.03 Intr + 30445 30568 124 0 1 22 116 88 0.808 5.16 1.04 Term + 34683 34789 107 1 2 128 41 52 0.931 3.07 1.05 PlyA + 34961 34966 6 1.05 2.04 PlyA - 36477 36472 6 1.05 2.03 Term - 58452 58309 144 0 0 92 43 100 0.654 3.81 2.02 Intr - 64062 64001 62 1 2 90 -14 84 0.507 -3.15 2.01 Init - 64883 64535 349 2 1 92 97 776 0.998 74.25 2.00 Prom - 69921 69882 40 -1.86 3.00 Prom + 70995 71034 40 -10.05 3.01 Init + 71502 71588 87 2 0 91 78 88 0.941 8.91 3.02 Intr + 72371 72467 97 1 1 96 94 5 0.574 1.48 3.03 Intr + 75055 75147 93 2 0 52 100 63 0.763 3.84 3.04 Intr + 78573 78736 164 1 2 84 32 32 0.349 -3.11 3.05 Intr + 78997 79117 121 0 1 50 100 96 0.762 7.07 3.06 Intr + 82103 82189 87 0 0 78 87 64 0.651 5.24 3.07 Term + 84563 84597 35 0 2 113 48 30 0.255 -0.75 3.08 PlyA + 85281 85286 6 -0.45 4.27 PlyA - 85385 85380 6 1.05 4.26 Term - 92630 92509 122 0 2 98 53 99 0.645 6.04 4.25 Intr - 92853 92697 157 0 1 71 34 135 0.896 5.98 4.24 Intr - 100117 99752 366 1 0 71 79 139 0.184 6.64 4.23 Intr - 103147 102917 231 1 0 114 76 417 0.986 41.07 4.22 Intr - 103404 103306 99 0 0 67 115 115 0.982 12.41 4.21 Intr - 106166 106005 162 2 0 82 97 201 0.981 20.57 4.20 Intr - 107673 107502 172 2 1 95 75 236 0.999 22.95 4.19 Intr - 107956 107842 115 2 1 113 115 44 0.999 9.11 4.18 Intr - 109136 108981 156 0 0 80 91 165 0.989 15.98 4.17 Intr - 113465 113276 190 2 1 74 98 164 0.963 15.16 4.16 Intr - 114509 114363 147 2 0 142 100 131 0.999 20.13 4.15 Intr - 118921 118777 145 0 1 129 19 126 0.999 10.08 4.14 Intr - 123221 123129 93 1 0 102 75 80 0.899 7.18 4.13 Intr - 127307 127148 160 0 1 91 102 319 0.995 32.65 4.12 Intr - 131789 131671 119 1 2 85 45 76 0.670 3.11 4.11 Intr - 133651 133611 41 1 2 71 83 25 0.523 -2.68 4.10 Intr - 135392 135300 93 2 0 82 55 92 0.951 5.46 4.09 Intr - 136783 136675 109 0 1 85 69 185 0.995 16.59 4.08 Intr - 147662 147553 110 2 2 79 97 70 0.990 6.08 4.07 Intr - 154244 154118 127 1 1 83 103 62 0.958 7.88 4.06 Intr - 157230 157171 60 2 0 121 65 36 0.663 2.45 4.05 Intr - 157986 157533 454 1 1 83 80 380 0.733 29.02 4.04 Intr - 168044 167967 78 0 0 111 44 32 0.460 0.62 4.03 Intr - 171368 171209 160 2 1 72 99 163 0.602 15.36 4.02 Intr - 177554 177357 198 2 0 82 110 25 0.129 3.65 4.01 Init - 193291 193214 78 1 0 46 103 40 0.011 2.36 4.00 Prom - 200646 200607 40 -1.96 5.03 PlyA - 200654 200649 6 1.05 5.02 Term - 213469 213308 162 1 0 65 54 92 0.016 1.44 5.01 Init - 232218 232192 27 0 0 80 101 26 0.042 2.70 5.00 Prom - 236356 236317 40 -3.66 6.04 PlyA - 237167 237162 6 1.05 6.03 Term - 241342 241148 195 1 0 -4 47 343 0.663 18.41 6.02 Intr - 241542 241375 168 0 0 23 36 278 0.296 16.24 6.01 Init - 242010 241882 129 0 0 60 13 148 0.406 4.65 6.00 Prom - 247745 247706 40 -0.86 7.00 Prom + 252687 252726 40 -3.06 7.01 Init + 259754 259857 104 1 2 46 20 91 0.162 -3.76 7.02 Intr + 260563 260961 399 1 0 51 78 361 0.460 25.02 7.03 Term + 279650 279701 52 2 1 117 50 22 0.008 -1.80 7.04 PlyA + 280691 280696 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 194368 194504 137 0 2 82 93 108 0.970 8.49 S.002 Sngl - 221054 220887 168 2 0 83 43 156 0.802 3.40 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_1|151_aa MAAVAPDITSLPNHFQMQEIRGIQDETLMWRVIEVKDSMPVPKEIKIGPTVLLGIGDSVH PVKQTCKMAMQGAVFGTRRPTSADEQILNLRKAPCLTRDQELYLRPVSEYLLHCGQGNSS VTSLEKPSPTSLSEGDQLPIYNRQSRTVDRV >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_1|456_bp atggctgcggtagctccagacatcacatccttacccaaccacttccaaatgcaggaaata aggggcattcaggatgagacattgatgtggcgagtgattgaagtcaaagactcaatgcct gttcccaaagagatcaaaatcggccctactgtcttgctgggcattggcgactcagtccac ccagtgaagcagacatgcaaaatggctatgcagggtgccgtctttggcaccaggcgaccc acatcagctgatgaacagatcctcaacctacggaaagcaccttgcctaacacgggatcag gagctgtatctgaggcctgtttctgagtaccttctccactgtggtcaaggtaactcaagt gtcacctccttggagaagccttctccgacctcactatctgaaggagatcagctccccatc tacaaccgccagagtagaacagtggatagagtgtga >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_2|184_aa MKTSRRGRALLAVALNLLALLFATTAFLTTHWCQGTQRVPKPGCGQGGRANCPNSGANAT ANGTAAPAAAAAAATASGNGPPGGALYSWETGDDRFLFRNFHTGIWYSCEEELSGLDPQT REPSTPIANPRRTSQTGELFHKTTPYKSLLKCPPVCEAFLDHPTEISRLAARRHSPAPLM SPLL >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_2|555_bp atgaagactagccgccgcggccgagcgctcctggccgtggccctgaacctgctggcgctg ctgttcgccaccaccgctttcctcaccacgcactggtgccagggcacgcagcgggtcccc aagccgggctgcggccagggcgggcgcgccaactgccccaactcgggcgccaacgccacg gccaacggcaccgccgcccccgccgccgccgccgccgccgccaccgcctcggggaacggc ccccctggcggcgcgctctacagctgggagaccggcgacgaccgcttcctcttcaggaat ttccacaccggcatctggtactcgtgcgaggaggagctcagcgggcttgatccacagacc cgggagccttccacccccatcgccaacccacgccgaacctcccagacgggagaactcttc cataagacaaccccgtacaagtctctgctcaaatgtcctccagtttgtgaggccttcctg gaccaccccactgaaattagccgccttgctgcccgccggcactcccctgcacccctgatg tccccactgctctga >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_3|227_aa MRIQRRLEQAHDAAFLEGGSGDPRSVGWEGPRVGPPVLIPWTSKSVPSTLIHPPPRHENC PEDTVSSRKFGSRTVVVAQATKDTPKSRAITEVLTSSLCPLAPHGGPTRHKTAAWIPDLT SSQHLISFSDGGEANGSWRLAVDRENESPHARLGLWGDNDDLTCGSGNWYQPLVRVPQFS PQKPSPSMNEVTKSHHLEGSCEVTHTQFGISELCAVGHLSQDLGPNG >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_3|684_bp atgaggatccagcgacggctggagcaggcgcacgacgctgcgtttctggaaggcggctct ggggacccgcggtcggttggatgggaggggccaagagtgggaccgccagtcttgatcccc tggaccagtaagagcgtcccctccacactcatccatccaccaccaagacacgaaaactgc ccagaagacactgtgagttctcgcaaatttgggtccaggactgtggttgtggcacaggcc acaaaagatacacccaagtcccgggctatcacagaagtgttaacttcatccctttgtcct ctggctccacatggaggtcccacgcggcacaaaacagccgcctggattcctgacctcaca tcctcacagcacctcatctctttctcagatggtggagaagcaaacggttcctggcgtctc gctgtggacagggagaatgagtctccccatgctcggctggggctgtggggtgacaatgat gacctcacctgtggatctggcaactggtaccagccgctggtcagggtgcctcagttctcc ccacagaagccctcgccctccatgaatgaggtcacaaagagccaccacctggaaggcagc tgtgaggtcacccacacccagtttggcatctctgagctgtgtgctgttggacacttgtca caggatttagggcccaatggatag >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_4|1313_aa MLDLQHAFIVVRQFHTHSLPDVIQCQNLINKMWLGVPSQDKMEIRSCLPKLLLAHHKTLP YFIRNKLCKVIVDIGRQDWPMFYHDFFTNILQLIQSPVTTPLGLIMLKTTSEELACPRED LSVARKEELRKLLLDQVQTVLGLLTGILETVWDKHSVTAATPPPSPTSGESGDLLSNLLQ SPSSAKLLNQPIPILDVESEYICSLALECLAHLFSWIPLSASITPSLLTTIFHFARFGCD IRARKMASVNGSSQNCVSGQERGRLGVLAMSCINELMSKNCVPMEFEEYLLRMFQQTFYL LQKITKDNNAHTVKSRLEELDERSEVVQGLRLLLLRSQEQEKSYIEKFTDFLRLFVSVHL RRIESYSQFPVVEFLTLLFKYTFHQPTHEGYFSCLDIWTLFLDYLTSKIKSRLGDKEAVL NRYEDALVLLLTEVLNRIQFRYNQAQLEELDDETLDDDQQTEWQRYLRQSLEVVAKVMEL LPTHAFSTLGTFLKAYDEKLWGSPAFSSWVSSGACVNGRQPRFLSTPGAVTVYDNGMHIL ASRHRLNITAENDCRRLHCSLRDLSSLLQAVGRLAEYFIGDVFAARFNDALTVVERLVKV TLYGSQIKLYNIETAVPSVLKPDLIDVHAQSLAALQAYSHWLAQYCSEVHRQNTQQFVTL ISTTMDAITPLISTKVQDKLLLSACHLLVSLATTVRPVFLISIPAVQKVFNRITDASALR LVDKAQVLVCRALSNILLLPWPNLPENEQQWPVRSINHASLISALSRDYRNLKPSAVAPQ RKMPLDDTKLIIHQTLSVLEDIVENISGESTKSRQICYQSLQESVQVSLALFPAFIHQSD VTDEMLSFFLTLFRGLRVQMGVPFTEQIIQTFLNMFTREQLAESILHEGSTGCRVVEKFL KILQVVVQEPGQVFKPFLPSIIALCMEQVYPIIAERPSPDVKAELFELLFRTLHHNWRYF FKSTVLASVQRGIAEEQMENEPQFSAIMQAFGQSFLQPDIHLFKQNLFYLETLNTKQKLY HKKIFRTAMLFQFVNVLLQVLVHKSHDLLQEEIGIAIYNMASVDFDGFFAAFLPEFLTSC DGVDANQKSVLGRNFKMDRLCPCLPGPALIHPECAQAGQRPALLQTLQRQPAPWHCEALG LLLPGDTDFCCCHLRQPYLPPQMSPRWALVTLLGFSHRKQRCLPLPLLHILPLPSRAGFW VHLSTGRCSQGVGAGGGVCGQDLGAWEFPELMSVTPKRHILMAEGDSIPVTVPIESQGDG GPKHDSPGVTELEARGAVAALSILLPSERPAGGAVRLLAVLLVERQSLGLAAG >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_4|3942_bp atgttagatttgcagcatgcttttatagtggttcgtcagtttcatactcattcattacct gatgtcatccaatgccagaatctgatcaataaaatgtggcttggggtcccatctcaggat aagatggaaatccgtagctgtctgcccaaactccttttggctcaccataaaaccttacct tactttatccggaacaagctctgcaaagttattgttgatattggacgtcaggattggccc atgttctaccacgacttttttactaacattttacagttgatccagtcccctgtgacaacc ccccttgggctgatcatgttgaagacaacttcagaagagctggcttgtccccgtgaggac ctcagtgtggctcggaaggaggagttgcggaagctgctactggaccaggtgcagacagtg cttgggctactgacaggtatcttggagactgtctgggacaaacacagtgttactgctgcc actccaccaccatccccgacctcaggagaaagtggtgacttactgagtaacctgttgcag agtcccagttcagccaaactgttgaatcagccaattcccatccttgatgtggagagtgag tatatctgttccctggctttggagtgcctggcccatctcttcagttggattcctctgtct gccagcatcaccccatccctccttaccaccatcttccactttgcacgatttggctgtgac atccgggccagaaagatggcgtcagttaacggcagcagccagaactgtgtctcgggtcag gagcgcggccggctgggggtcctggccatgtcctgcatcaatgaactcatgtccaagaac tgtgtgcctatggaatttgaggagtatttactgcgtatgttccagcagactttctacctc ctgcagaaaatcaccaaggataacaatgcccacacagtgaagagcaggctagaagagctc gatgagaggtctgaggtggtgcagggactgagactgctgctcctgcgtagccaagaacag gagaagagctatatcgagaagtttactgactttcttcggctctttgtgagtgttcaccta agaagaatcgagtcttactcccagttccctgtggtggagtttttgacacttttgttcaag tacacatttcatcagcctactcatgaaggttacttctcttgtttggatatctggacgctg tttttggactatctgacaagtaaaattaaaagtcgtcttggagacaaggaagcagttctc aacaggtacgaagatgccctggtgctcctgctcacagaggtgttgaatcgaatccagttc agatacaaccaagcccagctggaggagttggatgatgagactctggatgacgatcagcag acggagtggcagcggtacttacggcagagcttggaggtggtggccaaagtgatggagctc ctgcccacgcacgccttctccacactgggaacgtttttaaaggcttatgatgaaaagctg tgggggagcccagcctttagttcttgggtctcctctggagcctgtgtaaatgggcgccag ccgcgattcttgtctactcctggtgctgtgaccgtgtatgacaatgggatgcacattttg gccagcagacacaggttgaacatcacggcggagaacgactgccggcggctgcactgctcc ctgagagacttgagctccctgctgcaggccgtgggccgcctggccgagtactttatcggg gatgtgtttgctgcacggttcaatgatgccctcacagtcgtggaaaggttggtcaaagtc actctgtacggatctcagataaaattgtacaacattgaaactgctgtgccatcagtattg aaacctgacctcattgatgtgcatgctcagtccctggctgcgctgcaggcttactctcac tggttagcacagtattgcagtgaagttcaccggcagaacacgcagcagttcgtgacactc atctctactaccatggatgcaatcacacctctaatcagcaccaaggtccaagacaagctg ctgctatctgcgtgccacttactggtctcactggccaccaccgtgcggcccgtctttctg atcagcatccctgcagtgcagaaagtattcaacagaatcactgatgcctctgccctgcga cttgtcgataaggcccaggtgttggtgtgccgagccctctctaacatcttgctgcttccg tggccaaaccttccagagaatgagcagcagtggcccgtgcgctccatcaaccacgccagc ctcatctctgcactctcccgggactatcgcaacctgaagcccagtgctgttgccccacag agaaagatgccactggatgacaccaaactgattatccaccagacactcagcgtcttagaa gatattgtggagaatatctcgggggagtccaccaagtctcgacagatttgctaccagtcg ctgcaggaatctgttcaggtctccctggccctctttccagcttttatccatcagtcagat gtgactgatgagatgctgagcttcttcctcactctgtttcgaggccttagagtacagatg ggtgtgcctttcactgagcaaatcatacagactttcctcaacatgtttaccagagagcag ttagccgagagcatcctccacgagggcagcacaggctgccgggtggtggagaagtttctg aagatcctgcaggtggtggtccaggagccaggccaggtgttcaagcccttcctccccagc atcatcgccctgtgcatggagcaagtgtatcccatcattgccgagcgtccctcccctgat gtgaaggccgagctgtttgagctccttttccggacgctccatcacaactggaggtacttc ttcaagtccaccgtgctggccagtgtccagagggggatcgctgaggagcagatggagaat gagccccagttcagtgccatcatgcaggctttcggacagtcctttctccagcccgacatc cacctttttaaacaaaatctcttctacttggagactctcaacaccaagcagaagctgtac cacaagaagatcttccggactgccatgctgttccagtttgtgaacgtgctgctccaggtc ctggtccacaagtcccatgatcttctgcaggaggagattggcatcgccatctacaacatg gcctcagtcgactttgatggcttctttgccgccttcctcccagagttcctgaccagctgt gatggtgtggatgccaaccagaaaagtgtgctggggcggaatttcaagatggatcggcta tgcccctgccttccaggacctgccctcattcacccagaatgtgcacaggctggtcaacga cctgcgctactacagactctgcaacgacagcctgccccctggcactgtgaagctctaggc ctgctactgcctggggacacggacttctgctgctgccacctgcgccagccctaccttcca ccacagatgtctcccagatgggccttggtcacactccttggcttctcccaccgcaagcaa cgctgcctgcctctgccgctcctccacatcttgccgctgcccagcagagctggcttctgg gtccacctgagcactggacggtgctcccagggcgttggagcaggcggaggggtgtgtggc caggatttaggggcctgggaattccctgagctcatgtcagttactcctaaacggcacatt ctgatggcggaaggagacagcatcccggtcactgttcccattgagtcacaaggagatgga ggcccaaagcatgactcacctggggtcacagagctggaagccaggggcgccgtggctgct ctgtccattttactgcctagtgaaaggcctgcggggggcgctgtcaggctccttgcggtg ctgctggtggagcgccagagcctgggcctggctgcgggctga >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_5|62_aa MGLQTENKMRRSPAAALWGRPALATSGALGLLVGGGRAVLAGGSRAGARKDALGGGGSGR VG >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_5|189_bp atggggctacagactgaaaataagatgcgccgcagcccggcggccgcgctgtggggccgc ccggcgctcgccacttccggcgcgttggggctgttggtcgggggcggccgcgcggtacta gcgggcggctccagggcgggcgcgcgcaaggatgctctagggggcggcggcagtggccgt gtgggttga >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_6|163_aa MVYMFRYDCIRSKFHGTVKAENGEIPSSSSRSKIPPKSNEEMLNIIPASTGAAKAAGKVI PELNGKLTCMAFSVPAANVSVVDLTCRLEKPVEYDDIKKGIWGYTEHQVVSSDFNRDTHS STFDAGAGIALNDHFVKLVSWYENEFGYSNRVADLMAHMASKE >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_6|492_bp atggtctacatgttccggtatgattgcatccgtagcaaattccatggcactgtcaaggct gagaacggggaaatcccatcatcatcttccagaagcaagatccctccaaaatcaaatgag gagatgctgaacatcatccctgcctctactggtgctgccaaggctgcgggtaaggtcatc cctgagctgaatgggaagctcacttgcatggccttcagtgtccccgctgccaacgtgtcg gtcgtggacctgacctgccgcctggaaaaacctgtcgaatatgacgacatcaagaagggc atctggggctacactgaacaccaggttgtctcctccgacttcaaccgtgacactcattct tccaccttcgacgctggggctggcattgccctcaacgaccactttgtcaagctcgtttcc tggtatgagaatgaatttggctacagcaacagggtggcggacctcatggcccacatggcc tccaaggagtaa >gi568815582r:27998542_28281032|GENSCAN_predicted_peptide_7|184_aa MLPEALSPFYPRGAALSSQCPFPGPPNFLGVHVEGHLGIQVAGHRFARPSPVAGTPTGQD RSPRQTEPPAPPPTEATGNLSEKAQVAGGQVQSPEADGLLTLERPGSGTPAQAGDDAAEA TPGHPCPVLELPPAWPMGCGVDDVPAFCFVCFHREEEEELLEEVPLRSSSAQPSPGHIRS RERD >gi568815582r:27998542_28281032|GENSCAN_predicted_CDS_7|555_bp atgctccctgaagccctgagccccttctatcccagaggagctgccctgagctcccagtgc cccttcccagggccccccaacttcctgggggtacatgtggagggccacctcggaatccag gtggctggacaccgcttcgcacggcccagccccgtggctggaacccccacgggtcaagac aggagcccccgccagacagagcccccggctcctccacccaccgaggcgacgggcaacctg agtgagaaggcacaggtggcgggcgggcaagtgcagagccctgaagccgacgggctgctg accctggagcgccctggctcggggactcctgcccaggctggcgatgatgctgcggaggcc acccctggccacccctgccctgtcctggagctgcctccggcctggcccatgggctgcgga gtcgatgatgtgccggccttctgcttcgtctgcttccacagggaggaggaagaggagctg ctggaagaagtcccattgcggagctcctccgcccagccctccccaggccacatccggtcc cgggagagagactga