GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:40:56 Sequence gi568815593r:42939468_43140046 : 200579 bp : 44.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11583 11898 316 1 1 39 -28 252 0.490 3.82 1.02 Term + 11948 12416 469 2 1 15 43 361 0.925 18.75 1.03 PlyA + 14235 14240 6 1.05 2.00 Prom + 31380 31419 40 -0.86 2.01 Init + 31474 32101 628 0 1 71 5 341 0.443 19.62 2.02 Term + 33134 33243 110 0 2 88 46 78 0.277 2.27 2.03 PlyA + 33595 33600 6 1.05 3.06 PlyA - 33790 33785 6 1.05 3.05 Term - 34297 34105 193 0 1 65 39 113 0.057 0.99 3.04 Intr - 46195 46139 57 0 0 69 113 36 0.074 2.20 3.03 Intr - 52766 52461 306 1 0 11 -21 262 0.103 3.16 3.02 Intr - 53123 52859 265 0 1 8 53 298 0.591 14.87 3.01 Init - 61127 60920 208 2 1 42 91 83 0.232 1.01 3.00 Prom - 65273 65234 40 -5.86 4.03 PlyA - 65452 65447 6 1.05 4.02 Term - 69317 69206 112 1 1 35 47 176 0.990 6.23 4.01 Init - 70737 70562 176 0 2 93 98 54 0.839 5.74 4.00 Prom - 72046 72007 40 -3.36 5.03 PlyA - 72475 72470 6 1.05 5.02 Term - 79094 78776 319 1 1 6 46 456 0.841 27.55 5.01 Init - 80175 80069 107 0 2 37 101 79 0.998 3.79 5.00 Prom - 95981 95942 40 -3.36 6.00 Prom + 96730 96769 40 -4.66 6.01 Init + 100133 100434 302 1 2 80 81 179 0.903 11.34 6.02 Term + 102519 103278 760 0 1 33 49 483 0.553 31.80 6.03 PlyA + 104191 104196 6 1.05 7.00 Prom + 110207 110246 40 -0.76 7.01 Init + 110826 110871 46 2 1 59 48 37 0.243 -2.36 7.02 Intr + 111092 111156 65 0 2 102 94 -2 0.436 0.24 7.03 Term + 114767 115099 333 1 0 52 54 362 0.955 23.81 7.04 PlyA + 115934 115939 6 1.05 8.00 Prom + 116111 116150 40 -4.96 8.01 Sngl + 116858 117232 375 1 0 32 49 173 0.672 3.94 8.02 PlyA + 117410 117415 6 1.05 9.00 Prom + 120635 120674 40 -2.46 9.01 Init + 120759 120891 133 2 1 78 47 103 0.274 5.50 9.02 Intr + 124229 124415 187 0 1 101 34 71 0.267 1.75 9.03 Intr + 127001 127457 457 2 1 14 59 430 0.201 25.92 9.04 Term + 127846 128118 273 0 0 70 48 69 0.160 -3.43 9.05 PlyA + 128902 128907 6 1.05 10.03 PlyA - 130832 130827 6 1.05 10.02 Term - 134075 133595 481 1 1 24 45 226 0.374 6.17 10.01 Init - 151256 151195 62 2 2 95 29 77 0.126 1.23 10.00 Prom - 153369 153330 40 -2.66 11.00 Prom + 155760 155799 40 -4.46 11.01 Init + 156396 156478 83 2 2 80 86 44 0.071 1.97 11.02 Intr + 182016 182061 46 0 1 8 105 89 0.020 1.01 11.03 Intr + 182572 182710 139 0 1 74 76 201 0.437 17.54 11.04 Intr + 183742 183843 102 2 0 91 94 20 0.358 3.05 11.05 Intr + 199698 199842 145 1 1 104 80 86 0.580 8.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 94833 94967 135 2 0 82 48 96 0.853 3.02 S.002 Sngl + 120042 120452 411 2 0 84 47 157 0.818 7.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_1|261_aa XTPAAQPPDWVLEAEPQSVRRRWEEKAPRQLPCHRPRGAAGSPSPAPPPPGNSRRNWAQL GQLRHRIPHICNLGAAATEKMEAQTNERCRDRETSAGEMVVPGACDQWRRHQETIDSEKK ERFGHRETPYDKCCGRKTAYSAPQAKQPMETPGAATVTPALEGLGWRKRRTETRKLCVAQ LAENRLVGHPGESADQKQATSMQGTAQPLKKVTHVAVVPAKHCSLDGSCGDATEKHALPL KYIQLRGASVHMIKKPPVGLC >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_1|786_bp naaaccccagcagcgcagccaccggactgggttctggaggccgagccgcagtccgtgcgg cggcgctgggaagagaaggcgccccggcagctcccctgccaccggccccgaggagcggct ggctcccccagcccagcgccgccgccgcccggtaactccaggcgcaactgggcgcaactg gggcagctgcgacaccgaatccctcacatctgcaacctgggtgctgcggccactgagaaa atggaggcgcagaccaacgagcggtgccgcgaccgagagacctcggctggcgaaatggtg gtgccgggagcctgcgaccaatggcgccgtcaccaagaaaccatcgactctgagaaaaaa gagaggttcggccaccgagaaactccgtacgacaagtgctgtggcagaaaaaccgcctac tccgcgccacaggcaaaacagccaatggaaaccccaggtgctgcgaccgtgacaccggca ctagagggtctcggatggagaaagcggcgcacggagaccaggaaactatgtgtagcacaa ctagcagaaaaccgtctggtcggccatccgggagaaagcgcggatcagaaacaagcgact tcgatgcagggaaccgcgcagccactgaagaaagtgacccacgtggcagtggtgccagcg aaacactgcagtttggacggcagctgtggggatgccacagagaaacatgcactgccactg aagtacatccagctccgcggagctagtgttcatatgatcaagaaaccgccagttgggctc tgctag >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_2|245_aa MEFHPSTLEELSWRQAGPEPHIPRDLNREQAWAWALDNFPVLSRHSCQCHQLYGSSAGSW RLHRASTGHGALPGDERGQKRPRDSHVEESKAETTDGLQSPQEDANATDGEVQSCLMQPG DASACGGGRTQAHGALCSSGRPRKLLKTQGSIRKEPGNERNPSLSASLRHYPLQGTLCWK EPEAHQWGPSFCPCGLLHWNCALHPQEPGDISRTVCSIKIWASEHVFSTRGKLLQQLQCR NIQAL >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_2|738_bp atggagttccatccatcaaccttggaagagctgtcttggaggcaagcaggtccagagccc catataccacgggacctgaaccgagagcaagcctgggcctgggcactggataacttccct gtgctgagccgccactcctgccaatgccatcagctctacgggagctctgctgggagctgg cggctgcaccgcgcctccaccgggcacggagcccttcccggagatgagaggggccaaaaa cggcctcgagactcccatgtcgaggaatcaaaggctgaaaccacagacgggctccagtca cctcaagaagacgcaaatgctaccgacggcgaagtccaaagctgcctgatgcagcctgga gatgccagcgcctgcggaggtggaaggacacaagcccatggcgccctctgcagttcaggc cgcccccggaagctgctgaaaacccagggttccatcaggaaggaacccggaaacgagagg aaccccagcctttctgcaagcctgcgacactacccgcttcaaggaacactgtgctggaag gagcctgaggcacaccagtggggaccctcattctgtccctgcggacttttacactggaac tgtgccctgcacccacaagagcctggagacatttctcggactgtctgcagcatcaagata tgggcttcagagcacgtttttagcacccgtgggaaactgttacaacagctgcaatgcaga aatatccaagccctgtga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_3|342_aa MQPPALPRAPAAPPELPAPDPEQATLRPASRYSEAWERDLFAGSFSKEARRERKVCGSGN LGRPRRARSENGGIEQRAVLRPRDLSSAKCRCLELANDCGRRVVKDNIPYGATNSAVTKK LDSEKKEGFGHRETPCHMCRDSKTARSAPLVKELTNASAVATGKLSGQCCCGCEKPVIRR RSGAGRWTRWCENLAALGPGKRGHQANASDFARQMHVFFGRHSKMQPAVSPGSEKQVEVS AINPPFREKRVAATDRLAVGSCMPVKLLMSAVLQDPMRRNIDRTGGHYPYQTNAGTEKQI PYVLTYKLELNNENTWTQRGEYQTLEPLKVDGERRERIGKNN >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_3|1029_bp atgcagcctcctgctctcccgagggctcctgcggctcctcctgaattgcctgcgccggat ccagagcaggcgactctgcgccccgcctccaggtactctgaggcttgggaaagagatctt ttcgccggcagtttcagcaaggaagcacgcagggagcggaaagtatgtggcagtggaaat ctaggtcggcccaggagagctcgcagtgaaaatggaggcatcgagcaacgagctgtgctg agaccgagagacctcagttctgcgaaatgtcggtgcctggagcttgcgaatgactgcggc cggcgggttgtcaaggacaacattccttatggcgcaaccaacagcgctgtcaccaagaaa ctggactctgagaaaaaagagggtttcggccaccgagaaactccgtgccacatgtgccgt gacagcaaaaccgcccgctccgcgccgctggtgaaagagctaacgaacgccagcgctgtc gcaaccggaaaactgtcaggccagtgctgctgcggatgtgagaaaccggtgatccgaagg cggagtggcgctgggcgctggactcgctggtgtgaaaatctggctgctctaggacccggc aagcgcgggcaccaggcgaacgccagcgactttgccagacaaatgcatgtcttttttggt cgccattcaaagatgcagccggcagtcagccccggctctgagaaacaagtggaggtgtca gccataaaccctccgttcagggaaaagagagtcgctgcgacggatcgactggccgttggg tcctgcatgccagtaaaactactcatgtcagctgttctgcaggatcccatgagaaggaac atagacagaactggaggccattatccttaccaaactaatgcaggaacagaaaagcaaata ccgtatgtcctcacttataagctggagctgaataatgagaacacatggacacaaagaggg gaataccagacactggagccattgaaggtggatggtgaaaggagggagaggattgggaaa aataactaa >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_4|95_aa MAQLRSEADVASEGLADGCRDHRGLDFEMSRTKCCPFSAALPVSAGTVGAAVQPNFDFRV LFRGSVPMSSQQFRISTFANSGDSVFSRLLHRWKR >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_4|288_bp atggctcagttgagaagtgaggcagatgtggcctcagagggtctggcggatggctgccgg gatcaccgggggctggactttgagatgtccaggacgaagtgttgccctttcagtgctgcc ctccctgtgtctgctggaactgttggagccgcagtgcagcctaattttgacttcagggtt ctcttccgcggctctgtgcctatgtcgagccagcagtttcgcatcagcactttcgccaat agcggagactcagtgttttctcggctgctgcaccgctggaagcgctga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_5|141_aa MQNHQLRIRRRATRHNLAPDSRETETELGSPDPERIPQIGDLGAVGTEKIQAPSQKQCRN QETSAWRNGGVWSLRVTPAGGAVKENIRYGATNGTVTKKPSSLRKKETFGRRETPYKCCD RKTAGSAQRQKSQRERRVLRP >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_5|426_bp atgcaaaaccatcagctccgcatacgaaggagagccaccagacacaacctggcacctgac agcagggagactgagaccgagctcggctccccggatcctgagagaatccctcagatcggc gacctgggcgctgtaggcaccgagaaaatacaggcaccgagccagaagcagtgccgcaac caagagacctcggcttggcgaaatggcggcgtctggagcctgcgagtgacgccagctggt ggggctgtcaaggagaacattcgttacggcgcaaccaacggtactgtcaccaagaaaccg tcgtctctgagaaaaaaagagacgttcggccgccgagaaactccgtacaagtgctgtgac agaaaaaccgccggctccgcgcagcggcaaaagagccaacgggaacgccgggtgctgcga ccatga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_6|353_aa MAAFQKAPAVTDLGVLLSVPGMERTAPGKLLEPESVGFLCLPHLFYGCLLLLLCPRPTEL SGLRCTWFQGALDSRRESVLPVDSGQPPARAGKQSQITAVQGWKSQAPRNRNLLGLRSQR RAKPWGPATETPAAPPQDCVLEAEPEPVRRRWEETVPLQLPCHRLQGASGSPRPALQHPS GNARLGATGAAATLNLSDPRPEGCRHREMEAQSCEQETTARRNGGARSLKGNAAGGVVKD NICFGATSGAVTKKPSTLRKKEKFGYRETPCSKCCDSKTAGFAPLAKEPTETPGAATAKP ALEGLGWRKPRTETRKLCTARLAVDTNRKVSRLLSGRKRGSKPATQWREPRSH >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_6|1062_bp atggcggcgttccaaaaggcacccgctgtcacagacctcggtgtcctgctgtctgtccca gggatggagaggactgctccagggaagctgctggagcctgagtctgtcgggttcctctgc ctgccccacctcttctacgggtgcctcttgctgcttctgtgtccccggccaactgaactc agtgggcttcgctgtacttggttccaaggtgctctggactccaggagagagtccgttttg ccagtagactccgggcagccgccagcaagggctggaaagcagtcccaaatcacagctgtc caagggtggaaatcccaggcaccgcgaaatcgaaacttgctaggactaagaagccaacga cgcgcgaaaccgtggggtcctgcgacagagacgccggcggcgccgccacaggactgcgtt ctggaggccgagccggaacccgtgcggcggcgctgggaagagactgtgcccctgcagctc ccctgtcaccggctccaaggagcgtcgggctccccccgcccagccctgcagcacccatcc ggcaacgccagactcggcgcaacgggggcagctgcgactttaaatctctcagatccgcgg cctgagggctgccgccaccgagaaatggaggcacagagctgtgaacaagagaccacggct cgccgaaatggcggtgccaggagcctgaaagggaatgcagccggcggggttgtcaaggac aacatttgttttggcgcaaccagcggtgccgtcaccaagaaaccgtcgactctgagaaaa aaagagaagttcggctaccgagaaactccgtgcagcaagtgctgtgacagcaaaaccgcc ggcttcgcgccgctggcgaaagagccaacggaaacgccgggtgctgcgaccgcgaagccg gcactagagggcctcggatggagaaagccccgcaccgagacgaggaaactgtgcacagca cgactagcagttgacacaaacagaaaagtgtctcgtctgctctctgggagaaagcgtgga tcaaaaccagcaactcagtggagagaaccccgcagccactga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_7|147_aa MGKNENCFYCFKRKEGTETTHSDLRNWSGLKGFLALQNHSSSPSTEQSWTENNFDEVTEV GFRRSVITNFSELKEDVQTHRKEAKNLEKRLDEWLTGINSIEKTLNDLMELKTMAQVLRD ACTSFSSRFDQVEESVSVIEDQKNEMK >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_7|444_bp atggggaagaatgagaactgcttttactgcttcaaaagaaaagagggaacagaaacaaca cactcagacttgagaaactggtctggtttaaaagggttcctggccttacagaatcacagc tcctcgccatcaacggaacaaagctggacagagaataactttgacgaggtgacagaagta ggcttcagaagatcggtaataacaaacttctccgagctaaaggaggatgttcaaactcat cgcaaagaagctaaaaaccttgaaaaaagattagacgaatggctaactggaataaacagc atagagaagaccttaaatgacctgatggagctgaaaaccatggcacaagtactacgtgat gcatgcacaagcttcagtagccgatttgatcaagtggaagaaagcgtatcagtgattgaa gatcaaaagaatgaaatgaagtga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_8|124_aa MVSRSWFLEKINKIDRPLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLY VNKLENLKEMDKFLDTSTLPRLNQEEVESLNRPITGSEIEAIINSLPTKKSPGPDGFTAK FCQR >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_8|375_bp atggtatccaggagctggtttttggaaaagatcaacaaaattgatagaccactagcaaga ctaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggat atcaccaccgatcccacggaaatacaaactaccatcagagaatactataaacacctctat gtaaataaactagaaaacctaaaagaaatggataaattcctggacacatccaccctccca agactaaaccaggaagaagttgaatccctgaatagaccaataacaggctctgaaattgag gcaataattaatagcctaccaaccaaaaaaagtccaggaccagatggattcacagccaaa ttctgtcagcggtag >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_9|349_aa MEYYAAIKKDEFMSFAGTWMKLETIILSKLPQGQKTKHYTLSLIGVCSFWPAAEPREAAW DGVGKHPASPRPMKSRYGTTSSARTQKLAKLFRRSCGLDPPAVLPRSSSASGSEERRAPP PSAGVAREPQTLRNWLRFQIPQIGYQSAAATEKMEAPRNKMCRDQENSAWRNGGAGTLRV TPAGGVPKDNIHFGGTNGVVTQKMSTLGKERFCDREIRCNKCCDRKPPARRCWRKSQQNA GCCHRDTDFRTPASDGERRRSHTEGPINRSLAMRMGKGGSIGTENRADWAGPGNKSHPGR QPSSTARNKPSSGWAPPFSLVRALQAPKGEADTSSLWNLKFSVQDLRGE >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_9|1050_bp atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacatggatg aaactggaaaccatcattctgagcaaactaccacaaggacagaaaaccaaacactacacg ctctcactcataggagtctgttccttttggccggctgctgaacctagggaggcagcgtgg gatggagttggcaaacatcctgcaagtccacgacccatgaagagccgctatggtaccact agcagtgctaggacccagaaactagcgaaactgtttaggagaagctgcgggctggaccca ccagcggtgctgccacggagctcctctgcctctggctcggaggagcggcgggctcccccg cccagcgccggcgtcgcccgggaaccccagactctgcgcaactggctgcgattccaaatc cctcagatcggctaccagagcgctgccgccaccgagaaaatggaggcaccgaggaacaag atgtgccgcgaccaagagaactcggcttggcgaaatggcggtgctgggaccctaagagtg acgccagccggaggagttcccaaggacaacatccattttggcggaaccaacggtgttgtc acgcagaaaatgtcgaccctcggaaaagagaggttctgcgaccgagaaattcggtgcaac aagtgctgtgacagaaaaccccccgctcggcgctgctggcgaaagagccaacaaaacgct gggtgctgccaccgtgacaccgactttaggaccccggcctcggatggagaaaggaggcga agtcatactgagggccccatcaaccgatcccttgcgatgcgcatgggaaaaggaggcagt atagggaccgagaatcgggcggattgggctggccctggtaacaaaagccatccgggtcgc caaccgtcttccactgcaagaaacaaaccttccagtggctgggctccgcccttctccttg gtcagagccctccaggctcctaagggcgaagccgacacctcatccctttggaacttgaag ttctcagtccaagacctccgtggagagtga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_10|180_aa MTCASLSLVQLCLLAMLREIRPNQPRRNHYFKPLVHVPIHPDIRRKLQKLDDGSQNPQRD LLNLAFKVFNNRDEENKRQKHAEFQMLDSAIRGPAGPQGRSSTQKPPSNPPPPGACFKYG NEGHWSRQCPNPGKPIRPCPLCRGPHWKLDCERPPQGPLPSLPELAKTSYSDLTGLATED >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_10|543_bp atgacctgtgcctcactttccctagtccagctgtgcctcctggccatgctgagagaaatc agacccaaccagcccagaaggaaccactattttaaacctttagttcatgtcccaatccac cccgatattcggcgtaagctacagaagcttgatgatggctctcaaaacccacaacgagac cttcttaatttagccttcaaagtctttaacaatcgtgatgaggaaaataaaaggcaaaaa catgcagagtttcaaatgcttgactccgccatccggggccctgcaggcccacagggccgc agctccacacagaagcctcctagcaatccacctccacctggcgcatgtttcaagtatggc aatgaaggccactggtctagacaatgcccaaacccaggtaagcccatcaggccatgcccc ctctgcagaggaccccactggaagttggactgtgagcggcccccacaaggaccactccca tcccttcctgagctggccaaaacctcctactcggatctcactggccttgccactgaagac tga >gi568815593r:42939468_43140046|GENSCAN_predicted_peptide_11|172_aa MHLSWPRPGHQLLCSLSFLSSQTRNMHGRPRQPELDAVTVCVDSSPTAMEAEETMECLQE FPEHHKMILDRLNEQREQDRFTDITLIVDGHHFKAHKAVLAACSKFFYKFFQEFTQEPLV EIEGVSKMAFRHLIEFTYTAKLMIQGEEEANDVWKAAEFLQMLEAIKALEVS >gi568815593r:42939468_43140046|GENSCAN_predicted_CDS_11|516_bp atgcacctgtcctggcccaggcccggccaccagctcctctgctccctgtccttcctcagc agccaaaccagaaacatgcatgggcgtccgcgtcagccggagctcgacgcggtgacggtg tgcgtcgacagcagcccgacggccatggaggctgaagagacgatggaatgccttcaggag ttccctgaacatcataaaatgatcctcgaccgattgaatgaacagcgagagcaggaccgg tttactgacatcaccctaattgtcgacggacaccattttaaggctcacaaggctgttttg gctgcttgtagtaagttcttctacaaattctttcaggagtttacccaagaaccattggtg gagatagaaggtgttagtaaaatggcctttcgccatttaattgagttcacatatacagca aaattaatgatacaaggagaagaagaagccaatgatgtatggaaagcagcagagtttcta caaatgctagaagctatcaaagcccttgaagtcagn