GENSCAN 1.0    Date run: 4-Nov-116    Time: 10:09:03

Sequence gi568815583f:48231516_48442075 : 210560 bp : 40.30% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1212   1323  112  2  1  101   83    60 0.962   5.83
 1.02 Intr +   3362   3489  128  0  2   76   94    59 0.583   4.78
 1.03 Intr +  10000  10084   85  0  1   62   97    56 0.316   2.37
 1.04 Intr +  10704  10847  144  1  0   95   50    48 0.606   1.03
 1.05 Term +  13238  13395  158  0  2   57   48   198 0.563   9.81
 1.06 PlyA +  14961  14966    6                               1.05

 2.00 Prom +  19530  19569   40                              -4.05
 2.01 Init +  20171  20255   85  1  1   63  110    95 0.906  10.37
 2.02 Intr +  24296  24395  100  0  1  102  103    89 0.998  10.05
 2.03 Intr +  27685  27796  112  1  1  116   83    80 0.768   9.76
 2.04 Intr +  36046  36186  141  0  0   76  110    48 0.445   5.43
 2.05 Intr +  38143  38249  107  0  2   67   98   103 0.952   7.29
 2.06 Intr +  43056  43138   83  0  2  121   95    77 0.887  10.06
 2.07 Intr +  53591  53734  144  0  0   50  116   217 0.988  20.03
 2.08 Intr +  56528  56659  132  0  0   70   83   137 0.565  11.10
 2.09 Intr +  56890  57001  112  2  1   79   75    57 0.724   2.02
 2.10 Intr +  60263  60349   87  2  0   80   64    57 0.344   0.67
 2.11 Intr +  67625  67760  136  2  1   39  116   167 0.730  14.25
 2.12 Intr +  69800  69867   68  1  2   63   95    82 0.731   3.18
 2.13 Intr +  82026  82165  140  0  2   57  109   107 0.897   8.89
 2.14 Term +  83521  83801  281  2  2    7   43   247 0.384   6.72
 2.15 PlyA +  84652  84657    6                               1.05

 3.00 Prom +  95577  95616   40                              -1.65
 3.01 Init + 100806 100891   86  2  2   90   99   138 0.980  13.45
 3.02 Intr + 102902 102993   92  2  2   59  106    97 0.997   7.32
 3.03 Intr + 104531 104575   45  0  0  133   75    37 0.950   4.36
 3.04 Intr + 109774 109848   75  2  0   90   95    83 0.990   7.77
 3.05 Intr + 110000 110070   71  0  2   37  108    77 0.966   2.58
 3.06 Term + 110507 110563   57  1  0  103   38    77 0.973   0.91
 3.07 PlyA + 111022 111027    6                               1.05

 4.08 PlyA - 111147 111142    6                               1.05
 4.07 Term - 115872 115749  124  2  1   28   47   130 0.359  -0.22
 4.06 Intr - 116357 116277   81  1  0   77   86    42 0.155   0.73
 4.05 Intr - 135326 135140  187  0  1   37   52   166 0.181   5.73
 4.04 Intr - 138182 137985  198  0  0   60   38   122 0.317   2.80
 4.03 Intr - 139840 139756   85  1  1   35   64    49 0.216  -4.33
 4.02 Intr - 141390 141217  174  0  0   33   47   152 0.576   4.81
 4.01 Init - 142764 142714   51  0  0   34   59   124 0.714   5.31
 4.00 Prom - 142924 142885   40                              -8.25

 5.00 Prom + 144380 144419   40                              -7.15
 5.01 Sngl + 145354 146016  663  0  0  100   32   341 0.877  25.62
 5.02 PlyA + 147173 147178    6                               1.05

 6.03 PlyA - 149088 149083    6                               1.05
 6.02 Term - 161414 161294  121  1  1   44   38   188 0.503   6.37
 6.01 Init - 167532 167477   56  0  2   61  116    50 0.404   5.91
 6.00 Prom - 174456 174417   40                              -4.75

 7.04 PlyA - 176207 176202    6                               1.05
 7.03 Term - 179864 179475  390  2  0  102   31   319 0.710  21.60
 7.02 Intr - 181228 181054  175  0  1   85   90   151 0.996  14.02
 7.01 Init - 184277 184021  257  2  2   67   98   307 0.534  26.35
 7.00 Prom - 186612 186573   40                              -3.35

 8.15 PlyA - 186861 186856    6                               1.05
 8.14 Term - 189291 189164  128  1  2   76   32   171 0.739   7.76
 8.13 Intr - 190171 190043  129  2  0   93   98    35 0.705   4.75
 8.12 Intr - 190553 190437  117  0  0  102   97    52 0.726   7.02
 8.11 Intr - 193976 193854  123  0  0   67   98    82 0.518   6.74
 8.10 Intr - 194349 194224  126  1  0   72   97    91 0.992   8.13
 8.09 Intr - 196258 196052  207  2  0   92   80   151 0.993  12.83
 8.08 Intr - 196956 196831  126  1  0   20   97   160 0.990   9.83
 8.07 Intr - 199287 199156  132  1  0   69   82   144 0.988  11.60
 8.06 Intr - 201473 201351  123  0  0  107  113   139 0.985  17.94
 8.05 Intr - 203198 203079  120  0  0   82   83   163 0.997  14.75
 8.04 Intr - 205562 205446  117  0  0   75   95   150 0.998  13.92
 8.03 Intr - 205872 205807   66  1  0   43   57   153 0.966   5.66
 8.02 Intr - 206402 206253  150  0  0   33   80   101 0.869   3.01
 8.01 Intr - 210331 210206  126  2  0   70  115    76 0.981   8.23

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_1|208_aa
AQVILLVILLIAIANFFIGTVIPSNNEKKSRGFFNYQASIFAENFGPRFTKGEGFFSVFA
IFFPAATGILAGANISGDLEDPQDAIPRGTMLAIFITTVAYLGVAICVATPPPGWGLPHT
FSIVGKEEVSQHLEKKTIFYVQQGKQLQAGPPWKPKGACVVRDATGNMNDTIISGMNCNG
SAACGLGYDFSRCRHEPCQYGLMNNFQV

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_1|627_bp
gcccaagtcattcttctggtcattcttctaattgctattgcaaacttcttcattggaact
gtcattccatccaacaatgagaaaaagtccagaggtttctttaattaccaagcatcaata
tttgcagaaaactttgggccacgcttcacaaagggtgaaggcttcttctctgtctttgcc
atttttttcccagcagctactgggattcttgctggtgccaatatctcaggagatttggag
gatccccaagatgccatccccagaggaaccatgctggccattttcatcaccactgttgcc
tacttaggggttgcaatttgtgtagccacacctcctcctggttgggggttgcctcatact
ttttccatagtaggaaaggaagaggtttcccaacatttagagaagaaaaccatcttctat
gtacaacagggaaaacagctccaggctggaccaccatggaagccaaagggggcctgtgtg
gtccgagatgccaccgggaacatgaatgacaccatcatttctgggatgaactgcaatggt
tcagcagcatgtgggttgggctatgacttctcaagatgtcgacatgaaccatgtcagtac
gggctgatgaacaatttccaggtttga

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_2|575_aa
MFVINWWAAVITYVIEFFLYVYVTCKKPDVNWGSSTQALSYVSALDNALELTTVEDHVKN
FRPQCIVLTGGPMTRPALLDITHAFTKNSGLCICCEVFVGPRKLCVKEMNSGMAKKQAWL
IKNKIKAFYAAVAADCFRDGVRSLLQASGLGRMKPNTLVIGYKKNWRKAPLTEIENYVGI
IHDAFDFEIGVVIVRISQGFDISQVLQVQEELERLEQERLALEATIKDNECEEESGGIRG
LFKKAGKLNITKTTPKKDGSINTSQSMHVGEFNQKLVEASTQFKKKQEKGTIDVWWLFDD
GGLTLLIPYILTLRKKWKDCKLRIYVGGKINRIEEEKIVMASLLSKFRIKFADIHIIGDI
NIRPNKESWKVFEEMIEPYRLHESCKDLTTAEKLKRETPWKITDAELEAVKEKSYRQVRL
NELLQEHSRAANLIVLPWKSHCPYKSLQGPTPAPHPPISATSTPTTVYAFLCHCSDHTGL
LTVKGLDRQENKLHIIITNCEKYCEGNKEKVLRQDNSVDLDALCGRELERLLSLSPLSAT
PPKREPASKRVQQPETTLELAHYSSLAGARSVQAL

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_2|1728_bp
atgtttgtcatcaactggtgggcagctgtcatcacctatgtcattgaattcttcctttac
gtctatgtgacttgtaagaagccagatgtgaactggggctcctccacacaggctctttcc
tacgtgagtgctttagacaatgctctggaattaaccacagtggaagaccacgtaaaaaac
ttcaggccccagtgcattgtcttaacagggggacccatgacaagacctgctctcctggac
ataactcacgcctttaccaagaacagtggcctttgcatctgctgtgaagtctttgtggga
ccgcgcaaactgtgtgttaaggagatgaacagtggcatggcgaaaaaacaggcctggctt
ataaagaacaaaatcaaggctttttatgctgcagtggcggcagactgtttcagggatggt
gtccgaagtcttcttcaggcctcaggcttaggaagaatgaaaccaaacactctggtgatt
ggatataagaaaaactggaggaaagctcccttgacagagattgagaactacgtgggaatc
atacatgatgcatttgattttgagattggcgtggttatagtcagaatcagccaaggattt
gacatctctcaggttcttcaggtgcaagaggaattagagagattagaacaggagagacta
gcattggaagcgactatcaaagataatgagtgtgaagaggaaagtggaggcatccgaggc
ttgtttaaaaaagctggcaagttgaacattactaagacaacgcctaaaaaagatggcagc
attaacacaagccagtcgatgcatgtgggagagttcaaccagaaactggtggaagccagc
actcaatttaaaaagaaacaagaaaaaggcacaattgatgtttggtggttgtttgatgat
ggagggttaacacttcttatcccctatatcttaactctcagaaaaaaatggaaagactgt
aaattaagaatctatgtgggagggaagatcaaccgcattgaagaagaaaaaattgtaatg
gcttcccttctgagcaaatttaggataaaatttgcagacatccatatcatcggtgacatc
aacattaggccaaacaaagagagctggaaagtctttgaagagatgattgaaccatatcgt
ctccatgaaagctgcaaagatttaacaactgctgagaaattaaaaagagaaactccgtgg
aaaattacagatgcagaactggaagcagtcaaggaaaagagttaccgccaagttcgactg
aatgaactcttacaggagcactccagagctgctaatctcattgtccttccatggaaaagc
cactgtccttacaagagcttacaaggccctacccctgcaccccatccacctatttctgcg
acttcaacacctacaactgtgtacgccttcctctgtcactgttcagaccacactggcctt
cttacagttaaaggtttagacagacaagaaaacaagttacacataataattacaaattgt
gaaaaatactgtgaaggaaataaagagaaagtgttgagacaggataactctgttgacctt
gatgccctttgtgggcgggaactggagaggctcctttcactcagcccactgtcggccact
cctcccaaaagagagcctgcgagcaagcgagtgcagcaaccggagacaacactggaactg
gcccattactcctctctggcaggagcacgctctgtgcaggccctatag

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_3|141_aa
MQLRFARLSEHATAPTRGSARAAGYDLYSAYDYTIPPMEKAVVKTDIQIALPSGCYGRVA
PRSGLAAKHFIDVGAGVIDEDYRGNVGVVLFNFGKEKFEVKKGDRIAQLICERIFYPEIE
EVQALDDTERGSGGFGSTGKN

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_3|426_bp
atgcagctccgctttgcccggctctccgagcacgccacggcccccacccggggctccgcg
cgcgccgcgggctacgacctgtacagtgcctatgattacacaataccacctatggagaaa
gctgttgtgaaaacggacattcagatagcgctcccttctgggtgttatggaagagtggct
ccacggtcaggcttggctgcaaaacactttattgatgtaggagctggtgtcatagatgaa
gattatagaggaaatgttggtgttgtactgtttaattttggcaaagaaaagtttgaagtc
aaaaaaggtgatcgaattgcacagctcatttgcgaacggattttttatccagaaatagaa
gaagttcaagccttggatgacaccgaaaggggttcaggaggttttggttccactggaaag
aattaa

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_4|299_aa
MNRKVTLGVDRGLDAERVLGQTPYAGCQQIHHFPGAAKSLSLAIIFCSMTLHQGRCPSPL
NLLLAYTEICDSTSQVRLWDRVMISNKNRCRLKFDIKMTTFKPAVFPYCQNESSDVLWQF
DSTSQIFLKVTPKTTLRRGGEHNLVGVEGCGRGEKESEEMKRERLLRQLAGSFTKQILAH
DMITKETAVGWILSVGSSEASVEQTRSDKKGASFKEPCSLAIANENNKAMVWMLLALEPG
LALLAPQLADNLLWNLVIVTCKCDLIGNMGLADVIKMRSYYIEEEGNMDTYTERHREDA

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_4|900_bp
atgaacagaaaagtgacactgggtgtggaccgaggactggatgccgaaagggtgcttggt
caaacaccctatgctggttgtcaacaaattcatcattttcccggtgctgccaaaagtctg
agtttggcgatcatcttctgctctatgacacttcaccaggggagatgtccatctccctta
aatctcctgctagcatacactgagatttgtgactctacatcacaggtcaggctctgggac
cgggtaatgattagcaacaagaacagatgcaggctgaaatttgatataaaaatgactaca
tttaagccagctgtattcccatattgtcagaatgagagctcagatgttttatggcaattt
gactccactagtcagatcttcctaaaggttactcctaaaaccacgttaaggagaggtgga
gagcacaacctggtaggagtggagggttgtgggagaggagaaaaagagagtgaagagatg
aaaagagagagattgctgagacaactagctggatcttttacaaaacaaatcctggcccac
gacatgattaccaaggagacagctgtggggtggatcttgagtgtaggctcatctgaggcc
agtgtcgagcagacaaggtcagacaaaaagggtgcatcttttaaagagccttgctcatta
gccattgctaatgagaacaacaaagccatggtgtggatgcttcttgccctagaacccgga
ttggctctccttgctcctcagcttgcagacaacctattgtggaaccttgtgatcgttact
tgtaaatgtgatcttattggaaatatgggcttggcagatgtaatcaagatgaggtcatac
tacattgaggaagagggaaatatggacacatatacagagagacacagagaggatgcctag

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_5|220_aa
MAERGQCRAQAMASEGGSSKPWQLPCGVEPVSARKSRIEVLEPLPRFQKMYGNTWMPRQK
FAIGVVSSWKTSARAMQKGNVGWEPPHRVPTGAPPSRTVRGGPPSSRPQNDRSTDSLHCA
PGKTSDTQHQPVKVARREAVPCKATGVELPKTLGTHLLHQCDLDVRHGVKGDHFEALGFD
CPAGFGTCMGPLAPLFWPISPIWNGCIYPMPVPPLYPGSN

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_5|663_bp
atggctgaaaggggccaatgtagagctcaggccatggcttcggagggtggaagctccaag
ccctggcagcttccatgtggtgttgagcctgtgagtgcacggaagtcaagaattgaggtt
ttggaacctctgcctagatttcagaagatgtatggaaacacttggatgcccaggcagaag
tttgctataggggtggtgtcctcatggaaaacctctgctagggcaatgcagaagggaaat
gtggggtgggagcccccacacagagtccctactggggcaccacctagtagaactgtgaga
ggagggccaccatcctccagaccccagaatgatagatccactgacagcttgcactgtgca
cctggaaaaacttcagacactcaacaccagccagtgaaggtagcaaggagggaggctgta
ccctgcaaagccacaggggtggagctgcccaagactctaggaacccacctcttgcatcag
tgcgacctggatgtgagacacggagtcaaaggagatcattttgaagctttaggatttgac
tgtcctgctggatttgggacttgcatggggcctctagcccctttgttttggccaatttct
cccatttggaatggctgtatttacccaatgcctgtacccccattgtatccaggaagtaac
taa

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_6|58_aa
MKKTFLAEKNQSKHAELGRSTGDPNIPGEGEEQLQEYQDRLCGKQKLTEERKEMRRPD

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_6|177_bp
atgaagaagacatttctggctgagaagaaccagagtaaacatgcagaattgggaagatcc
acaggagacccgaacatccccggagagggcgaagagcagctacaggagtaccaggaccgt
ttgtgtgggaagcagaagttgacggaagagagaaaggagatgagaaggcctgattga

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_7|273_aa
MPLLILPADENECLSAHICGGASCHNTLGSYKCMCPAGFQYEQFSGGCQDINECGSAQAP
CSYGCSNTEGGYLCGCPPGYFRIGQGHCVSGMGMGRGNPEPPVSGEMDDNSLSPEACYEC
KINGYPKRGRKRRSTNETDASNIEDQSETEANVSLASWDVEKTAIFAFNISHVSNKVRIL
ELLPALTTLTNHNRYLIESGNEDGFFKINQKEGISYLHFTKKKPVAGTYSLQISSTPLYK
KKELNQLEDKYDKDYLSGELGDNLKMKIQVLLH

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_7|822_bp
atgccgcttcttattttgcctgcagatgaaaacgaatgcctcagcgctcacatctgcgga
ggagcctcctgtcacaacaccctggggagctacaagtgcatgtgtcccgccggcttccag
tatgaacagttcagtggaggatgccaagacatcaatgaatgtggctctgcgcaggccccc
tgcagctatggctgttccaataccgagggcggttacctgtgtggctgtccacctggttac
ttccgcataggccaagggcactgtgtttctggaatgggcatgggccgaggaaacccagag
ccacctgtcagtggtgaaatggatgacaattcactctccccagaggcttgttacgagtgt
aagatcaatggctaccccaaacggggcaggaaacggagaagcacaaacgaaactgatgcc
tccaatatcgaggatcagtctgagacagaagccaatgtgagtcttgcaagttgggatgtt
gagaagacagccatctttgctttcaatatttcccacgtcagtaacaaggttcgaatccta
gaactccttccagctcttacaactctgacgaatcacaacagatacttgatcgaatctgga
aatgaagatggcttctttaaaatcaaccaaaaggaagggatcagctacctccacttcaca
aagaagaagccagtggctggaacctattcattacaaatcagtagtactccactttataaa
aagaaagaacttaaccaactagaagacaaatatgacaaagactacctcagtggtgaactg
ggtgataatctgaagatgaaaatccaggttttgcttcattaa

>gi568815583f:48231516_48442075|GENSCAN_predicted_peptide_8|596_aa
XIDECVEEPEICALGTCSNTEGSFKCLCPEGFSLSSSGRRCQDLRMSYCYAKFEGGKCSS
PKSRNHSKQECCCALKGEGWGDPCELCPTEPDEAFRQICPYGSGIIVGPDDSAVDMDECK
EPDVCKHGQCINTDGSYRCECPFGYILAGNECVDTDECSVGNPCGNGTCKNVIGGFECTC
EEGFEPGPMMTCEDINECAQNPLLCAFRCVNTYGSYECKCPVGYVLREDRRMCKDEDECE
EGKHDCTEKQMECKNLIGTYMCICGPGYQRRPDGEGCVDENECQTKPGICENGRCLNTRG
SYTCECNDGFTASPNQDECLDNREGYCFTEVLQNMCQIGSSNRNPVTKSECCCDGGRGWG
PHCEICPFQGTVAFKKLCPHGRGFMTNGADIDECKVIHDVCRNGECVNDRGSYHCICKTG
YTPDITGTSCVDLNECNQAPKPCNFICKNTEGSYQCSCPKGYILQEDGRSCKDLDECATK
QHNCQFLCVNTIGGFTCKCPPGFTQHHTSCIDNNECTSDINLCGSKGICQNTPGSFTCEC
QRGFSLDQTGSSCEDVDECEGNHRCQHGCQNIIGGYRCSCPQGYLQHYQWNQCVGK

>gi568815583f:48231516_48442075|GENSCAN_predicted_CDS_8|1791_bp
natattgatgagtgtgtcgaagagccagaaatttgtgccctgggcacatgcagtaacact
gaaggcagcttcaaatgtctgtgtccagaagggttttccttgtcctccagtggaagaagg
tgccaagatttgcgaatgagctactgttatgcgaagtttgaaggaggaaagtgttcatca
cccaaatccagaaatcactccaagcaggaatgctgctgtgccttgaagggagaaggctgg
ggagacccctgcgagctctgccccacggaacctgatgaggccttccgccagatatgtcct
tatggaagtgggatcatcgtgggacctgatgattcagcagttgatatggacgaatgcaaa
gaacccgatgtctgtaaacatggacagtgcatcaatacagatggttcctatcgctgcgag
tgtccctttggttatattctagcagggaatgaatgtgtagatactgatgaatgttctgtt
ggcaatccttgtggaaatggaacctgcaagaatgtgattggaggttttgaatgcacctgc
gaggagggatttgagcccggtccaatgatgacatgtgaagatataaatgaatgtgcccag
aatcctctgctctgtgccttccgatgtgtgaacacttatgggtcatatgaatgcaaatgt
cccgtgggatatgtgctcagagaagaccgtaggatgtgcaaagatgaggatgagtgtgaa
gagggaaaacatgactgtactgaaaaacaaatggaatgcaagaacctcattggcacatat
atgtgcatctgtggacccgggtatcagcggagacctgatggagaaggctgtgtagatgag
aatgaatgtcagacgaagccagggatctgtgagaatgggcgctgcctcaacacccgtggg
agctacacctgtgagtgtaatgatgggtttaccgccagccccaaccaggacgagtgcctt
gacaatcgggaagggtactgcttcacagaggtgctacaaaacatgtgtcagatcggctcc
agcaacaggaaccccgtcaccaaatcggaatgctgctgtgacggagggagaggctggggt
ccccactgtgagatctgccctttccaggggactgtggctttcaagaaactctgtccccat
ggccgaggattcatgaccaatggagcagatatcgatgaatgcaaggttattcacgatgtt
tgccgaaatggggaatgtgtcaatgacagaggatcatatcattgcatttgtaaaactggg
tacactccagatataactgggacttcctgtgtagatctgaacgagtgcaaccaggctccc
aaaccctgcaattttatctgcaaaaacacagaagggagttaccagtgttcatgcccgaaa
ggctacattctgcaagaggatggaaggagctgcaaagatcttgatgagtgtgcaaccaag
caacacaactgccagttcctatgtgttaacaccattggcggcttcacatgcaaatgtcct
cccggatttacccaacaccatacgtcctgcattgataacaatgaatgcacctctgacatc
aatctgtgcgggtctaagggcatttgccagaacactcctggaagcttcacctgtgaatgc
cagcggggattctcacttgatcagaccggctccagctgtgaagacgtggacgagtgtgag
ggtaaccaccgctgccagcatggctgccagaacatcattgggggctacaggtgcagctgc
ccccagggctacctccagcactaccagtggaaccagtgtgttggcaagtaa