GENSCAN 1.0 Date run: 4-Nov-116 Time: 05:50:15 Sequence gi568815587f:30131903_30333797 : 201895 bp : 38.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16767 16833 67 2 1 39 119 62 0.457 5.69 1.02 Intr + 41139 41318 180 1 0 77 92 32 0.013 1.42 1.03 Term + 45371 45558 188 0 2 86 48 95 0.030 1.97 1.04 PlyA + 46183 46188 6 1.05 2.00 Prom + 46360 46399 40 -9.15 2.01 Init + 47941 48292 352 0 1 54 77 205 0.422 13.37 2.02 Intr + 49120 50107 988 2 1 28 72 417 0.176 23.09 2.03 Intr + 50424 51323 900 0 0 31 39 355 0.062 14.38 2.04 Intr + 57688 57808 121 1 1 68 115 92 0.049 9.48 2.05 Intr + 78629 78708 80 1 2 94 75 76 0.157 4.33 2.06 Term + 86519 86744 226 2 1 66 53 171 0.088 6.67 2.07 PlyA + 89809 89814 6 1.05 3.00 Prom + 99077 99116 40 -3.75 3.01 Init + 100001 100159 159 1 0 79 73 98 0.915 5.74 3.02 Term + 101668 101898 231 0 0 99 43 154 0.982 7.49 3.03 PlyA + 102829 102834 6 1.05 4.04 PlyA - 103099 103094 6 1.05 4.03 Term - 114016 113870 147 1 0 47 53 131 0.417 2.42 4.02 Intr - 124526 124412 115 1 1 114 76 63 0.324 7.23 4.01 Init - 131547 131399 149 0 2 83 23 80 0.234 0.61 4.00 Prom - 132371 132332 40 -5.15 5.02 PlyA - 132488 132483 6 1.05 5.01 Sngl - 134106 133441 666 0 0 66 43 327 0.994 21.92 5.00 Prom - 134311 134272 40 -8.05 6.03 PlyA - 134471 134466 6 1.05 6.02 Term - 136290 136063 228 0 0 65 54 260 0.748 15.95 6.01 Init - 143085 143071 15 0 0 90 92 6 0.465 1.66 6.00 Prom - 143586 143547 40 -6.45 7.00 Prom + 145821 145860 40 -2.35 7.01 Init + 150475 150647 173 0 2 68 77 129 0.426 8.80 7.02 Intr + 163524 163695 172 0 1 74 9 107 0.230 0.42 7.03 Term + 163794 164006 213 2 0 22 42 254 0.531 10.45 7.04 PlyA + 164665 164670 6 1.05 8.00 Prom + 169043 169082 40 -1.65 8.01 Init + 175920 175995 76 2 1 47 93 65 0.412 4.20 8.02 Intr + 188450 188595 146 0 2 83 73 25 0.276 -0.42 8.03 Intr + 188938 189005 68 0 2 87 105 39 0.608 2.28 8.04 Intr + 190367 190488 122 2 2 82 36 54 0.433 -1.18 8.05 Term + 191204 191595 392 0 2 71 45 329 0.934 21.06 8.06 PlyA + 193883 193888 6 1.05 9.00 Prom + 198393 198432 40 -9.95 9.01 Init + 199050 199472 423 2 0 69 42 201 0.261 9.99 9.02 Intr + 200964 201091 128 2 2 92 96 58 0.382 5.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 57743 57808 66 1 0 108 115 72 0.877 13.04 S.002 Term + 86533 86744 212 2 2 81 53 183 0.892 10.57 S.003 Init + 108222 108331 110 2 2 107 22 102 0.830 5.24 S.004 Term + 108395 108587 193 2 1 44 43 131 0.812 0.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_1|144_aa MIQWVVSRDDFCGCIDIIIEKPARQEPTQDACLSFHRRASSRARAQSQPLCSTHEASAPN FSNPTLPAFRGNVLVRCWSLLICHTEARGGFPQSWEVPPLWLCRVQLSWAGVECLQLFQA HDANCQWIYHSGVWRTVTLFSQFH >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_1|435_bp atgattcagtgggtagttagcagggatgatttctgtggctgcattgacattatcattgaa aagcctgcaaggcaggagcccacacaggatgcctgcctcagctttcaccggagagcatca tcaagggccagagcccagtctcagcctctctgctccactcatgaggcctctgctccaaac ttctcaaaccctacacttcctgctttcagaggaaatgttcttgtgcgttgctggagtttg ctaatatgtcacactgaggcaagaggtgggttcccacagtcttgggaagttccacccctg tggctttgcagagtacagctttcatgggctggtgttgagtgtctgcagcttttccaggca catgatgcaaactgtcagtggatctaccattctggggtctggaggacagtgactctcttc tcacagttccactag >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_2|888_aa MEDEMNEMKQEEKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLERQANIQIQEIQRTPQRYSLRRETPRHKIVRFTKVEMKEKMLRAAREKDRS TRQKVNKDTQEMNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYSKADHIVGSKAFLSK CKRTEIITNCLSDHSAVKLELRIKKLTQNRSTTWKLNNLLLNDYWVNNEMKAEIKMFFET NENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASR RQEITKIRAELKEIQTQKTLQKINESRRWFLERINKIDRLLARLIKKKREENQIDAIKND KGDITTDPTEIQTTIRESYKHLYKNKLENLEEMNKFLDTYIFPRLNQEEAESLNRPITGS EIEAIINSLPTKKSPGPDGFTAEFYQRTNDINHTIISIDAEKAFDKMQQLFMLKTLSKLG IDGSYLKIIRAIYDKPTANIILNGQKLEAFPLKTGARQGCPLSPLLFNVVLEVLARAIRQ EKEIKGIQLGKEEVKLSLFAHDMIVYLENPIISAQNLLKLISNFSKVSGYKINVQKSQAF LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKEKYKPLLNEIKEDTNKWKN IPCSWIGRINIVKMAILPKVIYRFNAIPIKPPITFFTELEKTTLKFIWNQKRARIAKSIL SQKNKAGGITLPDFKLYYKATVTKTAWWPICREYVFFTALNHPGNMAFMVPLTMDVASET HTACDHMATKYQGMLSDSSSAADSKAPEHLRLNSSHSLGPGNYLRAVLTAHDASCMNAKV LANNAGVELHEWERDAGFPDEIEECSSPSNAVPYFVTEPQRKQFFHLP >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_2|2667_bp atggaagatgaaatgaatgaaatgaagcaagaagagaagtttagagaaaaaagaataaaa agaaacgaacaaagcctacaagaaatatgggactatgtgaaaagaccaaatctacgtctc attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagaaaggcaggccaacattcaaattcaggaaata cagagaactccacaaagatactctttgagaagagaaactccaagacacaaaattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagacagatca acgagacagaaagttaacaaggatacccaggaaatgaactcagctctgcaccaagcggac ctaatagacatctacagaactctccaccccaaatcaacagaatatacattcttttcagca ccacaccacacctattccaaagctgaccacatagttggaagtaaagcattcctcagcaaa tgtaaaagaacagaaattataacaaactgtctctcagaccacagcgcagtcaaactagaa ctaaggattaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctc ctgaatgactactgggtaaataacgaaatgaaggcagaaataaagatgttctttgaaacc aatgagaacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtaga gggaaatttatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacacc ctaacatcacaattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcaga aggcaagaaataactaaaatcagagcagaactgaaggaaatacagacacaaaaaaccctt caaaaaattaatgaatccaggagatggtttttggaaaggatcaacaaaattgatagactg ctagcaagactaataaagaagaaaagagaggagaatcaaatagatgcaataaaaaatgat aaaggggatatcaccactgatcccacagaaatacaaactaccatcagagaatcctataaa cacctctacaaaaataagctagaaaatctagaagaaatgaataaattccttgacacatac atcttcccaagactaaaccaggaagaagctgaatctctgaatagaccaataacaggctct gaaattgaggcaataatcaatagcttaccaaccaaaaaaagtccaggaccagatggattc acagctgaattctaccagagaaccaacgacataaaccacacgattatctcaatagatgca gaaaaggcctttgacaaaatgcaacaactcttcatgctaaaaactctcagtaaattaggt attgatgggtcgtatctcaaaataataagagctatttatgacaaacccacagccaatatc atactgaatgggcaaaaactggaagcattccctttgaaaactggcgcaagacagggatgt cctctatcaccactcctattcaacgtagtgttggaagttctggccagggcaataaggcag gagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctctttgca catgacatgattgtatatctggaaaaccccatcatctcagcccaaaatctcctcaagctg ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc ttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt gcttcaaagagaataaaatacctaggaatccaattgacaagggatgtgaaggacctcttc aaggagaaatacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaac attccatgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaaggta atttatagattcaatgccatccccatcaagccaccaataactttcttcacagaattagaa aaaactaccttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatccta agccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtggcccatatgcagagaatatgtgttcttcactgccctc aaccatccaggaaacatggccttcatggtcccactgaccatggatgtggcttcagagacc cacactgcctgtgaccacatggcaacaaagtaccagggaatgctcagtgactcatcctct gcagctgattcgaaggcacctgagcacctcaggctgaacagctctcattccctaggaccc gggaactacttgagagcagtcctcactgcacatgatgccagctgcatgaatgccaaagtt cttgccaacaatgctggtgttgaactacatgaatgggagagggatgctggcttccctgat gaaatagaagaatgctcctctccctctaacgctgtgccctactttgtcactgagccccaa agaaaacagttctttcatctgccttga >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_3|129_aa MKTLQFFFLFCCWKAICCNSCELTNITIAIEKEECRFCISINTTWCAGYCYTRDLVYKDP ARPKIQKTCTFKELVYETVRVPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPS YCSFGEMKE >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_3|390_bp atgaagacactccagtttttcttccttttctgttgctggaaagcaatctgctgcaatagc tgtgagctgaccaacatcaccattgcaatagagaaagaagaatgtcgtttctgcataagc atcaacaccacttggtgtgctggctactgctacaccagggatctggtgtataaggaccca gccaggcccaaaatccagaaaacatgtaccttcaaggaactggtatacgaaacagtgaga gtgcccggctgtgctcaccatgcagattccttgtatacatacccagtggccacccagtgt cactgtggcaagtgtgacagcgacagcactgattgtactgtgcgaggcctggggcccagc tactgctcctttggtgaaatgaaagaataa >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_4|136_aa MDETGNHHSQQTIARTENQTPHVLTRRWELNNENTWTQEGEHYTLGPVVGESTRKYLMNS EVEMYLWQEACSGFLGQSREESAIGKEQGNTFTDSSVQSIDIFVGHYPAYRHRLHNLKVN PDMACDSLEEMQHDGA >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_4|411_bp atggatgaaactggaaaccatcattctcagcaaactatcgcaaggacagaaaaccaaaca ccgcatgttctcactcgtaggtgggaactgaacaatgagaacacatggacacaggaaggg gaacattacacactggggcctgttgtgggtgagagcaccagaaagtatttgatgaattct gaagttgaaatgtacctgtggcaagaagcttgttcagggttcttaggacaatctagagaa gagtcagctataggaaaggagcagggtaatacattcacagattccagtgttcagagcata gatatctttgtgggtcattatcctgcttaccgccacaggttacataatcttaaagttaac cctgatatggcttgtgattcactagaagaaatgcagcacgatggagcctga >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_5|221_aa MKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLNTYILPRLNREEVESLNRP ITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSTEKEGILPKSFYEA SIILIPKPGRDTTKKENFRPISLMNINAKILNKILANRIQQPIKKLIHHDQMGFISGMQG WFNIHKSINVIHHINRTKDKNHMIISIDAEKAFDKIQQPSS >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_5|666_bp atgaaaaatgataaaggggatatcaccaccaatcccacagaaatacaaactaccatcaga gaatactataaacacctgtatgcaaacaaactagaaaatctagaagaaatggataaattc ctcaacacatacatcctcccaagactaaaccgggaagaagttgaatctctgaatagacca ataacaggctctgaaattgaggcaataatcaatagcttaccaaccaaaaaaagtccagga ccagatggattcacagccgaattctaccagaggtacaaagaggagctggtaccattcctt ctgaaactattccaatcaacagaaaaagagggaatcctccctaagtcattttatgaggcc agcatcatcctaataccaaaacctggcagagacacaaccaaaaaagagaattttagacca atatccttgatgaacattaatgcaaaaatcctcaataaaatactggcaaacagaatccag cagcccatcaaaaagcttatccatcatgatcaaatgggcttcatctctgggatgcaaggc tggttcaacatacacaaatcaataaatgtaatccatcatataaacagaaccaaagacaaa aaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaaccttca tcctag >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_6|80_aa MLIVNECSSSPAMEQSWTENDFDELREEGFRRSNYSELQEEIRTNGKEVKSFEKKLDKWI SRITNAEKSLKDLMELKTKA >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_6|243_bp atgctcattgttaacgaatgcagctcctcaccagcaatggaacaaagctggacggagaat gactttgacgagttgagagaagaaggcttcagacgatcaaactactccgagctacaggag gaaattcgaaccaatggcaaagaagtaaaaagctttgaaaaaaaattagacaaatggata agtaggataaccaatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggca tga >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_7|185_aa MAAGHASLYKAGPGRMWSVCRPQPLPEGALHPGTPNKGNAGAAPVIEGGSPKAQERTWSI NGRLQLTMGESSRRVDPADRHVPTHPHSPTITASPELLCQYALTHSCTPFFVYMWHQLAN KGRWSHTPPPSLNCITTTTGKIALHRGQQFCIHQHPNPTPTTTSANIHTDASGLAPLLQV VTDAL >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_7|558_bp atggcagcagggcatgcctccctctacaaggctggtccaggaaggatgtggtcagtctgc cggccacagcctctgcctgagggagcgctgcatcctggaacacctaacaaaggaaatgca ggtgcagcaccagtgatcgaagggggctcccctaaggcacaggaaaggacatggtccatc aacggccgactgcaactgaccatgggggagagctccagaagagtggaccctgcagataga catgtaccaacccaccctcactctcccaccatcacagcctctcctgagctgctctgccag tacgcacttacccacagctgcacccccttctttgtttacatgtggcatcaacttgcaaac aaaggccggtggtcccacaccccacccccttccctgaactgcatcaccactaccactggc aaaattgccctacatagaggccagcagttctgcatccaccagcaccccaaccccacacca accaccaccagtgcaaacatccacactgatgccagtggccttgctcccctgctccaggtt gtcactgatgcactgtga >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_8|267_aa MYLMATVLDSTAVATNGMHNSHDDKGIIFPIPPSSKDRTTTHRLTSPKTLIFQLRSLSYP ALFFQLPAVYLHLEILVAPLAGTATMFLDITKSSGRSSRHQIACLVGEQYESGGRTIDAL QKTAYSRCRELLARLGLVVARGLAGKRSVECGLCGLASCPKPGGGELKLRWKGNGGGPGE EWESGDRPLGGEGARRRNGGTRGEVEEEARTREGDFQLLAGASHPYPRVVHLRRNALCQS PAQVLPPLTSLRVVACPVGPGPSKLRF >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_8|804_bp atgtacctaatggccactgtactggacagcacagcagtagccactaatggaatgcataat tctcatgatgataaaggtatcatcttccctatcccacccagctccaaagatagaactacc actcacagactaacatctcccaaaactcttatcttccagctgagatccctctcttatcct gccctcttcttccagctgccagctgtatatcttcacttggagatactagtagcacccctt gctggtacagccacaatgtttctagacattaccaaatccagtgggaggagttctaggcat caaatcgcttgcctagtcggcgagcagtatgaaagtggtgggaggaccatagacgcactg cagaaaacagcttacagtagatgcagagaattactagcacgcttgggccttgtggtagcg cggggactggctgggaagcggtcggtcgagtgtggcctgtgtggactcgcatcttgcccg aagccgggcggaggagagctcaagctaaggtggaaggggaacgggggaggccctggagag gagtgggagagtggagacaggcccctgggtggcgagggagcccggcggcgtaacggggga acccggggagaggtggaagaggaggcccgcacacgtgagggggatttccagctgctggcc ggggcctctcacccctacccccgcgtagttcatctgcgacgcaacgccttgtgtcaaagc ccagcacaggttctgccgcctctgacctctctgagggtcgtcgcctgtcctgtaggccca ggaccttcaaagctacgattttga >gi568815587f:30131903_30333797|GENSCAN_predicted_peptide_9|184_aa MDPCSVGVQLRTTNECHKTYYTRHTGFKTLQELSSNDMLLLQLRTGMTLSGNNTICFHHV KIYIDRFEDLQKSCCDPFNIHKKLAKKNLHVIDLDDATFLSAKFGRQLVPGWKLCPKCTQ IINGSVDVDTEDRQKRKPESDGRTAKALRSLQFTNPGRQTEFAPETGKREKRRLTKNATA GSDS >gi568815587f:30131903_30333797|GENSCAN_predicted_CDS_9|552_bp atggatccatgttcagttggagtccagcttcgtactacaaatgagtgccataaaacctac tatactcgtcacacaggttttaagactttgcaagaattgtcatcaaatgatatgctttta cttcaacttagaactggaatgacactttctgggaacaatacaatttgctttcatcatgta aaaatttacattgacagatttgaggatttacagaagtcatgttgtgacccatttaacata cacaagaaattagccaaaaaaaatttgcatgtaattgacttagatgatgccacttttctg agtgctaaatttggaagacagcttgtacctggttggaagctttgtccaaaatgcacacag ataatcaatggaagtgtggatgttgatactgaagaccgccagaaaaggaaacctgagtca gatggaagaactgctaaagctttgaggtcattacaatttacgaatccaggaaggcaaact gaatttgctccagaaactggtaaaagagaaaaaagaaggcttacaaaaaatgcaaccgct ggttcagacagn