GENSCAN 1.0    Date run: 8-Nov-116    Time: 09:15:26

Sequence gi568815595f:98290666_98491628 : 200963 bp : 37.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   5941   6083  143  1  2   80   37   108 0.240   3.98
 1.02 Term +  13561  13724  164  2  2   72   48    98 0.280   1.32
 1.03 PlyA +  14171  14176    6                               1.05

 2.00 Prom +  23242  23281   40                              -4.85
 2.01 Init +  36192  36306  115  2  1   73   66    94 0.955   6.12
 2.02 Intr +  36967  37187  221  2  2   58   93   108 0.931   5.40
 2.03 Intr +  38152  38375  224  0  2   27   91    89 0.448  -0.90
 2.04 Intr +  38838  38944  107  0  2   64   81    90 0.860   4.84
 2.05 Intr +  63175  63682  508  2  1   41   31   242 0.004   4.90
 2.06 Intr +  71932  72195  264  1  0   62   74   114 0.029   3.10
 2.07 Intr +  97347  97449  103  0  1   41   65    88 0.025   1.06
 2.08 Term + 111775 111948  174  0  0   72   48   140 0.720   5.18
 2.09 PlyA + 113616 113621    6                               1.05

 3.00 Prom + 113997 114036   40                              -7.05
 3.01 Init + 115914 116018  105  2  0   63   66    66 0.379   2.18
 3.02 Term + 116472 116786  315  2  0   74   47   234 0.842  11.86
 3.03 PlyA + 117042 117047    6                               1.05

 4.00 Prom + 118364 118403   40                              -6.15
 4.01 Sngl + 118457 119254  798  1  0   49   37   327 0.614  19.30
 4.02 PlyA + 119291 119296    6                               1.05

 5.00 Prom + 120229 120268   40                             -10.15
 5.01 Init + 120289 120489  201  0  0   60   86   148 0.122  10.72
 5.02 Intr + 123692 123752   61  1  1   94   89   -22 0.070  -4.11
 5.03 Intr + 130856 131017  162  0  0   59  121    94 0.737   8.83
 5.04 Intr + 131748 132041  294  1  0   20   73   159 0.417   3.66
 5.05 Term + 138085 138581  497  2  2   48   48   266 0.049  12.34
 5.06 PlyA + 139038 139043    6                               1.05

 6.05 PlyA - 141055 141050    6                               1.05
 6.04 Term - 154386 154142  245  1  2   45   42   178 0.367   3.98
 6.03 Intr - 171090 170984  107  2  2   41   94    40 0.091  -1.16
 6.02 Intr - 177267 175893 1375  1  1   63   53   478 0.344  28.75
 6.01 Init - 177946 177889   58  1  1   49   82    42 0.506   1.22
 6.00 Prom - 178051 178012   40                              -8.75

 7.00 Prom + 178114 178153   40                              -4.55
 7.01 Init + 178912 179545  634  0  1   64   40   323 0.529  20.65
 7.02 Term + 181634 181710   77  0  2   72   48    47 0.468  -3.88
 7.03 PlyA + 182031 182036    6                               1.05

 8.02 PlyA - 182460 182455    6                               1.05
 8.01 Sngl - 184639 184253  387  1  0   77   47   204 0.926  11.16
 8.00 Prom - 187050 187011   40                              -5.75

 9.04 PlyA - 187316 187311    6                               1.05
 9.03 Term - 192099 191897  203  1  2   21   41   137 0.681  -1.13
 9.02 Intr - 192966 192779  188  2  2   54  110    88 0.886   6.01
 9.01 Intr - 197307 197185  123  2  0   62   55   109 0.219   3.78

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_1|102_aa
XSRNDCKSRNHSFLPLEEQKGKSKEDFDLQFGYQLNQSRTGHQEAPVPGQSACHQNNQHN
VISISSVEYVVEKKEDQVPEYLQLIYPTKELYPEYTASSQNQ

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_1|309_bp
nnatccaggaatgactgcaaatccaggaatcattccttccttccacttgaggagcagaaa
ggaaaaagtaaagaggactttgatctgcaatttggataccagctcaatcagagtaggaca
gggcaccaggaggccccagttccaggacaatcagcatgtcatcagaataatcagcacaat
gtcatcagtatatcatcagtggaatatgtcgtggaaaagaaagaggatcaagttcccgaa
tatttgcaactcatatatccgacaaaagaattgtatccagaatatacagcaagctctcaa
aatcaatga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_2|571_aa
MAPPGPTNPSPVAPLRVIQCTEGQLQTPTISSLTQHISKTVHSPAVLPQEYTKKRRCILT
HPPYCCVGFPWAGTGPHILYLSRLASNLELFKRGKGRGEQRKEEVTCGMLRKHLRKRCVR
GLEDCPLSPQTSGNLIQGPHTPRMCELARGISQRLDGAEHRLAPAEPSHESPPARAACLP
LRKSSPREPHGLKDLATTMTTTKILGRVPAIGGDFPRPTSRPEEKPGMARENHSLAAEFI
LIGFTNYPELKTLLFVVFSAIYLVTMVGNLGLVALIYVERRLLTPMYIFLGNLALMDSCC
SCAVTPKMLENFFSEDRIISLYECMAQFYFLCLAETTDCFLLATMAYDRYVAICHPLQYH
TMMSKTLCIRMTTGAFKAGNLHSMIHVGLLLSQVGRETEKRNKTQRQSIEKQQWDQEDRH
SAYQGPAPTPASESPQFLLIIILIISAKRNVVGQQGDNKEKISKKHVSKRTYVIIKFKGS
LYANDSDGTGVTDFRCPNKYIVYRGVKCEEGNNRVGFTSLPHQQMEVQFASLLPTTIEDG
ALAGPEAATIALSAPCTCTNTAKRTADPPPT

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_2|1716_bp
atggctccacctggacccaccaacccatctcctgtggccccactcagagtgattcaatgc
acagaaggacagcttcaaacccctacgatttcatctctgacccaacacatcagcaaaacg
gtacactcaccagcagttctgccacaagagtacaccaaaaaaaggagatgcatcctgacg
catccaccctactgctgtgtcgggtttccgtgggctggaacaggacctcacattctgtat
ttgtcccgattggctagcaacttagaactttttaaaagaggcaaaggtagaggagaacaa
agaaaggaggaagtcacttgtggaatgttgagaaagcacctgaggaagcggtgcgtgaga
ggactggaggactgcccactctcaccacaaacctctgggaacctaatacaggggccccac
acccctaggatgtgtgagctggcaagggggatctctcagagattagatggagcagagcac
aggttagctccagcagaacccagtcatgaaagcccaccagccagggctgcctgtctgcct
ctgagaaagtctagccccagggagccccatggcctgaaagacttagcaacaacaatgaca
acaacaaaaatcttaggcagagtgccagcaatcggaggtgacttcccaaggcccacgagc
agacctgaagagaagccaggaatggctagggaaaatcactccttagcagctgaattcatc
ctcataggatttacaaattatccagagctgaagacgcttctgtttgtggtgttctctgcc
atctatctggtcaccatggtggggaatcttggtctggtggcattaatttatgtagagcgt
cgtcttctcacaccaatgtacatctttctgggcaacctggctctgatggattcctgctgt
tcctgtgctgttacccccaagatgttagagaatttcttttctgaggatagaattatttcc
ctgtatgaatgtatggcacaattttattttctctgtcttgctgaaaccacagactgcttt
cttctggcgacaatggcctatgaccgctatgtggccatatgccacccactgcagtaccac
accatgatgtccaagacgctctgcattcggatgaccacaggggccttcaaagctggaaac
ctgcattccatgattcatgtagggcttttattaagtcaggtgggacgagagactgagaaa
agaaataagacacagagacaaagtatagagaaacaacagtgggaccaggaggaccggcac
tcagcataccaaggacctgcaccgacaccggcctctgagtcccctcagtttttattgatt
attattctcattatttcagcaaaaaggaatgtagtaggacagcagggtgataataaggag
aagatcagcaaaaaacatgtgagcaaaagaacctatgtcataattaagttcaagggaagc
ctatatgctaatgatagtgatgggacaggagtgaccgattttagatgtcctaacaaatat
atagtatatcgaggagtgaagtgtgaagaaggaaacaacagggtgggttttacttcattg
cctcaccagcaaatggaagtgcagtttgcctctctgctaccaaccaccattgaagatgga
gccttggcaggcccagaggcagcaaccattgccctgtcagcaccctgcacttgtactaac
actgcaaagagaacagcagatcctcccccaacctga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_3|139_aa
MVMRWSQDGQIGTAPVYSSQCEQRRKRVISAFPPEGTDKQKDSSNFCRLKCPCLTALKRV
VVRPARSWRSENGQTASSSGSLNPDQPNGLAPPSRGRLTPHMAGYSSETKLPEERSGSCI
CGSPNSAVLQPPLFCSHRC

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_3|420_bp
atggtcatgaggtggagccaagatggccaaataggaacagctccagtctacagctcccag
tgtgagcaacgcagaaaacgggtgatttctgcatttccacctgagggcacagacaaacaa
aaagacagcagtaacttctgcagacttaaatgtccctgtctgacagctttgaagagagta
gtggttcgcccagcacgcagctggagatctgagaatgggcagactgcctcctcaagtggg
tccctgaaccccgatcagcctaacgggttggcacccccaagtaggggcagactgacacct
cacatggccgggtactcctctgagacaaaacttccagaggaacgatcaggcagctgcatt
tgtgggtcaccaaattccgctgttctacagccaccgctgttctgcagccatcgctgctga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_4|265_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTECTFFSAPHHTYS
KVDHIVGSKALLSECKRTEIITNCLSDHSVIKQELRIKKLTQNCSTTWKLNNLLLNDYWV
HDEMKAEIKMFFETNENKDTTYQKLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETHKTLQKINESRSWFLEKINKIDRLLARLIK
KKRENNQIDAIKNDKGDVTTDLTEI

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_4|798_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagttaac
aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga
actctccaccccaaatcaacagaatgtacatttttttcagcaccacaccacacctattcc
aaagttgaccacatagttggaagtaaagcactcctcagcgaatgtaaaagaacagaaatt
ataacaaactgtctgtcagaccacagtgtaatcaaacaagaactcaggattaagaaactc
actcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
catgacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acataccagaaactctgggacacattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaaacaagagcaaacacattcaaaagctagcagaaggcaagaaataactaag
atcagagcagaactgaaggaaatagagacacataaaacccttcaaaaaatcaatgaatcc
aggagctggtttttagaaaagatcaacaaaattgatagactgctagcaagactaataaag
aagaaaagagagaacaatcaaatagatgcaataaaaaatgataaaggggatgtcaccact
gatctcacagaaatataa

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_5|404_aa
MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRIKIV
KMAIRPKAPVCMFFPHVSMCSHPLAPSWVLQKGSQRYWKKATGRRKSPLNFGTICTDREV
SWPELRGGHESGVQTPQAEEEGKGGEYYIKGTCQGTKESEPQPSALDLSSDRARGNWKRN
QKTNSGNMTKQGSLTPPKNCNSSLSMDPNQEEIPNLPEKEFRKLVIKLIRETAEKDEGQS
GTYIIEVPHHRGPKDKPTELSFTPPMPQHAIWGPGDFPAQSTNVDTGALFLGAHGGPITL
PLMSHTGLIIISPTTSQPLTTQLLTTLEPAACLATAIAITHATPTVQGPKNLPTNWLTTI
TIHIQASYLRTQVLGHLDPLTLVSHVTLGPKDRHTWPTAVNTGA

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_5|1215_bp
atgagtgaactcccattcacaattgcttcaaagagaataaaatacctgggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactcctcaatgaaataaaa
gaggatacaaataagtggaagaacattccatgctcatgggtaggaagaatcaagatcgtg
aaaatggccatacggcccaaggccccagtgtgcatgtttttcccccatgtgtccatgtgt
tctcatcctttagctccctcttgggtccttcagaagggcagccagaggtactggaaaaag
gccactgggagaaggaaatctccactgaactttggaacaatttgtactgatcgagaagtc
tcctggccagaactcaggggagggcatgaatctggtgtgcagactccacaggcagaggaa
gaaggaaaaggaggagagtactatatcaagggaacatgccaggggacaaaagaatctgaa
ccacagccttcagccctagacctttcctctgacagagccagaggaaactggaaaagaaac
cagaaaaccaactctggtaatatgacaaaacaaggttctttaacaccccccaaaaattgc
aatagctcactgtcaatggatccaaaccaagaagaaatccctaatttacctgaaaaagaa
ttcaggaagttagttattaagctaatcagggagacagcagaaaaagatgaaggccaatct
ggcacatacatcatagaggtcccacatcatagaggtcccaaggacaagcccactgagctc
agcttcactccccctatgccacagcatgcaatctgggggcctggagatttcccggcccag
tctaccaatgttgacactggagcactcttcctgggggcccatggtgggcctatcaccctg
ccactcatgtcacacacaggcctaattattatttcaccaaccacatcacaaccactgaca
acacaactgcttaccactctggaaccagcagcttgtcttgccactgctattgccatcacc
catgccacaccaactgttcaggggcccaaaaacttacccactaactggctcaccactatc
actatccacatccaagctagctacctgaggacccaagtactgggccatctggacccacta
acactggtgtcccatgtcaccctggggcccaaggacaggcacacttggcccactgctgtc
aacactggggcttga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_6|594_aa
MLIEVSQSKKDKDSVIPFTVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENP
IVSAQNLLKLISNFSKVSGYKINVKKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGI
QLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCPWVGRINIVKMAILPKVIYRFNAIPIK
LPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWY
QNRDIDQWNRTEPSEIMLHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWIAIGRKLKLDP
FLTAYTKINSRWIKDLNVRPKTIKTLEENLGITFQDIGVGKDFMSKTPKAMATKAKIDKW
DLIKLKSFCTAKETTIRVNRQPTKWEKIFATYTSDKRLISRIYNELKQIYKKKTNNPIKK
WAKDMNRHFSKEDIYAAKKHMKKCSSSLAITEMQIKTTMRYHLTPVRMAIIKKSGNNRFC
SHGTYILIEGSRQAYINKYMKKKKAEITKCYAGEVLKFYIVTVSGYTQENGALANDLNRE
NAIQEIITKGVKSKNSNKVTCDKLLLEGLVLTELRILSIGEVILQKLESLWGYS

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_6|1785_bp
atgctaattgaagtaagccagtcaaaaaaggacaaagattctgttattccatttacagtg
ttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaattagga
aaagaggaagtcaaattgtccctgtttgcagacgacatgattgtttatctagaaaacccc
atcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatac
aaaatcaatgtaaaaaaatcacaagcattcttatacaccaacaacagacaaacagagagc
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc
caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaa
ataaaagaggacacaaacaaatggaagaacattccatgcccatgggtaggaagaatcaat
atcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaag
ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa
aaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcaca
ctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtac
caaaacagagatatagatcaatggaacagaacagagccctcagaaataatgctgcatatc
tacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattcccta
tttaataaatggtgctgggaaaactggatagccataggtagaaagctgaaactggatccc
ttccttacagcttatacaaaaatcaattcaagatggattaaagatttaaacgttagacct
aaaaccataaaaaccctagaagaaaacctaggcattacctttcaggacataggcgtgggc
aaggacttcatgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgg
gatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagg
caacctacaaaatgggagaaaattttcgcaacctacacatctgacaaacggctaatatcc
agaatctacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaag
tgggcgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacac
atgaaaaaatgctcatcatcactggccatcacagaaatgcaaatcaaaaccactatgaga
tatcatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagattctgt
tctcatggaacttacattctaatcgaggggagcaggcaagcatatataaataaatacatg
aagaagaaaaaagcagaaattactaagtgctatgctggagaagttttaaagttttacatt
gtcactgtatcagggtatactcaggaaaacggagcccttgccaacgatttaaatagagaa
aatgcaattcaggaaataattacaaagggtgtgaaaagtaaaaattcaaacaaggtaacc
tgtgataagctcctactggaaggactggtgttaacagaactcagaattctgtccatagga
gaggttattctgcagaagctggaatctctctggggatacagctaa

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_7|236_aa
MAEENHTMKNEFILTGFTDHPELKTLLFVVFFAIYLITVVGNISLVALIFTHRRLHTPMY
IFLGNLALVDSCCACAITPKMLENFFSENKRISLYECAVQFYFLCTVETADCFLLAAMAY
DRYVAICNPLQYHIMMSKKLCIQMTTGAFIAGNLHSMIHVGLVFRLVFCGSNHINHFYCD
ILPLYRLSCVDPYINELVLFIFSGSVQVFTIVGPGNVEMHHFCTSDATQYQYNVGF

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_7|711_bp
atggctgaagaaaatcataccatgaaaaatgagtttatcctcacaggatttacagatcac
cctgagctgaagactctgctgtttgtggtgttctttgccatctatctgatcaccgtggtg
gggaatattagtttggtggcactgatatttacacaccgtcggcttcacacaccaatgtac
atctttctgggaaatctggctcttgtggattcttgctgtgcctgtgctattacccccaaa
atgttagagaacttcttttctgagaacaaaaggatttccctctatgaatgtgcagtacag
ttttattttctttgcactgtggaaactgcagactgctttcttctggcagcaatggcctat
gaccgctatgtggccatatgcaacccactgcagtaccacatcatgatgtccaagaaactc
tgcattcagatgaccacaggggccttcatagctggaaacctgcattccatgattcatgta
gggcttgtatttaggttagttttctgtggatcgaatcacatcaaccacttttactgtgat
attcttcccttgtatagactctcttgtgttgatccttatatcaatgaactggttctattc
atcttctcaggttcagttcaagtctttaccatagtgggtccaggaaatgttgaaatgcat
catttttgcacttccgatgctacacaataccagtacaatgtaggattttga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_8|128_aa
MTILPKAVYKFNAIPVKIPPSFFTKLEKKMLKFIWSQERAGITKARLSKRNKLGGITLPD
FKAIVTKTEWYWYENKHIDQWNKMENPEIMPNTYSQLIFDKTKKNIKCGKTPYSTNGVGL
IGKPHVED

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_8|387_bp
atgaccatactgccaaaagcagtctacaaattcaatgcaattcctgtcaaaataccacca
tcattcttcacaaaattagaaaaaaaaatgctaaaattcatatggagccaagaaagagct
ggcataaccaaagcaagactaagcaaaaggaacaaacttggaggcatcacattacctgac
ttcaaggctatagtcaccaaaacagaatggtactggtatgaaaataaacacatagaccaa
tggaacaaaatggagaacccagaaataatgccaaatacttacagccaactgatcttcgac
aaaacaaagaaaaacataaagtgtggaaagacaccctattcaacaaatggtgttgggtta
attggcaagccacatgtagaagattga

>gi568815595f:98290666_98491628|GENSCAN_predicted_peptide_9|171_aa
XVCTFEEVGTYFSVHRLASAGDALQKSTSLEILVRPFGGVHGEAISDFISLKSLGRTASG
IGEGQVEENFQLNFVIILTKCKFSWAESWVEGMGSTDRSTEAVAAQPTFKKISTLKTTTM
DPQRVYFISLLPPLAQVLVSTAERLEDGSHDRTPCRHSSVPAQSSVALLGG

>gi568815595f:98290666_98491628|GENSCAN_predicted_CDS_9|516_bp
nntgtctgtacgtttgaagaagtaggcacctatttcagtgttcacagactggcttcagca
ggagatgcccttcagaagtcaaccagtctagagattcttgttaggccatttggtggcgtt
catggggaagccatttctgactttatctcactgaagtcactgggtaggactgccagtgga
attggggaaggacaggtagaagaaaactttcagctgaactttgtaataattttgaccaag
tgcaaattttcctgggcagaatcctgggtggagggaatgggaagtacagacaggagcaca
gaagctgtggcagcacaaccaacattcaagaaaatcagcacactaaaaactacaaccatg
gaccctcaaagagtctacttcatatccctgctaccaccactggcccaggtgctggtatcc
acagctgagagacttgaagatggatcacatgacaggactccttgcagacactcttcagta
ccagcccagagctcagtagcgctgctgggtggctag