GENSCAN 1.0 Date run: 3-Nov-116 Time: 20:26:15

Sequence gi568815591f:91164881_91366821 : 201941 bp : 39.71% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   9470   9855  386  1  2   88   44   401 0.843  32.06
 1.02 Intr +   9934  10181  248  1  2  -48   58   212 0.426   0.78
 1.03 Intr +  11766  12013  248  1  2   53   41   228 0.694  10.86
 1.04 Term +  12302  13657 1356  1  0  -21   54   491 0.615  25.24
 1.05 PlyA +  15126  15131    6                               1.05

 2.00 Prom +  16144  16183   40                              -4.25
 2.01 Init +  32986  33117  132  0  0   27   88   148 0.794   8.89
 2.02 Term +  39122  39241  120  1  0   42   38   128 0.617   0.69
 2.03 PlyA +  45340  45345    6                               1.05

 3.03 PlyA -  45462  45457    6                               1.05
 3.02 Term -  47042  46945   98  0  2  101   49   127 0.936   7.25
 3.01 Init -  48184  47842  343  1  1   64  -10   133 0.355  -1.34
 3.00 Prom -  49340  49301   40                              -3.65

 4.00 Prom +  50159  50198   40                              -7.45
 4.01 Init +  52397  52813  417  1  0   66   41   212 0.252  10.78
 4.02 Intr +  53510  53625  116  1  2   96   73    67 0.174   4.33
 4.03 Intr +  77310  77412  103  0  1   13   98   154 0.884   8.16
 4.04 Term +  78492  78695  204  2  0   86   44    95 0.698   1.29
 4.05 PlyA +  78883  78888    6                               1.05

 5.03 PlyA -  80605  80600    6                               1.05
 5.02 Term -  84197  84001  197  0  2   62   42   150 0.452   4.39
 5.01 Init -  88938  88611  328  0  1   71    1   242 0.385  11.33
 5.00 Prom -  88992  88953   40                              -3.05

 6.00 Prom +  89415  89454   40                              -6.35
 6.01 Init +  93042  93149  108  2  0   28   96    50 0.146   0.10
 6.02 Term +  99932 101944 2013  1  0   37   53  2566 0.619 232.35
 6.03 PlyA + 103436 103441    6                               1.05

 7.03 PlyA - 103729 103724    6                               1.05
 7.02 Term - 108958 108849  110  2  2  117   52    59 0.768   2.89
 7.01 Init - 118114 118027   88  1  1   84   70   116 0.725  10.25
 7.00 Prom - 120334 120295   40                              -7.55

 8.05 PlyA - 120940 120935    6                               1.05
 8.04 Term - 121855 121712  144  1  0   34   43   136 0.129   0.63
 8.03 Intr - 127144 127035  110  2  2   48   59   121 0.008   4.28
 8.02 Intr - 132160 132067   94  1  1  104   48    63 0.150   2.52
 8.01 Init - 156094 155945  150  1  0   86   52    90 0.367   5.29
 8.00 Prom - 159271 159232   40                              -6.35

 9.04 PlyA - 160794 160789    6                               1.05
 9.03 Term - 161017 160808  210  1  0   28   49   139 0.021   0.31
 9.02 Intr - 163217 163100  118  1  1  135   83     8 0.037   4.55
 9.01 Init - 183094 182928  167  1  2   64   97   115 0.652   9.16
 9.00 Prom - 186606 186567   40                              -1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  74130  74212   83  2  2   56   78    39 0.802   0.20
S.002 Init - 162148 162031  118  1  1   60   91    45 0.828   2.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_1|745_aa
MGKKQNRKTENSKTQSASPPPKEHSSSPATEQSWMENDFDELREEGFRRSNYSELQEDIQ
TKGKEVENFEKNLEECITRITNREKCLKELMELKTKARELREECRSLRSRCDQLEERVSA
MEDEMNEMKPNLRLIGVPESDVENGTKLENTLQDIIQENFPNLARKANIQIQEIQRTPQR
YSLRRATPRHIIARFTKVEMKEKMLRAAREKEIQTTIREYYKHLYANKLENLEEMDKFLD
TYTLPRLNQEKVESLNRPITGAEIVAIINSLPTKKGPGPDGFTAEFYQRYKEELYINRAK
DKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDKPTANIILNGQKLEA
FPLKTGTRQGYPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVRLSLFADDMIVYLEN
PIVSARNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLG
IQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPI
KLPMPFFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITLPDFKLYYKATVTKTAWYW
YQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLD
PFLTPYTKMNSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDK
WDLIKLKSFCTAKETTIRVNRQPTK
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_1|2238_bp
atggggaaaaaacagaacagaaaaactgaaaactctaaaacgcagagcgcctctcctcct
ccaaaggaacacagttcctcaccagcaacggaacaaagctggatggagaatgactttgac
gagctgagagaagaaggcttcagacggtcaaattactctgagctacaggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatagagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaagatgaaatgaatgaaatgaaaccaaatctacgtctgattggtgtacctgaaagt
gatgtggagaatggaaccaagttggaaaacactctgcaggatattatccaggagaacttc
cccaatctagcaaggaaggccaacattcagattcaggaaatacagagaacgccacaaaga
tactccttgagaagagcaactccaagacacataattgccagattcaccaaagttgaaatg
aaggaaaaaatgttaagggcagccagagagaaagaaatacaaactaccatcagagaatac
tacaaacacctctatgcaaataaactagaaaatctagaagaaatggataaattcctcgac
acatacactctcccaagactaaaccaggaaaaagttgaatctctgaatagaccaataaca
ggagctgaaattgtggcaataatcaatagcttaccaaccaaaaagggtccaggaccagat
ggattcacagccgaattctaccagaggtacaaggaggaactgtatataaacagagccaaa
gacaaaaaccacatgattatctcaatagatgcagaaaaagcctttgacaaaattcaacaa
cccttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaaataata
agagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaactggaagca
ttccctttgaaaactggcacaagacagggataccctctctcaccactcctattcaacata
gtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggtattcaatta
ggaaaagaggaagtcagattgtccctgtttgcagacgacatgattgtatatctagaaaac
cccattgtctcagcccgaaatctccttaagctgataagcaacttcagcaaagtctcagga
tacaaaatcaacgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagag
agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga
atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaag
gaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatc
aatattgtgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatc
aagctaccaatgcctttcttcacagaattggaaaaaactactttaaagttcatatggaac
caaaaaagagcccgcatcaccaagtcaatcctaagccaaaagaacaaagctggaggcatc
acactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg
taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacaccgcat
atctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattcc
ctattcaataaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggat
cccttccttacaccttatacaaaaatgaattcaagatggattaaagacttaaacgttaga
cctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatg
ggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaa
tgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaac
aggcaacctacaaaatga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_2|83_aa
MECVLEGFTCSTQSFASTDEFSDLSLVDQLFVGTVVKQLRVVKQGGDFFKKRFDKGISAQ
SSKDEEWSSEPYGELPTSSKDLR
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_2|252_bp
atggagtgcgtgttggagggctttacatgtagtacacagagcttcgcttccacggatgaa
ttcagtgatttgtccttggtcgatcagttgttcgttggcactgttgtgaagcagcttcgt
gttgtgaagcagggtggtgacttctttaagaagaggtttgataaaggtatatctgcacaa
tcaagcaaggatgaggagtggtcaagtgagccttatggagaactgcccacaagcagcaag
gatttgagataa
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_3|146_aa
MYPWHYPHLSGRHTLRYVAHSFAGTPAGSLLRIHPNGEPLRHLCSMPSIIIGGLAGKEKK
DSSIRIPFSTSTVSTIGRGWLWGCKAALSYYCAWLSLSMTQRRQYSTPRAWGMNKSSSGG
SNCLEMRLPVYRRFTASGEIEEALGG
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_3|441_bp
atgtatccctggcattatccgcatctctcaggcaggcacaccttgcgctatgtagcacat
agctttgcagggacccctgctgggtcacttctcaggatacaccccaacggggaaccctta
cgccatctgtgcagcatgccctccataattattggtggtctcgcagggaaggagaaaaaa
gacagctctatcagaatcccattctccactagcacagtcagcaccattggcagaggatgg
ctctggggctgcaaggctgctctgagctattattgtgcctggctctctctcagcatgact
caacgcagacaatatagcacaccgagagcatgggggatgaacaaatccagtagtggaggc
agcaactgtctggaaatgcgtttgcctgtgtatagacgatttactgcttcaggagaaata
gaagaagcacttgggggatga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_4|279_aa
METQKTLQKINESRNWFFKRINKIDRPLARLIKKKREKNQIDTIKNDKGDITTDPTEIQT
TIREYYKHLYANKLENLEEMDKFLDTCTLPRLNQEEVESLNRPITGSEIVAIINSLPTKK
SPGPDGFTAEFYQRYKEELLHPRFKGPQMHVCGHPSPHIPIIHTFSKQRLLGQPLGMRAL
KVNAANELQVGNAPNLPAPDFYGLGNASESQRVMASWYTSRVSGKGVRKGYQRLSDSTHL
IRVNEAPLKSIEEWKSFLHLCFGKRKRVKFSHQFAPELG
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_4|840_bp
atggagacgcaaaaaacccttcaaaaaattaatgaatccaggaactggttttttaaaagg
atcaacaaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatcaa
atagacacaataaaaaatgataaaggggatatcaccaccgatcccacagaaatacaaact
accatcagagaatactacaaacacctctatgcaaataaactagaaaatctagaagaaatg
gataaattcctcgacacatgcaccctcccaagactaaaccaggaagaagttgaatctctg
aatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaaccaaaaag
agtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactgctc
catccacgatttaaggggccgcaaatgcatgtgtgtggacatcccagcccgcacatcccc
attatccataccttcagcaaacagcgcctccttgggcagcccttggggatgagagctttg
aaagtcaatgcagctaatgaacttcaagtaggaaacgctccaaatcttcctgctccagat
ttctacggacttggtaatgctagtgagtcacagagggtgatggctagctggtatactagc
agagtgagtggaaaaggtgtcagaaaagggtatcagaggctcagcgactcaacccatctc
attagagtcaacgaggcacctcttaaaagtatagaagaatggaaatcctttcttcatttg
tgctttggaaaaagaaaaagagtaaaattctcccaccagtttgccccagaattgggatga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_5|174_aa
MGKHFMTKTPESMATKVKIDKWDLIKLKSVCTAKETIIRVNRKPKEWEKMFATYPSDKGL
ISRIYKELKQIYKRKTNNIIKNWTKDVNRHFSKEDIYAAIKCEKKLIITVTTRGYIRVVP
AAQLPSQEWVYNGPHSPAMSYASQSIFPSCPDSILRDFPVHGEKSPSANLGIQM
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_5|525_bp
atgggcaaacacttcatgactaaaacaccagaatcaatggcaacaaaagtcaaaattgac
aaatgggatctaattaaactaaagagcgtctgcacagcaaaagaaactatcatcagagtg
aacaggaaacctaaagaatgggagaaaatgtttgcaacctatccatctgacaaagggcta
atatccagaatctacaaagaacttaaacaaatttacaagagaaaaacaaacaacatcatc
aaaaattggacgaaggatgtgaacagacacttctcaaaagaagacatttatgcagccatc
aaatgtgaaaaaaagctcatcatcactgtaaccaccagaggctacatcagggttgttcct
gctgcacagctgccctctcaggaatgggtctataatgggccacacagtcctgcaatgtcc
tatgcgtcccagagcatcttcccctcctgcccagactcaattttgagagactttcctgtc
catggtgagaaatctccttcagccaatcttggcattcaaatgtag
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_6|706_aa
MKKKLLQLAQCLPKQPPSFVLESQAPGGVGTQGNLPGEPPPAVPLAAPAERRQERSRESM
AEEEAPKKSRAAGGGASWELCAGALSARLAEEGSGDAGGRRRPPVDPRRLARQLLLLLWL
LEAPLLLGVRAQAAGQGPGQGPGPGQQPPPPPQQQQSGQQYNGERGISVPDHGYCQPISI
PLCTDIAYNQTIMPNLLGHTNQEDAGLEVHQFYPLVKVQCSAELKFFLCSMYAPVCTVLE
QALPPCRSLCERARQGCEALMNKFGFQWPDTLKCEKFPVHGAGELCVGQNTSDKGTPTPS
LLPEFWTSNPQHGGGGHRGGFPGGAGASERGKFSCPRALKVPSYLNYHFLGEKDCGAPCE
PTKVYGLMYFGPEELRFSRTWIGIWSVLCCASTLFTVLTYLVDMRRFSYPERPIIFLSGC
YTAVAVAYIAGFLLEDRVVCNDKFAEDGARTVAQGTKKEGCTILFMMLYFFSMASSIWWV
ILSLTWFLAAGMKWGHEAIEANSQYFHLAAWAVPAIKTITILALGQVDGDVLSGVCFVGL
NNVDALRGFVLAPLFVYLFIGTSFLLAGFVSLFRIRTIMKHDGTKTEKLEKLMVRIGVFS
VLYTVPATIVIACYFYEQAFRDQWERSWVAQSCKSYAIPCPHLQAGGGAPPHPPMSPDFT
VFMIKYLMTLIVGITSGFWIWSGKTLNSWRKFYTRLTNSKQGETTV
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_6|2121_bp
atgaaaaaaaaacttctgcagctagctcagtgtctgcccaaacagccacccagttttgtg
cttgaatcccaggcccctggtggagtaggcacccaagggaatctcccgggggagccgccg
ccggccgtgcccctggcagccccagcggagcggcgccaagagaggagccgagaaagtatg
gctgaggaggaggcgcctaagaagtcccgggccgccggcggtggcgcgagctgggaactt
tgtgccggggcgctctcggcccggctggcggaggagggcagcggggacgccggtggccgc
cgccgcccgccagttgacccccggcgattggcgcgccagctgctgctgctgctttggctg
ctggaggctccgctgctgctgggggtccgggcccaggcggcgggccaggggccaggccag
gggcccgggccggggcagcaaccgccgccgccgcctcagcagcaacagagcgggcagcag
tacaacggcgagcggggcatctccgtcccggaccacggctattgccagcccatctccatc
ccgctgtgcacggacatcgcgtacaaccagaccatcatgcccaacctgctgggccacacg
aaccaggaggacgcgggcctggaggtgcaccagttctaccctctagtgaaagtgcagtgt
tccgctgagctcaagttcttcctgtgctccatgtacgcgcccgtgtgcaccgtgctagag
caggcgctgccgccctgccgctccctgtgcgagcgcgcgcgccagggctgcgaggcgctc
atgaacaagttcggcttccagtggccagacacgctcaagtgtgagaagttcccggtgcac
ggcgccggcgagctgtgcgtgggccagaacacgtccgacaagggcaccccgacgccctcg
ctgcttccagagttctggaccagcaaccctcagcacggcggcggagggcaccgtggcggc
ttcccggggggcgccggcgcgtcggagcgaggcaagttctcctgcccgcgcgccctcaag
gtgccctcctacctcaactaccacttcctgggggagaaggactgcggcgcaccttgtgag
ccgaccaaggtgtatgggctcatgtacttcgggcccgaggagctgcgcttctcgcgcacc
tggattggcatttggtcagtgctgtgctgcgcctccacgctcttcacggtgcttacgtac
ctggtggacatgcggcgcttcagctacccggagcggcccatcatcttcttgtccggctgt
tacacggccgtggccgtggcctacatcgccggcttcctcctggaagaccgagtggtgtgt
aatgacaagttcgccgaggacggggcacgcactgtggcgcagggcaccaagaaggagggc
tgcaccatcctcttcatgatgctctacttcttcagcatggccagctccatctggtgggtg
atcctgtcgctcacctggttcctggcggctggcatgaagtggggccacgaggccatcgaa
gccaactcacagtattttcacctggccgcctgggctgtgccggccatcaagaccatcacc
atcctggcgctgggccaggtggacggcgatgtgctgagcggagtgtgcttcgtggggctt
aacaacgtggacgcgctgcgtggcttcgtgctggcgcccctcttcgtgtacctgtttatc
ggcacgtcctttctgctggccggctttgtgtcgctcttccgcatccgcaccatcatgaag
cacgatggcaccaagaccgagaagctggagaagctcatggtgcgcattggcgtcttcagc
gtgctgtacactgtgccagccaccatcgtcatcgcctgctacttctacgagcaggccttc
cgggaccagtgggaacgcagctgggtggcccagagctgcaagagctacgctatcccctgc
cctcacctccaggcgggcggaggcgccccgccgcacccgcccatgagcccggacttcacg
gtcttcatgattaagtaccttatgacgctgatcgtgggcatcacgtcgggcttctggatc
tggtccggcaagaccctcaactcctggaggaagttctacacgaggctcaccaacagcaaa
caaggggagactacagtctga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_7|65_aa
MGLKDPTGASKFGPKSLENGEPSDASHKKSDRPAPPLPSTMSKVFLRCGLEVDVGTMLLM
QPAET
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_7|198_bp
atggggctaaaggacccaacaggggccagcaagtttggacctaaatctttagaaaatggg
gagccatctgatgcttctcataagaagagtgacaggcctgcaccccctttgccttccacc
atgagtaaagtcttcctgaggtgtggactagaagtagatgttggcaccatgcttcttatg
cagcctgcagaaacgtga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_8|165_aa
MKADLSDHAYLKEWLWWAGLSRGANEVANFTASGPAALAPPLGALIVLVKPNSPAVVWPW
CLGTPDLEPNLVTRKAHDSLQGFRDPSLGGQQRESAWMKERKEGNCAGSLQIVDSGSKIC
SGSVQHLCGIYEGTPAAPLLPLLALGHLLLLSKPVSPTCTDDAVT
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_8|498_bp
atgaaagcagatctaagtgaccatgcatatctaaaagaatggttgtggtgggccggactc
tcaaggggagccaatgaggttgccaactttactgcctctgggccagctgccttagcgcct
ccattaggagctctcattgtccttgtaaagcccaactcccctgctgttgtgtggccctgg
tgtttggggacccctgatctagagcccaatcttgtaactaggaaggcacatgactctctc
caagggttcagggacccctcccttggtggacagcaaagagaaagtgcctggatgaaggag
aggaaagagggcaactgtgctgggagccttcagattgtggacagtggaagtaagatttgc
tctggcagtgtgcagcatctgtgcggcatctatgaaggcactcctgcagcccctctgctg
cccctgttggcactgggccacctccttttactgtcaaagccagtcagccccacctgcact
gatgacgcagtaacatga
>gi568815591f:91164881_91366821|GENSCAN_predicted_peptide_9|164_aa
MGAALEGQFLSSGHSFPSTLPSVLGDIPEGGVGQQEAPAGLTSCSQKRAVASGLERFQIP
YNLSYGQTTPGESGKLTYLNLKVALFPLHSGPGNGLWSIILSTAFKKRSSRKCVSVGNLW
LAYSNETANQLKQLPQKCITLLLDPKQLSQGPENTGNFRWNLHV
>gi568815591f:91164881_91366821|GENSCAN_predicted_CDS_9|495_bp
atgggtgcagctttggagggacagttcttgagctctggtcacagctttccatccacactg
cccagtgtacttggagacatccctgaaggaggtgtaggccaacaggaagcccctgctggc
ttgacctcatgctcccaaaaacgtgcagtagcatcaggactggaaagatttcaaattcct
tataacctgtcgtatggccaaaccacccctggagaaagtggcaagctgacatatttaaat
ttaaaagttgccttatttcctctgcattctggtcctggaaatggactgtggtcaattatc
ctttctacagcctttaaaaaacgaagcagtaggaagtgcgtcagtgttggaaacttgtgg
cttgcttattcaaatgaaacagccaaccagcttaagcagctccctcaaaaatgcatcacc
ttgctgctagaccccaaacagctctcccaagggcctgaaaatactgggaacttccgctgg
aatttacatgtatga