GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:36:37 Sequence gi568815591f:140307647_140526147 : 218501 bp : 46.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2549 2572 24 1 0 114 89 9 0.311 1.82 1.02 Intr + 7551 7687 137 2 2 65 119 47 0.417 4.87 1.03 Term + 8864 8897 34 2 1 67 48 49 0.260 -4.14 1.04 PlyA + 9839 9844 6 1.05 2.16 PlyA - 15178 15173 6 1.05 2.15 Term - 15825 15788 38 1 2 85 32 59 0.097 -2.50 2.14 Intr - 16546 16402 145 1 1 69 57 90 0.117 3.86 2.13 Intr - 22769 22627 143 0 2 87 53 118 0.915 8.27 2.12 Intr - 29703 29634 70 0 1 93 25 9 0.009 -6.15 2.11 Intr - 35917 35766 152 2 2 70 94 124 0.990 11.08 2.10 Intr - 37617 37570 48 1 0 117 96 61 0.997 8.45 2.09 Intr - 38324 38223 102 0 0 19 72 89 0.575 0.55 2.08 Intr - 41121 40980 142 0 1 61 57 152 0.995 9.33 2.07 Intr - 43805 43627 179 0 2 86 107 73 0.993 8.64 2.06 Intr - 44500 44416 85 1 1 79 73 93 0.990 6.29 2.05 Intr - 48118 48022 97 0 1 91 92 -31 0.853 -2.39 2.04 Intr - 51139 50994 146 1 2 97 67 180 0.959 15.88 2.03 Intr - 62036 61944 93 2 0 93 103 38 0.972 5.96 2.02 Intr - 72744 72636 109 2 1 94 99 54 0.987 7.39 2.01 Init - 74880 74792 89 0 2 87 115 -45 0.644 -1.67 2.00 Prom - 75524 75485 40 -5.06 3.00 Prom + 87759 87798 40 -2.96 3.01 Init + 89156 89181 26 1 2 63 80 13 0.282 -3.79 3.02 Intr + 90659 90869 211 2 1 45 54 137 0.103 4.92 3.03 Intr + 96422 96571 150 1 0 67 115 50 0.105 5.96 3.04 Intr + 99989 100201 213 1 0 48 77 159 0.246 9.71 3.05 Intr + 104228 104411 184 1 1 115 52 306 0.697 29.06 3.06 Term + 118236 118504 269 1 2 86 44 373 0.553 28.36 3.07 PlyA + 123053 123058 6 1.05 4.02 PlyA - 123513 123508 6 -0.45 4.01 Sngl - 128009 127668 342 2 0 62 54 211 0.817 11.30 4.00 Prom - 132268 132229 40 -4.66 5.11 PlyA - 133172 133167 6 1.05 5.10 Term - 139187 139075 113 0 2 51 49 167 0.411 7.82 5.09 Intr - 147083 146957 127 2 1 97 75 183 0.203 18.15 5.08 Intr - 147587 147449 139 1 1 68 60 70 0.989 2.67 5.07 Intr - 148254 148144 111 2 0 54 64 104 0.894 4.09 5.06 Intr - 149220 149006 215 0 2 82 50 315 0.996 24.61 5.05 Intr - 151587 151361 227 1 2 64 106 218 0.805 18.90 5.04 Intr - 152290 152061 230 0 2 80 86 103 0.930 6.81 5.03 Intr - 164365 164237 129 0 0 106 110 61 0.994 9.91 5.02 Intr - 171340 171281 60 0 0 70 62 78 0.614 1.25 5.01 Init - 171698 171514 185 2 2 79 91 162 0.964 12.29 5.00 Prom - 173301 173262 40 -7.66 6.00 Prom + 180951 180990 40 -1.66 6.01 Init + 188092 188211 120 0 0 83 81 105 0.963 8.88 6.02 Term + 199439 199510 72 1 0 116 49 41 0.625 0.91 6.03 PlyA + 203412 203417 6 1.05 7.06 PlyA - 204238 204233 6 1.05 7.05 Term - 209867 209767 101 0 2 45 50 67 0.222 -3.11 7.04 Intr - 212072 211986 87 0 0 90 82 98 0.998 9.34 7.03 Intr - 214454 214209 246 0 0 103 100 427 0.992 42.83 7.02 Intr - 215778 215661 118 0 1 107 127 158 0.989 21.64 7.01 Intr - 218146 218105 42 1 0 110 109 67 0.986 9.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_1|64_aa KCRFTETQHACSAATGKGKQERSEAVHSCTPTQSHGFLDKSPNLDVCSVLARCCSWKSLI GNRN >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_1|195_bp aaatgcaggttcacagaaacacagcatgcgtgctctgcagcaacagggaagggaaaacag gaacgctctgaagcagtacattcctgcacacccacacagagccatggcttcctagacaag tccccgaacctagatgtctgcagtgtcctggcacgctgctgttcttggaagtcactgatt ggaaaccggaactga >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_2|545_aa MAWPNVFQRGSLLSQFSHHHVVVFLLTFFSYSLLHASRKTFSNVKVSISEQWTPSAFNTS VELPVEIWSSNHLFPSAEKATLFLGTLDTIFLFSYAVVFVFGALTEWLRFYNKWLYCCLW IVNGLLQSTGWPCVVAVMGNWFGKAGRGVVFGLWSACASVGNILGACLASSVLQYGYEYA FLVTASVQFAGGIVIFFGLLVSPEEIGLSGIEAEENFEEDSHRPLINGGENEDEYEPNYS IQDDSSVAQVKAISFYQACCLPGVIPYSLAYACLKLVNYSFFFWLPFYLSNNFGWKEAEA DKLSIWYDVGGIIGGTLQGFISDVLQKRAPVLALSLLLAVGSLIGYSRSPNDKSINALLM TVTGFFIGGPSNMISSAISADLGRQELIQRSSEALATVTGIVDGSGSIGAAVGQYLVSLI RDKLGWMWVFYFFILMVRRCTPAINGRRLSALPRFSDYVGFECILLSSGSWQPRQLSLRA QRQRTNSLVNGTFAVLFRCLDTESDSLTCWIKCEILHESSSLAIKSETDQRPVIYSQSYV TITTI >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_2|1638_bp atggcctggccaaatgtttttcaaagagggtctctgctgtcccagttcagccatcatcat gttgtagtgttcctgctcactttcttcagttattcgttgctccatgcttcacgaaaaaca tttagcaatgtcaaagtcagtatctctgagcagtggaccccaagtgcttttaacacgtca gttgagctgcctgtggagatctggagcagcaaccatttgttccccagtgcagagaaagcg actcttttcctcggcacactggataccattttcctcttctcctatgctgtggtgtttgtc tttggtgcgctcacagaatggctgcgtttctacaacaaatggctgtactgctgcctgtgg attgtgaacggcctgctgcagtccactggttggccctgtgtggttgctgttatgggcaac tggtttgggaaagccggacgaggagttgtttttggtctctggagtgcctgtgcttcggtg ggcaacattttgggagcgtgcctagcttcttctgttcttcagtatggttatgagtatgcc tttctggtgacggcgtctgtgcagtttgctggtgggatcgttatcttctttggactcctg gtgtcaccagaagaaattggtctctcgggtattgaggcagaagaaaactttgaagaagac tcacacaggccattaattaatggtggtgaaaatgaagacgaatatgagccgaattattca atccaagatgatagttctgttgcccaagtcaaggcgataagcttctaccaggcatgttgc cttcctggagtcataccgtactcactggcctacgcctgcttgaagttagtgaattactcc ttcttcttctggctccccttttatctgagtaacaacttcggctggaaggaggcggaagcc gacaagctgtccatttggtacgacgttggagggatcataggtggaactttgcaaggcttc atctctgatgtactacagaagagagcgccggttcttgccctgagtctgcttctggcagtt gggtccctcatcgggtatagtcgttctccaaatgataagtccatcaatgcccttctgatg actgttacaggattttttattggtggaccttctaatatgattagttctgctatttctgcg gacttgggtcgccaggagctcatccaaaggagcagtgaagctttggccactgtcacagga attgtggatggttcggggagcattggagctgcagtgggccagtatttagtgtctctgatc cgggacaagctaggatggatgtgggttttctactttttcattctcatggtaagacgctgc acgcctgcaattaatggccgcagactgtcggcgctgcccaggttctcggactacgtgggt tttgagtgcattctcctctcctctggctcttggcagcctcgccagctctccctgagggct cagcggcagcgaacaaatagtcttgttaacgggacttttgcggtgttattccgctgctta gacaccgagagtgactctcttacttgctggattaagtgtgagatcctgcacgaatcctca tctctagctataaaatctgagacagaccagagacctgttatttattcgcagagttatgtg accatcaccactatctaa >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_3|350_aa MVSFKRQARGAATYISPPPSRPGKPLGLDPPPPQQVFGPTHHRLRTRDPQWSPLSSPGSA RRVSLAPLPASPAASSPLADAKPDTWGQNYTSHQAARQEEEGGTGGLKSKPRRKRKHYLL TSRGERARQVARTMHFSSSARAADENFDYLFKIILIGDSNVGKTCVVQHFKSGVYTETQQ NTIGVDFTVRSLDIDGKKVKMQVWDTAGQERFRTITQSYYRSAHAAIIAYDLTRRSTFES IPHWIHEIEKYGAANVVIMLIGNKCDLWEKRHVLFEDACTLAEKYGLLAVLETSAKESKN IEEVFVLMAKELIARNSLHLYGESALNGLPLDSSPVLMAQGPSEKTHCTC >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_3|1053_bp atggtgtcattcaagagacaagccaggggggccgccacctacatctccccaccgccctcc cgacccgggaagcccctcggattggacccgcccccgccccagcaggtcttcggccccacg catcaccgcttgcgcacccgagatccgcagtggtcgccgctctccagccccggctccgct cgccgggtcagcttggctccgctgcccgcctcccccgccgccagctcacccctggcggac gccaaacccgacacctggggccagaactacacttcccaccaagcagcgaggcaggaggaa gaagggggcactggggggctgaaaagcaaacccagaagaaagcgaaaacattacctgctg accagcagaggtgagagagcgagacaggtggcaagaaccatgcacttctccagctcagcc agggcagcagatgagaactttgactatttgttcaagattatcctcattggggattccaat gtggggaagacgtgtgtggtgcagcatttcaagtctggagtctacactgagacacagcag aacacgattggagtggactttaccgtgcgttcccttgatattgacggcaaaaaagtgaag atgcaggtgtgggacacagctggccaggagcgcttccgcaccatcacccaaagctactac cgcagtgcccacgcagccatcatcgcctatgacctcacccggcggtccacgttcgagtcc atccctcactggattcatgagatagagaaatatggagctgcaaatgtggtcattatgctg attggaaataaatgtgacctctgggaaaagcggcacgtcctgttcgaggatgcctgcaca ctggctgagaagtacggcctcctggccgttttggagacatctgccaaggagtcaaagaac atagaagaagtcttcgtgctcatggccaaggagctgatcgcgcgcaacagcctgcaccta tatggggagagtgccctgaacggcctccccctggactccagccccgttcttatggcccag ggtccaagtgaaaagacccactgcacttgctaa >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_4|113_aa MGHSGGSWGTGSSALRGGFTLVGKDAGAQRVLAGHREGPRGPAGEALLPKACEGSSVVPR PGTIQGDFSAHISRNFFQASNSVEGARRWIPLWLPSRDLVSWARRQHSRSHPA >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_4|342_bp atggggcacagcggcggctcgtgggggacgggatccagcgctttgagaggtggcttcacg ctggtggggaaggacgcaggcgcacagcgcgtccttgccgggcaccgcgagggcccgcga gggcccgcgggggaagccctgctacccaaggcctgcgaagggtccagtgtggtcccgcgc cccgggaccatacagggagatttcagcgcccacatcagcaggaacttcttccaggcaagc aactccgtggaaggggctcggagatggatcccgctgtggctcccgagcagggacctggtg agctgggcacggcgccagcacagccgcagccacccagcctga >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_5|511_aa MAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAGGGGGGSDGSGGGWTKQVT CSHCVNPRWAAEPSALSAEAWRYFMHGVCKEGDNCRYSHDLSDSPYSVVCKYFQRGYCIY GDRCRYEHSKPLKQEEATATELTTKSSLAASSSLSSIVGPLVEMNTGEAESRNSNFATVG AGSEDWVNAIEFVPGQPYCGRTAPSCTEAPLQGSVTKEESEKEQTAVETKKQLCPYAAVG ECRYGENCVYLHGDSCDMCGLQVLHPMDAAQRSQHIKSCIEAHEKDMELSFAVQRSKDMV CGICMEVVYEKANPSERRFGILSNCNHTYCLKCIRKWRSAKQFESKIIKSCPECRITSNF VIPSEYWVEEKEEKQKLILKYKEAMSNKACRYFDEGRGSCPFGGNCFYKHAYPDGRREEP QRQKVGTSSRYRAQRRNHFWELIEERENSNPFDNDEEEVVTFELGEMLLMLLAAAFISTS FLSMDGQCFIAGVLNPRTVYKYQYEYQSVAC >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_5|1536_bp atggcggaggctgcaactcccggaacaacagccacaacatcaggagcaggagcggcagcg gcgacggcggcagcagcctcccccaccccgatccccacagtcaccgccccgtccctgggg gcgggcggagggggcggcggcagcgacggcagcggcggcggctggactaaacaggtcacc tgcagtcactgcgtcaacccccgctgggccgcggagccctcagcgctgtccgcggaggcc tggcggtattttatgcatggggtttgtaaggaaggagacaactgtcgctactcgcatgac ctctctgacagtccgtatagtgtagtgtgcaagtattttcagcgagggtactgtatttat ggagaccgctgcagatatgaacatagcaaaccattgaaacaggaagaagcaactgctaca gagctaactacaaagtcatcccttgctgcttcctcaagtctctcatcgatagttggacca cttgttgaaatgaatacaggcgaagctgagtcaagaaattcaaactttgcaactgtagga gcaggttcagaggactgggtgaatgctattgagtttgttcctgggcaaccctactgtggc cgtactgcgccttcctgcactgaagcacccctgcagggctcagtgaccaaggaagaatca gagaaagagcaaaccgccgtggagacaaagaagcagctgtgcccctatgctgcagtggga gagtgccgatacggggagaactgtgtgtatctccacggagattcttgtgacatgtgtggg ctgcaggtcctgcatccaatggatgctgcccagagatcgcagcatatcaaatcgtgcatt gaggcccatgagaaggacatggagctctcatttgccgtgcagcgcagcaaggacatggtg tgtgggatctgcatggaggtggtctatgagaaagccaaccccagtgagcgccgcttcggg atcctctccaactgcaaccacacctactgtctcaagtgcattcgcaagtggaggagtgct aagcaatttgagagcaagatcataaagtcctgcccagaatgccggatcacatctaacttt gtcattccaagtgagtactgggtggaggagaaagaagagaagcagaaactcattctgaaa tacaaggaggcaatgagcaacaaggcgtgcaggtattttgatgaaggacgtgggagctgc ccatttggagggaactgtttttacaagcatgcgtaccctgatggccgtagagaggagcca cagagacagaaagtgggaacatcaagcagataccgggcccaacgaaggaaccacttctgg gaactcattgaggaaagagagaacagcaacccctttgacaacgatgaagaagaggttgtc acctttgagctgggcgagatgttgcttatgcttttggctgcagcatttatcagtacttca tttctttcaatggatggacagtgttttattgcaggggtcctcaaccctcggaccgtgtac aagtaccagtacgagtaccaatccgtggcctgttag >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_6|63_aa MREERTGQREQQVWPGGWALLGVIQDLLTPLWLDPGSQAKAPPCFAQGSTSPLSPESDNV NIS >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_6|192_bp atgagggaagagcgcacaggccagagggaacagcaggtatggcccggaggctgggccctg cttggtgtgatccaggacctgctgacgccactgtggctggacccagggagccaggccaag gcacccccttgttttgcacagggctccacctcccccctgtctcctgaatctgacaatgtc aacatcagctga >gi568815591f:140307647_140526147|GENSCAN_predicted_peptide_7|197_aa VLVVDLVNSRFLRQMDDEDSILPRKLQVALEHILEQRNELACEQDEGPLDGRHGPESSPL NEVVSEAFVRFFVEIVGHYSLFLTSGEREERTLQREAFRKAVSSKSLRHFLEVFMETQMF RGFIQERELRRQDAKGLFEVRAQEYLETLPSGEHSGVNKFLKGLALTSDPHRNTKGRGAD LNNGPEGPYCLAGVLNL >gi568815591f:140307647_140526147|GENSCAN_predicted_CDS_7|594_bp gtccttgtggttgacctcgtcaacagccggttcctcagacagatggacgatgaggactcc atcctgccccggaagcttcaggtggccctggaacacattctggaacagaggaacgagctg gcttgtgagcaggacgaagggcccctagacggcaggcacggtccagagtccagccccttg aacgaggtggtgtctgaagcctttgtccgcttcttcgtggagattgtgggacactactct ttgttcctgacgtcgggcgagcgtgaggagagaaccctgcagcgggaggccttccgcaaa gctgtctcctccaagagcctccgccacttcctggaggtcttcatggagactcagatgttt cggggcttcatccaggagcgggagctgcgccggcaggatgccaaaggtctgtttgaggtc cgagcccaagagtatctggaaacactccccagtggagagcacagcggtgtcaataagttc ctgaagggactagctctcaccagcgaccctcacagaaacacaaaagggaggggcgccgac ctcaacaatggcccagaggggccatactgcctggcaggggttctcaacctttag