GENSCAN 1.0	Date run: 19-Feb-121	Time: 20:42:43

Sequence gi568815584f:75337895_75569472 : 231578 bp : 45.72% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  10526  10684  159  1  0   39   62   117 0.096   4.02
 1.02 Intr +  17195  17294  100  1  1   86   78    49 0.178   3.38
 1.03 Intr +  17512  17537   26  2  2  107   78    14 0.038   0.04
 1.04 Intr +  26237  26331   95  1  2   89   77    19 0.020  -0.34
 1.05 Intr +  37553  37753  201  2  0   82   36   346 0.994  27.20
 1.06 Intr +  90679  90712   34  1  1  131   84    19 0.681   4.03
 1.07 Intr +  94224  94372  149  2  2  109   24   183 0.252  13.03
 1.08 Intr +  98376  98515  140  0  2    5  103    37 0.052  -2.99
 1.09 Intr + 100004 100222  219  0  0  109   59   232 0.337  20.67
 1.10 Intr + 103511 103676  166  0  1   52  113    85 0.505   6.42
 1.11 Intr + 116130 116246  117  0  0   31   68   118 0.351   3.68
 1.12 Intr + 119893 119980   88  1  1   66   60    31 0.109  -1.93
 1.13 Intr + 123532 123636  105  0  0   96   88   183 0.944  19.51
 1.14 Term + 131396 131581  186  1  0   71   47   424 0.820  34.09
 1.15 PlyA + 134784 134789    6                               1.05

 2.00 Prom + 135916 135955   40                              -0.46
 2.01 Init + 151808 151924  117  1  0   83   -7   135 0.114   3.70
 2.02 Intr + 157227 157346  120  2  0   62   94    16 0.063   0.29
 2.03 Intr + 176120 176228  109  1  1   72   15    84 0.186  -0.64
 2.04 Intr + 177255 177523  269  1  2   18   94   132 0.259   4.05
 2.05 Intr + 187190 187294  105  1  0  114  119   114 0.996  17.51
 2.06 Term + 208568 208777  210  1  0   83   49   404 0.898  33.29
 2.07 PlyA + 209079 209084    6                               1.05

 3.06 PlyA - 209093 209088    6                              -6.84
 3.05 Term - 209408 209126  283  1  1   -9   48   257 0.188   6.90
 3.04 Intr - 211009 210717  293  1  2   71   85    50 0.074  -1.07
 3.03 Intr - 212944 212736  209  2  2   53   51    92 0.057   0.80
 3.02 Intr - 221396 221255  142  2  1  -40    4   337 0.364  12.43
 3.01 Init - 224062 223934  129  1  0   78   89    72 0.826   6.45
 3.00 Prom - 226290 226251   40                              -3.96

 4.02 PlyA - 226646 226641    6                               1.05
 4.01 Term - 231503 231404  100  1  1   98   48    88 0.475   3.40

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 184789 184851   63  0  0   83   80    45 0.828   4.35
S.002 Term + 190035 190094   60  2  0   82   44    92 0.967   2.00

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:75337895_75569472|GENSCAN_predicted_peptide_1|594_aa
MFDVVQTADPTHYFIHGAWLETEEARHHHHKSSDLITKYLMVLEISTCCLGLLFRGLDLT
VFILSITKPVSLCIQEATKCLEIEEGEYYEFQHGETGFLHIMESMVAGSPRLSPPSRKDP
VFSRKLCTAIATNAITITITITTITTTITITTTTINTYTITTISTIISTIIITMPPPPPL
PLPMPEWKRSGGAREVLSSGGGEGGSWGQPSEEGLLLLAFTLSLGAFPIILLLAFWISST
FGSRVTYVIQIPRERPLLQETESPRTMWPSEYINCPTEKYCPMDIRSSLFCQIHLSPWEE
TGWPATPPAMMPGQIPDPSVTTGSLPGLGPLTGLPSSALTVEELKYADIRNLGAMIAPLH
FLEVKLGKRPQPVKIPLFKPYLLSIYYVLSTAPGTGNRATDKAVKALHLSAPYIAVTASK
SKQINTRYISYVFENAYEKSEFKITSGLLFVTVEMPRHLQTGPEKHGNGQCASREGRLPA
LLAYYKQREEATPAEKAPLDEEEERRKRRREKNKVAAARCRNKKKERTEFLQRESERLEL
MNAELKTQIEELKQERQQLILMLNRHRPTCIVRTDSVKTPESEGNPLLEQLEKK

>gi568815584f:75337895_75569472|GENSCAN_predicted_CDS_1|1785_bp
atgtttgatgttgtacagacagcagaccctacccactacttcattcatggtgcctggttg
gagactgaggaagccaggcaccaccatcataaatcctcagacctcatcacgaaatacctg
atggttctggagataagtacctgctgcctgggcctcttgtttcggggtctggacctcact
gtgtttatactcagcatcacgaagcccgtgagcttgtgtatacaggaggccaccaaatgt
ttggagatagaggagggagagtattatgagtttcagcatggtgagacaggctttctccac
atcatggaatcaatggttgctggcagtcccagactctcacctcccagcagaaaggaccct
gttttttcccgcaaactttgcactgccattgccacaaacgccatcactatcaccatcacc
atcacaaccattactactaccatcaccatcaccaccaccaccatcaacacctataccata
actaccatctccaccatcatcagcaccattatcatcaccatgccaccaccaccaccactg
ccattgccaatgcctgagtggaagaggagtggaggagcgagggaggtgctctcctcgggc
ggcggggaaggcgggagctggggacagccatctgaagagggccttttgctgctcgccttc
actctcagtcttggggccttccccatcatcttgcttctcgccttctggatctcctcgact
tttggaagccgcgtgacctacgtcattcagattccaagagagaggccactcctgcaggag
acagagagcccaagaaccatgtggccatctgagtacattaactgcccaacggagaagtat
tgtccgatggacatcagatcttccctcttctgccaaatacatctctctccatgggaagag
acaggctggcctgccactcctcctgctatgatgcctgggcagatcccggacccttcggtg
accacaggctccctgccagggcttggccccctgaccgggctccccagctcggccctgact
gtggaggagctgaaatacgctgacatccgcaacctcggggccatgattgcacccttgcac
ttcctggaggtgaaactgggcaagaggccccagcccgtgaaaatccctttgttcaagcca
tatttgttaagcatctactatgtgctgagcactgcaccaggcactgggaacagagctaca
gataaagcagtcaaagccttacatctctcggcaccgtatattgcagtgacggcatccaaa
agtaagcaaattaatacacgatatataagctatgtttttgaaaacgcttatgaaaagtca
gaattcaaaatcacatctgggctgctgtttgtcacagtggaaatgccgcgccacctgcag
acggggcctgagaaacacggaaatggccagtgtgcttctagggaaggcaggctcccagct
ttgctggcctactacaaacagagagaagaggccaccccggcagagaaagctcccctagat
gaggaagaggagcgaaggaaaaggcgccgggagaagaacaaagtcgcagcagcccgatgc
cggaacaagaagaaggagcgcacggagtttctgcagcgggaatccgagcggctggaactc
atgaacgcagagctgaagacccagattgaggagctgaagcaggagcggcagcagctcatc
ctgatgctgaaccgacaccgccccacctgcatcgtccggaccgacagtgtcaagaccccc
gagtcagaaggcaacccactgctcgagcagctcgagaagaagtga

>gi568815584f:75337895_75569472|GENSCAN_predicted_peptide_2|309_aa
MQRMVFLTESEEAVCSVPLQNAKKSWLVLEPSTPEHLKTDMAILMMAEAAINQVHPSCKF
MESSKQLHDTYTGNIFIFQVQEEEALQEAALMPTPAAASGWLDVLSAWPYPQVEVEGKQP
TVAPKDPDRQRPRAAAEGTGSHFLHFTSSCLACIDDLNWGVNPKRSSASSRRMMRTDSGL
VNLRERESVGANEAKGARLLSAPRQDSSDDVRRVQRREKNRIAAQKSRQRQTQKADTLHL
ESEDLEKQNAALRKEIKQLTEELKYFTSVLNSHEPLCSVLAASTPSPPEVVYSAHAFHQP
HVSSPRFQP

>gi568815584f:75337895_75569472|GENSCAN_predicted_CDS_2|930_bp
atgcaaaggatggtgtttctcacggagtcagaggaggccgtctgctccgtgcccttacag
aatgccaagaagtcatggctggtgctggagccctcaaccccagaacaccttaagactgat
atggccatattaatgatggcagaggctgccattaaccaagtgcacccaagctgtaaattc
atggaatcctccaaacagctccatgatacatatactggtaatatcttcatttttcaggtg
caggaagaggaggccctgcaggaagcagctctgatgcccactccggctgctgccagtggc
tggctggatgtgctatcagcatggccatacccacaggttgaagtagagggaaagcaaccc
actgtggccccgaaagacccagaccgccagcggccccgggcagccgccgagggcaccggc
tcacacttcctgcatttcacaagctcttgcttggcatgcattgatgacttgaactggggg
gttaacccaaaaagatcaagtgcgagctcaaggcggatgatgagaacagattcggggctg
gttaacctcagagagcgggagtccgttggtgctaatgaggccaagggagcccggctcctt
tctgcccccaggcaggactcatctgatgatgtgagaagagttcagaggagggagaaaaat
cgtattgccgcccagaagagccgacagaggcagacacagaaggccgacaccctgcacctg
gagagcgaagacctggagaaacagaacgcggctctacgcaaggagatcaagcagctcaca
gaggaactgaagtacttcacgtcggtgctgaacagccacgagcccctgtgctcggtgctg
gccgccagcacgccctcgccccccgaggtggtgtacagcgcccacgcattccaccaacct
catgtcagctccccgcgcttccagccctga

>gi568815584f:75337895_75569472|GENSCAN_predicted_peptide_3|351_aa
MEYYAAIKRNKIMSFAATWMELETITLSELTQKWKNKYCIFSLKEKEEEEEEKGEEEEEK
GEEEEEEKGEEEEDEEEEEEKKEKEKKWILGAEASSLDTSSRVWTRPLVCASLDPRDGSC
GGECGRNLVFQAGMSAGMNWRLLWYRKNQGRARKRQAAVRRGGKELDRALATKIQVLKEI
KKAPPHDTSAQSTWRLCTERGACFATFRWETKGYNARFCAGRNASTQGTSSGLSSQALNY
GAQPSVHILGSHGHQLLCLSQDLRNSVKKKEEEEEKGEKRRRRRKEEEEEEKEKEKEKET
PREGIESFPSTSINHGKEELLVESLELFEGTRVQKTKKVPTSQNLECQGTG

>gi568815584f:75337895_75569472|GENSCAN_predicted_CDS_3|1056_bp
atggaatactatgctgccataaaaaggaacaagatcatgtcctttgcagcaacatggatg
gagctggagaccattaccctgagcgaactaacacagaaatggaaaaacaaatactgcata
ttctcacttaaggagaaagaggaggaggaggaggagaaaggggaggaggaggaggagaaa
ggggaggaggaggaggaggagaaaggggaggaggaggaggacgaagaggaggaggaggag
aagaaggagaaggagaagaaatggattctgggagctgaagcctcttccctggatacttct
tcccgagtgtggaccagaccactggtatgtgcatctctggacccaagggatgggtcctgt
gggggtgaatgtggacggaatttggtcttccaggctggcatgtccgccggcatgaactgg
aggctgctgtggtacaggaagaaccaggggagggcaagaaagcggcaagctgcagtccgg
aggggtgggaaagagctagacagagcgttagctaccaaaatccaggttctgaaggaaatt
aagaaagctcctcctcatgacacatctgcccagtccacatggaggctctgcactgaaagg
ggagcctgttttgccacgtttcgttgggagaccaaaggttacaatgcaagattctgtgct
gggagaaatgccagcactcagggaacgtcttctgggctgtcttcccaggctctgaattac
ggggctcagcccagtgtccacatcctaggctcccatgggcaccagcttctctgcctgagc
caagatctgagaaattctgtcaagaagaaggaggaggaggaggagaagggggagaagagg
aggaggaggagaaaggaggaggaggaggaggagaaggagaaggagaaggagaaggagaca
cctagagaaggaattgaaagcttcccaagtacatcaataaaccatggaaaagaggagctg
ttagtggagtccctagagctctttgagggtactagagtccaaaaaacaaagaaggtccca
acatcgcagaatttagagtgtcaagggacagggtga

>gi568815584f:75337895_75569472|GENSCAN_predicted_peptide_4|33_aa
XDLCHHKIKRPENLLPIQSVNSHENPIIKFYGK

>gi568815584f:75337895_75569472|GENSCAN_predicted_CDS_4|102_bp
nntgacctctgtcaccacaagatcaagaggccagagaacctacttccaattcagagtgtg
aattcccatgaaaatccaatcatcaaattctatggaaagtaa