GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:02:46

Sequence gi568815578f:59200813_59424456 : 223644 bp : 46.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   6093   6244  152  0  2  121   95     2 0.785   4.01
 1.02 Intr +   7456   7654  199  2  1  109   49    97 0.722   6.31
 1.03 Intr +   8277   8380  104  0  2   51   56    60 0.281  -1.08
 1.04 Intr +   9708   9799   92  1  2  129   44    52 0.495   4.61
 1.05 Intr +  22441  22531   91  0  1   93   36    47 0.001  -0.43
 1.06 Intr +  28168  28225   58  2  1   89   94     9 0.000  -0.46
 1.07 Intr +  48703  48825  123  1  0   96   15   139 0.811   7.10
 1.08 Intr +  51573  51607   35  0  2   48   80    54 0.846  -1.53
 1.09 Intr +  52166  52326  161  0  2  130   86    30 0.959   6.71
 1.10 Term +  53086  53931  846  0  0  108   48   276 0.939  18.48
 1.11 PlyA +  55198  55203    6                               1.05

 2.05 PlyA -  56842  56837    6                               1.05
 2.04 Term -  77998  77721  278  2  2   76   41   237 0.503  13.42
 2.03 Intr -  78692  78599   94  2  1   63   16   116 0.552   1.44
 2.02 Intr -  80771  80701   71  0  2   79   82    31 0.089   0.50
 2.01 Init -  87455  87380   76  2  1   87   84    30 0.101   3.78
 2.00 Prom -  97382  97343   40                              -5.96

 3.00 Prom +  99019  99058   40                              -9.55
 3.01 Init + 100001 100052   52  1  1   51  115    76 0.880   5.96
 3.02 Intr + 100664 100910  247  0  1   16   91   258 0.513  15.42
 3.03 Term + 101068 101188  121  1  1   40   47   122 0.510   1.25
 3.04 PlyA + 103446 103451    6                              -0.45

 4.00 Prom + 104788 104827   40                              -3.36
 4.01 Init + 105537 105623   87  2  0   80   53    61 0.065   2.74
 4.02 Intr + 112409 112518  110  1  2  107   29    56 0.106   0.68
 4.03 Intr + 120205 120381  177  1  0  120   80   148 0.989  16.23
 4.04 Term + 126638 126824  187  2  1   82   49   138 0.731   6.26
 4.05 PlyA + 132820 132825    6                               1.05

 5.14 PlyA - 132943 132938    6                               1.05
 5.13 Term - 141776 141568  209  0  2   77   43   157 0.256   7.60
 5.12 Intr - 156219 156082  138  1  0   95   41   127 0.001   9.14
 5.11 Intr - 158986 158870  117  2  0   46  103    35 0.000   1.24
 5.10 Intr - 166693 166654   40  1  1  123   50    13 0.001  -1.20
 5.09 Intr - 168402 168222  181  2  1   18   77   106 0.007   2.37
 5.08 Intr - 179256 179081  176  0  2  116   60    78 0.479   6.54
 5.07 Intr - 187790 187698   93  2  0   99   91   113 0.682  12.86
 5.06 Intr - 189062 189032   31  1  1   82   85    28 0.042   0.03
 5.05 Intr - 189725 189614  112  0  1   62   64    -7 0.005  -6.26
 5.04 Intr - 193337 193254   84  0  0   81   94    33 0.011   2.99
 5.03 Intr - 202662 202540  123  1  0   45   52   110 0.055   3.66
 5.02 Intr - 208323 208183  141  1  0   60   82    47 0.058   1.62
 5.01 Intr - 210647 210510  138  0  0   95   49    87 0.170   5.94

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  65621  65529   93  2  0   92   95    34 0.847   4.66
S.002 Intr + 158886 159042  157  2  1   94  109   111 0.989  13.38
S.003 Term + 164924 165150  227  0  2   61   48   136 0.871   3.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:59200813_59424456|GENSCAN_predicted_peptide_1|620_aa
XYKGNFLQSCVQLRASRLRTPTWVRRRSRHPPALEGLKPCRTPGQTSSEIAAKLPIGLCL
ILPRPMEGISGLNAGLYHQDKRMSPPCKGQGTMQLSAFSLGRPVTGTGAIDITSVLSSSD
ATGNDIGRSVCFPGTSRVCLSRMSTVCQCEIQGDTVPNVFHGSWHPCLICSLSCMLMVTL
GASGCKAESRILPQNAGSQSLHGDQDSRTLRLPTSVNHHSTLYLCEIYFFSLTLFKVEHG
AWVIEGAQDVVVEQVNGVPKAWFCAIRLPEYTGGSSVRVKDEGKTGLNLQEEPSCATSES
PPCCGKEEKKEGDCRQTLGTLSLGTSSRIVREMDKRTVKDISPSAGEHGDCTTHSTAATS
GLSLQSDTCLAVVNDVPLPPGKGLDLGLLETQLLASQDSVSTDPKPYIFSDAQRPSSFGS
KGTFPHHDIATSVAAVCISLPVRTDHIAQEIHSAESRDHSQTAGRTLTSSSPDSKVTEEG
RAQTLLPGRPSSGQRISDSVPLESTEKTHLEIPASGPSSASSHHKEGRHKTFFPSRGQYG
CGEMTVPCPSLGSDGRKRQVSGLITRKDSVVPSKPEQPIEIPEAPSKSLKKRSLEGMRKQ
TRVEFSDTSSDDEDRLVIEI

>gi568815578f:59200813_59424456|GENSCAN_predicted_CDS_1|1863_bp
nngtacaaagggaatttcttgcagagctgtgttcagctgagagccagtagacttcgcaca
ccaacctgggtgcgaagaagaagccgccaccctcccgcacttgagggactgaagccatgc
aggacccctgggcagacctcttcagaaatagcagcgaaattgcccattgggctgtgcttg
atactgccacgccccatggagggcatctcagggctaaatgctggtctgtaccaccaggac
aagaggatgtctcccccatgcaaaggccaggggaccatgcaactctctgctttctcgttg
ggccgtcctgtgacaggcacaggtgccatcgatatcacctctgtgctctccagctcagat
gccaccgggaacgacattggcagatctgtttgcttccccgggacatctcgtgtctgtctg
tctaggatgtcgacggtctgtcagtgtgagattcaaggggacacagttcccaacgtgttc
catggaagttggcacccctgcctcatctgcagcttatcctgcatgctcatggtgaccctg
ggggcctcaggatgtaaagcagaatccaggattctgccgcaaaatgcaggctcccaatcg
ctgcacggtgaccaggactcccggacccttcgcttgccaacctctgttaaccaccattct
actctctacctctgtgagatctacttctttagtctcacactcttcaaggtggagcatggt
gcctgggtcattgaaggggcccaggatgtggttgttgaacaggtgaatggagtaccaaaa
gcctggttctgtgccattagattacctgagtacacgggagggtcatcagtgcgggtgaag
gatgaaggaaagacaggtctgaatctgcaagaggagccatcttgtgccacctcagaatca
cctccttgttgtgggaaggaagagaagaaggaaggtgactgcagacaaaccttaggaacc
ctctctcttggtacaagttcaagaattgtcagggaaatggacaaacgaactgtgaaggat
atttctccatctgctggtgagcatggtgactgtactactcacagcactgctgccacatca
ggattatctctgcaatctgacacctgcctggcagtggttaatgacgtgcctctaccccct
ggcaaaggtcttgaccttgggttgctggagactcagctgctggcctcccaggattcagtc
tcaacagatcccaaaccatacatcttctcagatgctcaaaggccttcttcctttgggtcc
aaaggaacttttccccaccatgacattgctacctctgtggctgccgtttgtatttctctg
ccagtgagaacagatcacatagcccaggaaattcacagtgctgaatcacgagaccacagc
cagactgcagggaggactctgacatcaagctccccagacagcaaagtcacagaagagggc
agagcacagaccctcttgccagggagaccttcatctggacaaagaatttcagattcggtt
ccactggagtcaactgaaaaaactcatcttgaaataccagcttcaggaccaagttcagct
agttcacaccacaaggaagggagacacaagacgttttttccttccagaggccagtatggg
tgtggggaaatgactgtcccctgcccctctttaggaagtgacggtaggaaacgtcaggta
tctggattaatcactcggaaagattctgtggttccttctaagccagagcagcccatagaa
attcctgaagccccttctaaatccctcaagaagaggagtctggaaggaatgagaaagcaa
actcgagtagagttcagtgacaccagcagcgacgatgaagaccgattagttatagaaata
tga

>gi568815578f:59200813_59424456|GENSCAN_predicted_peptide_2|172_aa
MRSAKKEVFPAFIPSANLLKTICEPGTFPQWSQECRHFWQTEKPCGKGWPQKPLDFNQKF
IGAKEPNKTSIRNSLDAFFETRNCHSSSLSGFLHTPVANSYSPNSKQEVTAGKQLNKAKV
IKVTGWEMKDASTPQAEGSLKTGIRSDSSLSEIDTLPDKVKILNNHSLNPFV

>gi568815578f:59200813_59424456|GENSCAN_predicted_CDS_2|519_bp
atgcgctctgcaaagaaggaggtcttccctgctttcattccatcagcaaatttactaaag
accatctgtgagccaggtacatttccccaatggagtcaagaatgccgacacttctggcaa
acagaaaagccttgtgggaaaggctggccacagaaaccattggacttcaaccagaagttc
attggagcaaaagaacccaacaaaacaagcatcaggaattctttagatgccttctttgaa
accagaaattgccacagttcttcgctttctggttttctacacaccccagttgccaacagc
tatagccctaactccaagcaggaagtgaccgctggaaagcagctgaacaaagcaaaggtg
atcaaggtcactggatgggagatgaaggatgcatcgacacctcaggccgaagggtccttg
aagacaggaatcaggtctgattcttctttgtctgaaatcgacacattacctgacaaagtc
aaaatcctcaataaccactcactcaacccatttgtctaa

>gi568815578f:59200813_59424456|GENSCAN_predicted_peptide_3|139_aa
MEPGLWLLFGLTVTSAAARSEGDCEETVAGPGEETVAGPGEGTVAPTALQGPSPGSPGQE
QAAEGAPEHHRSRRCTCFTYKDKECVYYCHLDIIWINTPESLLQVRGVTHIPPEGLAQEK
LNPEFDPVEGGALPLNEVN

>gi568815578f:59200813_59424456|GENSCAN_predicted_CDS_3|420_bp
atggagccggggctgtggctccttttcgggctcacagtgacctccgccgcagccagatct
gagggggactgtgaagagactgtggctggccctggcgaggagactgtggctggccctggc
gaggggactgtggccccgacagcactgcagggtccaagccctggaagccctgggcaggag
caggcggccgagggggcccctgagcaccaccgatccaggcgctgcacgtgcttcacctac
aaggacaaggagtgtgtctactattgccacctggacatcatttggatcaacactcccgag
tctcttctgcaggtcaggggtgtgacccacatcccccctgagggcctggcccaagagaaa
ctcaatcctgagtttgatcctgtggaaggaggagccctccctctgaatgaagtgaactga

>gi568815578f:59200813_59424456|GENSCAN_predicted_peptide_4|186_aa
MTICVALSQEVLLPLCDPLEGADISRSRTVFIIEILGLDFEPDMVSAALDGEVNGADTTS
GSGWTRQTVPYGLSNYRGSFRGKRSAGPLPGNLQLSHRPHLRCACVGRYDKACLHFCTQT
LDVSRMWGSTQQELQRSLEALTGPGVQELPRCHGGPRQRRTAKDFQESWAVRGPDVAGFS
GIWVEE

>gi568815578f:59200813_59424456|GENSCAN_predicted_CDS_4|561_bp
atgactatctgtgtggccttgagccaggaggtgctgttgcctctctgtgaccctctggaa
ggtgcagacatcagccgctccaggacagtatttattattgagatattgggacttgatttt
gagccagacatggtgtcagctgccctggatggagaggtgaatggggcagacacaaccagt
ggctccggctggacaagacagacggtgccctatggactgtccaactacagaggaagcttc
cggggcaagaggtctgcggggccacttccagggaatctgcagctctcacatcggccacac
ttgcgctgcgcttgtgtggggagatatgacaaggcctgcctgcacttttgcacccaaact
ctggacgtcagcaggatgtggggcagcacacagcaggagctgcagcgttcactcgaggca
ctaacaggtcctggagtccaggagctgcccaggtgccacggaggccccaggcagaggagg
acagccaaggacttccaggagtcatgggcagtccgaggcccagacgtggcaggcttttct
ggcatctgggtggaggaatga

>gi568815578f:59200813_59424456|GENSCAN_predicted_peptide_5|527_aa
XPMARYFQMVVELHSPSCSRDPYFCSVSTVSDGHSARWQAMLSFFPGEETMHPLLVSSYV
CLVLLTVCTIRESSACCSQTSHANEDLDRMYLAGASGREGVTGYIELLLRHLCIQAREKT
ERLPARGDRFPPGLGEKGECQRGSPSVRVTHRRLPVHKRQNPDGTPSSGSEPTFPLSAQG
KRHIWFGVGRLRPHHYLKPSLPGTISQSPLGRQALMTDICVDNCICSIQRPFWASQQLSE
QSSEDEVGVGGSMFPLDSPGFSPQCSDVLGASSVLKNLLLHAKHSETHFCCSFPARSFRR
PGGDVEPDYGSSSDPQAELPTVSWVLLDPSSHKVRKIQEQCIQDGSGTSRIKHKQEQGAH
VLILKAHLSNFSALPDPRGRTEAAQGADTSAALLVSELAHLRFHLRDLQAQTVPEAKWCP
AQANSQGQMEERAAKELQTLSSFLGLQYRIPGCTDTTAGLSQLTYLAKERHGPSAKQMLR
HPHGIPRRALLVLISFPNNTLSCWGKDQVQMRGGHTPGPRAIRTDVQ

>gi568815578f:59200813_59424456|GENSCAN_predicted_CDS_5|1584_bp
nggcccatggctcgctacttccagatggtggtggagttgcacagccccagctgttcccgg
gacccttatttttgctcggtgtccaccgtgtctgacggtcacagtgccaggtggcaggct
atgctctcatttttcccaggtgaggagaccatgcatccacttcttgtttcctcctatgtc
tgcttggtcctcctcacggtgtgcacgatcagagaaagttctgcctgttgctcccaaact
tcccatgctaatgaggacttagaccgcatgtacctagcaggggcgtctggccgcgagggc
gtcacggggtacattgaactcctgctgcgacacctgtgtatccaggcccgggagaagaca
gagcgcctccctgcaaggggagaccggttcccacctggactaggtgagaaaggtgagtgt
cagagaggctcaccaagtgtcagagtgactcaccgaaggttacctgttcacaagaggcag
aacccagacgggactccatcctctgggtccgagccaactttcccactgtcagctcagggg
aaaaggcatatctggtttggggttggaaggctgaggccacaccactacctgaagcccagc
ctgcccggcaccatctcccagagccccctgggcaggcaggcgttgatgacggacatctgt
gttgacaactgcatctgcagcatccagcgcccattctgggccagccagcagctctctgag
cagagctctgaggatgaagtgggtgttggaggctctatgttcccactggacagtcctggg
ttctccccacaatgcagtgatgttctgggagcttccagtgtgctcaagaacctgttactc
catgcaaagcactcagagacccacttctgctgctctttcccagcacggagcttccgccgc
cctggtggagatgtggagcctgactatggatcatcaagtgaccctcaggcagaactaccc
acagtaagttgggttctcttagatccatcaagtcataaagtcaggaagatccaggaacaa
tgtattcaagatggaagtggtacttcgaggatcaagcacaagcaggagcagggggcacat
gtgttgatcctgaaggcacatctcagcaatttttctgcactgccggatccaagaggaagg
actgaggctgcccaaggtgctgacaccagcgcggccctgcttgtgagtgagctcgcccat
ctccgctttcatctcagagatctgcaggcacagacagtgccagaagcaaaatggtgccca
gcacaggcaaacagccagggccagatggaagagagggctgccaaggagctgcagacgttg
agcagcttcctggggctgcagtataggatcccgggctgcactgacaccactgcagggctg
agccagctcacatacctggcaaaggagcgacatgggcccagtgcaaagcagatgctgcgc
caccctcacggcatccccaggagggcactccttgttctcatttccttccccaacaacaca
ctcagctgctggggcaaggaccaggtgcagatgagaggaggacacacgccagggccccgg
gccatccgcacggacgtccagtga