GENSCAN 1.0 Date run: 4-Nov-116 Time: 03:11:56 Sequence gi568815576f:32702002_32959374 : 257373 bp : 45.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 27 22 6 1.05 1.03 Term - 2378 2302 77 0 2 92 38 60 0.306 -0.60 1.02 Intr - 9855 9775 81 1 0 97 82 9 0.274 0.81 1.01 Init - 11452 11344 109 1 1 60 51 104 0.423 4.28 1.00 Prom - 14842 14803 40 -4.06 2.00 Prom + 26121 26160 40 -1.86 2.01 Init + 30582 30706 125 2 2 96 99 40 0.539 5.54 2.02 Intr + 42569 42647 79 2 1 98 74 25 0.297 1.65 2.03 Intr + 42672 42707 36 2 0 115 99 48 0.496 7.06 2.04 Intr + 44601 44628 28 2 1 99 92 21 0.477 1.29 2.05 Intr + 52675 52757 83 2 2 85 105 70 0.288 7.76 2.06 Term + 57660 58268 609 2 0 50 47 431 0.805 29.70 2.07 PlyA + 64754 64759 6 1.05 3.08 PlyA - 65690 65685 6 1.05 3.07 Term - 69322 69125 198 1 0 61 47 82 0.234 -1.20 3.06 Intr - 73405 73280 126 1 0 98 77 12 0.342 1.98 3.05 Intr - 78273 78169 105 0 0 88 86 61 0.834 6.31 3.04 Intr - 78751 78697 55 0 1 56 119 20 0.527 0.88 3.03 Intr - 85878 85810 69 2 0 150 53 54 0.061 6.40 3.02 Intr - 86894 86891 4 0 1 101 94 0 0.032 -5.91 3.01 Init - 88857 88761 97 0 1 67 67 92 0.558 3.53 3.00 Prom - 89292 89253 40 -3.86 4.00 Prom + 95775 95814 40 -3.76 4.01 Init + 100001 100121 121 1 1 88 111 168 0.182 17.47 4.02 Intr + 116685 116811 127 1 1 59 91 51 0.328 2.24 4.03 Intr + 121306 121367 62 1 2 92 68 41 0.242 0.88 4.04 Intr + 128575 128683 109 2 1 119 21 57 0.190 1.44 4.05 Intr + 128814 128865 52 0 1 67 21 104 0.034 0.51 4.06 Intr + 147291 147339 49 2 1 104 116 -11 0.407 1.55 4.07 Intr + 147451 147533 83 2 2 138 100 156 0.971 21.16 4.08 Intr + 155248 155359 112 0 1 70 95 194 0.980 18.25 4.09 Intr + 156016 156137 122 2 2 113 121 177 0.999 23.71 4.10 Term + 157179 157376 198 2 0 80 53 333 0.992 26.40 4.11 PlyA + 158932 158937 6 1.05 5.06 PlyA - 159782 159777 6 1.05 5.05 Term - 162187 162155 33 1 0 84 40 20 0.413 -5.61 5.04 Intr - 163003 162914 90 1 0 94 101 52 0.865 7.29 5.03 Intr - 167153 166965 189 2 0 51 66 273 0.601 21.18 5.02 Intr - 171618 171463 156 0 0 69 51 135 0.869 8.11 5.01 Init - 172791 172645 147 0 0 81 -2 88 0.269 -0.73 5.00 Prom - 177324 177285 40 -1.96 6.00 Prom + 184524 184563 40 -4.66 6.01 Init + 191390 191392 3 1 0 98 91 0 0.236 1.30 6.02 Intr + 198904 198984 81 0 0 75 69 62 0.444 2.83 6.03 Intr + 202643 202776 134 1 2 86 75 24 0.514 0.34 6.04 Intr + 209635 209764 130 1 1 95 56 102 0.980 8.40 6.05 Term + 216130 216243 114 0 0 101 42 117 0.952 6.97 6.06 PlyA + 216488 216493 6 1.05 7.04 PlyA - 219128 219123 6 1.05 7.03 Term - 222541 222463 79 0 1 72 42 97 0.710 0.74 7.02 Intr - 229480 229389 92 1 2 102 107 83 0.851 10.39 7.01 Init - 235495 235490 6 1 0 84 87 0 0.094 0.38 7.00 Prom - 237875 237836 40 -5.16 8.00 Prom + 238358 238397 40 -5.96 8.01 Sngl + 240488 240706 219 1 0 88 42 205 0.982 11.06 8.02 PlyA + 241366 241371 6 1.05 9.00 Prom + 241902 241941 40 -3.76 9.01 Sngl + 242445 243308 864 2 0 42 41 259 0.934 12.48 9.02 PlyA + 243455 243460 6 1.05 10.00 Prom + 244219 244258 40 -2.46 10.01 Init + 255312 255419 108 2 0 37 76 86 0.023 0.77 10.02 Term + 255880 256029 150 0 0 115 51 58 0.031 2.71 10.03 PlyA + 257038 257043 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 61811 61753 59 0 2 134 48 16 0.911 0.05 S.002 Init - 64975 64921 55 1 1 98 113 4 0.918 5.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_1|88_aa MIFSNMLEPTEGLHEKQSKPLKVYLTTEELGAENLKYQLLEASISEKMFELGIIQEMQIL TKLGTWMELEAIIFSKLMQEQKTKYYMF >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_1|267_bp atgatcttcagcaatatgctggagcccactgaaggcttgcatgagaagcaaagcaagcct ctgaaagtgtacctgaccacggaggaactaggggcagaaaatttaaaataccaactcctg gaagcaagcatttccgagaagatgtttgaacttggtattattcaagaaatgcaaattcta acaaaattaggaacatggatggagctggaggccattatctttagcaaactaatgcaggaa cagaaaaccaaatactacatgttctga >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_2|319_aa MDESFNLPGHRNFICKVRINSVYISYEVNITACLTDDDVSHRLISKRRSNANLGLVDPQV HAFTQHATDSVNPEKQLTVSEAFSDSSSTASPQTFPEHTIQSYTLMPLLLRHLLKGKPAP TAPPASTHAPPASTHPPSASTHPPPASTHPAPVSTHGTPSQHPRTTSQHPPTISQHPPTT SQYPPTPSQHPPTSSQYPPTSSQYPRTRSQHPPTSSQHPPTPSQYPPTPSQHPPTSSQYP PTPSQHPPTSSQYPPTTSQHPRAPSQRPPTSSQHPCIRSQHPLTPQPAPTYPPASTHPPP ANSHAPTASTHAGLRPFLD >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_2|960_bp atggatgagtcattcaatctccctggacatcgtaatttcatctgtaaagtgaggattaac agtgtctacatatcctatgaggtaaatattactgcctgcttgacagatgatgatgtttca cacaggttaataagtaagagacggtcaaatgcaaatttgggtctagtggaccctcaagtc catgctttcacccaacatgctactgactctgtcaacccggagaagcagctcactgtgtcg gaagccttctctgattcttccagcacagcaagtcctcagacgttccctgagcataccata cagtcctacaccttgatgcctttgctccttcggcacttgctgaaagggaagccagcaccc acggcacccccagccagcacccacgcaccaccagccagcacccacccaccatcagccagc acccacccacccccagccagcacccacccagccccagtcagcacccacggcacccccagc cagcacccacgcaccaccagccagcacccacccaccatcagccagcacccacccaccacc agccagtacccacccacccccagccagcacccaccaaccagcagccagtacccacccacc agcagccagtacccacgcacccgcagccagcacccacccaccagcagccagcacccaccc acccccagccagtacccacccacccccagccagcacccacccaccagcagccagtaccca cccacccccagccagcacccacccaccagcagccagtacccacccaccaccagccagcac ccacgtgcccccagccagcgcccacccaccagcagccagcacccatgcatccgcagccag cacccactcaccccccagccagcacccacgtaccccccagccagcacccacccgccccca gccaacagccacgcaccaacagccagcacccatgcaggcctgcggcccttccttgattga >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_3|217_aa MVKSKVLETPVLHWSWLTLACQSLLLYIREFAGRSCHGQSDPSNSAASSSDITFSGRIKK LRHRKFSRLSKVMQLGQLQPVKFQHSSAKHRPDMASHTSELAAASFVTRQVFVAQMLLRT ELCPPKIHVEASTSSAAVFSDRACKEVVRLDKNFNISMVDNFRGQGWLYTMEYYAAIKKN EFMSFAGTWMKLETILLSKLTQEQKTKHYMLSPTSGS >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_3|654_bp atggtgaagagcaaggttctagagaccccagtgctgcactggagctggctcaccctggct tgtcagagcctattgttatacattcgggaatttgcaggcaggtcttgccatggccagtca gatccttctaattctgcagcttccagctcagatattaccttctcggggaggataaagaaa ctaaggcacaggaagttcagtagattgtccaaggtcatgcagctgggccagcttcagccc gtgaaattccagcacagttcagcaaagcacagaccagacatggcgtcccacaccagtgag ctggcagctgcttcctttgtaaccagacaggtgtttgtggctcagatgctgctaaggact gaattatgtccccccaaaattcatgttgaagcctcaacctccagtgcagctgtatttagt gatagggcctgtaaggaggtagttagattagacaagaattttaacatctccatggtggac aacttcagagggcaaggctggttatataccatggaatactatgcagccataaaaaagaat gagttcatgtcctttgcagggacatggatgaagctggaaaccatccttctcagcaaacta acacaggaacagaaaaccaaacactacatgttgtcacccacaagtgggagttga >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_4|344_aa MTPWLGLIVLLGSWSLGDWGAEACTCSPSHPQDAFCNSDIALAPLTAVRKADVTVAGLGG KSQDKTASTWIWRRGSDWQEIRRSLQEMGPATENAARDVSVGSSQVMHRQLGIDMRVKGI LVGKHQRDPQGHPPAPAIPLGFRKVFPANPADGLRWHGIPTLVNTIIYSSWLQVIRAKVV GKKLVKEGPFGTLVYTIKQMKMYRGFTKMPHVQYIHTEASESLCGLKLEVNKYQYLLTGR VYDGKMYTGLCNFVERWDQLTLSQRKGLNYRYHLGCNCKIKSCYYLPCFVTSKNECLWTD MLSNFGYPGYQSKHYACIRQKGGYCSWYRGWAPPDKSIINATDP >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_4|1035_bp atgaccccttggctcgggctcatcgtgctcctgggcagctggagcctgggggactggggc gccgaggcgtgcacatgctcgcccagccacccccaggacgccttctgcaactccgacatc gctcttgcaccattaactgcagtcagaaaagctgatgtcactgtggccggcctaggggga aagtctcaggataaaactgcttccacctggatctggagaaggggaagtgactggcaagag attcgcaggtccctgcaggagatgggacctgccacagagaatgccgcaagggatgtttct gttgggagctcacaggtcatgcacaggcagctgggaatagatatgcgagttaaaggaatt ctcgttggaaaacaccagagggatccacagggccacccccccgcccccgcaataccactt ggattccgcaaggtgtttcctgcaaatcctgctgatggtcttcgctggcacggaattccc actcttgtaaacacaatcatctattccagttggctccaagtgatccgggccaaggtggtg gggaagaagctggtaaaggaggggcccttcggcacgctggtctacaccatcaagcagatg aagatgtaccgaggcttcaccaagatgccccatgtgcagtacatccatacggaagcttcc gagagtctctgtggccttaagctggaggtcaacaagtaccagtacctgctgacaggtcgc gtctatgatggcaagatgtacacggggctgtgcaacttcgtggagaggtgggaccagctc accctctcccagcgcaaggggctgaactatcggtatcacctgggttgtaactgcaagatc aagtcctgctactacctgccttgctttgtgacttccaagaacgagtgtctctggaccgac atgctctccaatttcggttaccctggctaccagtccaaacactacgcctgcatccggcag aagggcggctactgcagctggtaccgaggatgggcccccccggataaaagcatcatcaat gccacagacccctga >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_5|204_aa MDVRKSGAFAGLAGISSGCDGSQHVNKISMNITLCQPGSWGPGDSYEIVTSYSPGKCGSL IREDFRMGSVSRKRRSSGYRADKTSTVHYTPNNEGTFIQSQCGNTAVGFLSRSFKPDFIL VRQHAYSMALGEDYRSLVIGLQYGGLPAVNSLYSVYNFCSKPWVFSQLIKIFHSLGPEKF PLVEQTFFPNHKPMGMRLWEQNNE >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_5|615_bp atggacgtcagaaagtctggagccttcgcaggcctggctggaatcagctctgggtgtgat gggtctcagcatgtgaacaaaatcagcatgaacatcaccctttgccaacctggctcttgg ggaccaggggacagttatgaaatagtgaccagttacagcccaggaaaatgtggttctctg atcagagaggatttccgcatgggctctgtttctagaaaaaggaggagttctgggtacaga gcagataaaaccagcacagtccactacacccccaacaacgagggcacattcattcagtcg cagtgtggtaacactgctgttggctttctcagcagatccttcaagccagacttcatcctg gtccgccagcatgcctacagcatggccctgggggaagactaccgcagcctggtcatcggc ctgcagtatggagggctgcctgctgtcaactctctctactccgtctacaacttctgcagc aagccctgggtgttctctcagctcattaagatcttccattccctgggtcctgagaagttc ccgcttgtggagcaaacatttttccccaaccataagccaatgggaatgcgtctttgggaa cagaataatgaatag >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_6|153_aa MTISYMTAAEIFGLLMQQQHPIDVSSIQAEKSPYAYRVLGFYKAFEQVMGQFLPSTPRNT WTVDSCPLWEAFSSEGGNKRWLHGSPRETEREPNHHVVNKCCDRSTAELLLHRIKKSCRI LIHAKSRLVMNAFSKDLGIVPGAGATIRKNSLT >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_6|462_bp atgactatcagctacatgacggcagcggaaatatttggtttactgatgcagcaacagcat ccaatagatgtatctagcatacaggcagaaaaatctccttatgcatatagagtcttggga ttttataaagcatttgaacaagtaatgggacagtttcttccttccactccacgcaacact tggactgtggactcttgtccactgtgggaagctttcagctccgagggtggtaataaaaga tggctgcacggctctccccgggaaactgaacgggagccaaaccatcatgtggtgaacaaa tgctgtgatcgctcaactgcagagctgctgctgcaccgtattaaaaagagttgtaggatt cttattcatgcaaagagtcggctggtcatgaatgccttcagcaaagatttaggcattgtg ccaggtgctggggccacaattcggaaaaacagcttaacttga >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_7|58_aa MIAEFSELNLAAYVTGGCMVDMQVVRNGTKVVRFSLSSKDVNKLLDAADAAVGRSHFE >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_7|177_bp atgattgctgaattctcagagttgaacctagctgcctatgtgaccgggggctgcatggtg gacatgcaggtcgtgagaaatgggaccaaagtggtgaggttcagtttgagcagcaaggat gtgaacaagctcctggatgctgctgatgctgctgttggaagatcacattttgagtag >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_8|72_aa MGKKQSRKAENSRNQSASSPPKERSFSPAMEQSWMENDFDELSEEGFRRSNFSELKEEVQ THRKEAKTLKKD >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_8|219_bp atggggaaaaaacagagcagaaaagctgaaaattctagaaatcagagcgcctcttctcct ccaaaggaacgcagcttctcaccagcaatggaacaaagctggatggagaatgactttgac gagttgagtgaagaaggcttcagacgatcaaacttctctgagctaaaggaggaagttcaa acccatcgcaaagaagctaaaaccttgaaaaaagattag >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_9|287_aa MIISIDAEKAFDKIQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQKLEAFPLK TGISQGCPLSPLLFNTVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVS AQNLLKLISNSAVSGYKINMQKSQAFLYTNNRQTEGQIMSKLPFTIASKRIKYLGIQLTR DVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPKAIYGFNAIPIKLPMT FFTELEITTLKFIWNQKRAHTAKTILSERTKLEASRYLTLNYTTRLQ >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_9|864_bp atgattatctcaatagatgcagaaaaggcctttgacaaaattcaacagcccttcatgcta aaaactctcaataaattaggtattgatgggatgtatctcaaaataataagagctatttat gacaaacccacagccaatatcatactgaatggacaaaaactggaagcattccccttgaaa actggcataagccagggatgccctctctcaccactcctattcaacacagtgttggaagtt ctggccagggcaatcaggcaggagaaagaaataaagggcattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcgtctca gcccaaaatctccttaagctgataagcaattcagcagtctcaggatacaaaatcaatatg caaaaatcacaagcattcttatacaccaataacagacaaacagagggccaaatcatgagt aaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaggatctcttcaaggagaactacaaaccactgctcaacgaaataaaggaggac acaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtgaaaatg gccatactgcccaaggcaatttatggattcaatgccatccccattaagctaccaatgact ttcttcacagaactggaaataactactttaaagttcatatggaaccaaaaaagagcccac actgccaagacaatcctaagcgaaagaacaaaactggaggcatcacgctacctgacttta aactatactacaaggctacagtaa >gi568815576f:32702002_32959374|GENSCAN_predicted_peptide_10|85_aa MQEADSCSLGMRGLAGLVLPQENEHRRDAGWLSTSPGLPWFHPSMTQIELRANPNLISKT GTARDSVREKNPPVANAFRIKKQRQ >gi568815576f:32702002_32959374|GENSCAN_predicted_CDS_10|258_bp atgcaggaagcagactcgtgctctctgggaatgagaggcctggccggacttgtgcttcca caagaaaatgaacatcgccgggatgcaggatggcttagcacttctcctgggctaccgtgg ttccaccccagcatgactcagatagagcttcgcgctaacccaaacctaatctccaaaacg gggactgcaagggacagtgtcagggaaaagaacccacctgtggcaaatgcattccgcatc aagaagcaaaggcaatga