GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:53:33

Sequence gi568815597r:8903303_9126345 : 223043 bp : 48.40% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1701   1696    6                               1.05
 1.02 Term -   5369   5228  142  1  1   74   47   103 0.572   2.20
 1.01 Init -  18252  18164   89  0  2   96   99   118 0.393  12.55
 1.00 Prom -  35587  35548   40                              -0.76

 2.00 Prom +  40846  40885   40                              -1.76
 2.01 Init +  42585  42663   79  2  1   96   99   164 0.931  17.54
 2.02 Intr +  45961  46140  180  2  0   80  110   156 0.931  16.84
 2.03 Intr +  53835  53983  149  1  2  102   67   177 0.916  16.95
 2.04 Intr +  55608  55700   93  2  0  114   99    99 0.996  13.76
 2.05 Intr +  59285  59354   70  1  1  132  110    83 0.998  13.65
 2.06 Intr +  64357  64514  158  2  2   99   95    93 0.949  10.83
 2.07 Intr +  67565  67679  115  1  1   99   96   142 0.965  16.12
 2.08 Intr +  78752  78867  116  0  2   36   74    90 0.088   2.57
 2.09 Intr +  82108  82246  139  0  1   95   -1    58 0.367  -2.36
 2.10 Term +  88348  88538  191  2  2   86   36   129 0.674   5.11
 2.11 PlyA +  89657  89662    6                               1.05

 3.13 PlyA -  90333  90328    6                               1.05
 3.12 Term - 100216  99998  219  1  0  103   40   235 0.820  17.24
 3.11 Intr - 101577 101450  128  1  2  131  109   109 0.997  17.70
 3.10 Intr - 104083 104008   76  1  1  113   81   138 0.915  14.69
 3.09 Intr - 106942 106841  102  1  0  123   41   215 0.926  20.67
 3.08 Intr - 110333 110223  111  2  0   94  109   168 0.998  20.08
 3.07 Intr - 111566 111379  188  0  2   88   80   326 0.994  31.21
 3.06 Intr - 111940 111815  126  2  0   90   86   185 0.980  19.15
 3.05 Intr - 115073 114921  153  0  0   82   91   165 0.999  16.24
 3.04 Intr - 116031 115907  125  2  2   90   82   222 0.999  22.03
 3.03 Intr - 119801 119616  186  1  0   41   97    94 0.587   4.40
 3.02 Intr - 121772 121663  110  2  2   63   86   177 0.565  14.08
 3.01 Init - 123043 122993   51  1  0   80   98    -5 0.737   0.86
 3.00 Prom - 128090 128051   40                              -3.96

 4.15 PlyA - 129927 129922    6                               1.05
 4.14 Term - 134487 134284  204  0  0  107   41   335 0.997  28.07
 4.13 Intr - 134722 134595  128  2  2   99  109   166 0.999  20.20
 4.12 Intr - 135204 135129   76  0  1  112   62   114 0.780  10.29
 4.11 Intr - 135366 135274   93  0  0   45   52    79 0.392   0.16
 4.10 Intr - 135627 135526  102  0  0   97   94   114 0.999  13.27
 4.09 Intr - 136360 136250  111  1  0   88   78   351 0.841  34.68
 4.08 Intr - 136685 136498  188  0  2   76    6   486 0.629  38.61
 4.07 Intr - 136887 136762  126  1  0   74   92   221 0.977  21.75
 4.06 Intr - 138635 138483  153  0  0   97  101   202 0.998  22.44
 4.05 Intr - 144432 144308  125  2  2  110   97   -32 0.393   0.13
 4.04 Intr - 145968 145798  171  2  0   40   61    95 0.084   1.06
 4.03 Intr - 154306 154146  161  1  2   79  110    49 0.212   4.99
 4.02 Intr - 154948 154850   99  1  0   76   82   119 0.177  10.41
 4.01 Init - 168490 168371  120  1  0  100   51   145 0.062  10.19
 4.00 Prom - 173092 173053   40                              -2.16

 5.05 PlyA - 173345 173340    6                               1.05
 5.04 Term - 200942 200797  146  0  2   76   55    96 0.602   3.17
 5.03 Intr - 202142 201977  166  2  1   93   65   -11 0.266  -3.37
 5.02 Intr - 202378 202184  195  1  0   71   53   405 0.783  34.91
 5.01 Intr - 208187 207974  214  1  1  130   66   379 0.534  38.62

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 218715 218790   76  2  1   89   56   100 0.879   6.20
S.002 Term + 218890 218975   86  2  2   95   47    77 0.979   2.12

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:8903303_9126345|GENSCAN_predicted_peptide_1|76_aa
MEELKAWRLEEALDPALGLSGKKESVKDSSAAGLESPRLSWSRQAVPNMGAHIRVKGSPE
RQLENVKLNWRTPEYS

>gi568815597r:8903303_9126345|GENSCAN_predicted_CDS_1|231_bp
atggaggaactaaaggcctggaggctggaggaagcccttgacccagcgctgggtctttcg
ggcaagaaagagtcggtcaaggattccagtgccgccgggttagagtctccgcgactgagc
tggtctcggcaagcggtgcccaacatgggggctcacatccgggtcaaagggtcgccggag
cgacagttggagaacgtgaaactaaactggaggacacccgagtactcttaa

>gi568815597r:8903303_9126345|GENSCAN_predicted_peptide_2|429_aa
MRALVLLLSLFLLGGQAQHVSDWTYSEGALDEAHWPQHYPACGGQRQSPINLQRTKVRYN
PSLKGLNMTGYETQAGEFPMVNNGHTVQISLPSTMRMTVADGTVYIAQQMHFHWGGASSE
ISGSEHTVDGIRHVIEIHIVHYNSKYKSYDIAQDAPDGLAVLAAFVEVKNYPENTYYSNF
ISHLANIKYPGQRTTLTGLDVQDMLPRNLQHYYTYHGSLTTPPCTENVHWFVLADFVKLS
RTQVWKLENSLLDHRNKTIHNDYRRTQPLNHRVVESNFPNQGRQPLETLSSVNITEHEIR
VHTVLLETQATEECVIRHEEGGLAYGSGCTVALERICPVGGKGCPLGGIMVVSRNLLHWI
PGLEEPHLCETQRLLFVLPSSQSMCFEWDVSACYNLLATRIVFRDRCDQGAQSGPTPECG
GTQPQHVQN

>gi568815597r:8903303_9126345|GENSCAN_predicted_CDS_2|1290_bp
atgagggccctggtgcttctgctgtccctgttcctgctgggtggccaggcccagcatgtg
tctgactggacctactcagaaggggcactggacgaagcgcactggccacagcactacccc
gcctgtgggggccagagacagtcgcctatcaacctacagaggacgaaggtgcggtacaac
ccctccttgaaggggctcaatatgacaggctatgagacccaggcaggggagttccccatg
gtcaacaatggccacacagtgcagatcagcctgccctccaccatgcgcatgacagtggct
gacggcactgtatacatagcccagcagatgcactttcactggggaggtgcgtcctcggag
atcagcggctctgagcacaccgtggacgggatcagacatgtgatcgagattcacattgtt
cactacaattctaaatacaagagctatgatatagcccaagatgcgccggatggtttggct
gtactggcagccttcgttgaggtgaagaattaccctgaaaacacttattacagcaacttc
atttctcatctggccaacatcaagtacccaggacaaagaacaaccctgactggccttgac
gttcaggacatgctgcccaggaacctccagcactactacacctaccatggctcactcacc
acgcctccctgcactgagaacgtccactggtttgtgctggcagattttgtcaagctctcc
aggacacaggtttggaagctggagaattccttactggatcaccgcaacaagaccatccac
aacgattaccgcaggacccagcccctgaaccacagagtggtggaatccaacttcccgaat
cagggccgacaacctttggaaacactgagcagtgttaacatcactgaacacgaaataaga
gtacacactgtgcttctggaaactcaggccacagaagaatgtgtaattcgccatgaagaa
ggagggctcgcctatggaagcggctgcacagtggctctagaaagaatatgtcctgtagga
gggaagggctgccctctgggtggaatcatggtggtctccagaaatctcctccactggatt
cctgggctggaggagccacatctctgtgagacacagcggctgctcttcgtcctccccagt
tctcagtccatgtgttttgagtgggatgtgtcagcgtgctacaacctgttggctacaagg
attgttttccgggatagatgtgatcaaggagcccaatcaggtcccaccccagaatgtggg
gggactcaaccacaacatgtacaaaactaa

>gi568815597r:8903303_9126345|GENSCAN_predicted_peptide_3|524_aa
MENKEAGTPPPIPSREGRLQPTLLLATLSAAFGSAFQYGYNLSVVNTPHKVGTSCGWGNV
FQVFKSFYNETYFERHATFMDGKLMLLLWSCTVSMFPLGGLLGSLLVGLLVDSCGRKGTL
LINNIFAIIPAILMGVSKVAKAFELIVFSRVVLGVCAGISYSALPMYLGELAPKNLRGMV
GTMTEVFVIVGVFLAQIFSLQAILGNPAGWPVLLALTGVPALLQLLTLPFFPESPRYSLI
QKGDEATARQALRRLRGHTDMEAELEDMRAEARAERAEGHLSVLHLCALRSLRWQLLSII
VLMAGQQLSGINAINYYADTIYTSAGVEAAHSQYVTVGSGVVNIVMTITSAVLVERLGRR
HLLLAGYGICGSACLVLTVVLLFQNRVPELSYLGIICVFAYIAGHSIGPSPVPSVVRTEI
FLQSSRRAAFMVDGAVHWLTNFIIGFLFPSIQEAIGAYSFIIFAGICLLTAIYIYVVIPE
TKGKTFVEINRIFAKRNRVKLPEEKEETIDAGPPTASPAKETSF

>gi568815597r:8903303_9126345|GENSCAN_predicted_CDS_3|1575_bp
atggagaacaaagaggcgggaacccctccacccattccatccagggaggggcggctccag
ccgacgctgttgctggcgacactgagcgcggcctttggctcagccttccagtacggctac
aacctctctgtggtcaacacgccgcacaaggtgggcacaagctgtggatggggcaatgtt
ttccaggtcttcaagtcattttacaacgaaacctactttgagcgacacgcaacattcatg
gacgggaagctcatgctgcttctatggtcttgcaccgtctccatgtttcctctgggcggc
ctgttggggtcattgctcgtgggcctgctggttgatagctgcggcagaaaggggaccctg
ctgatcaacaacatctttgccatcatccccgccatcctgatgggagtcagcaaagtggcc
aaggcttttgagctgatcgtcttttcccgagtggtgctgggagtctgtgcaggcatctcc
tacagcgcccttcccatgtacctgggagaactggcccccaagaacctgagaggcatggtg
ggaacaatgaccgaggttttcgtcatcgttggagtcttcctagcacagatcttcagcctc
caggccatcttgggcaacccggcaggctggccggtgcttctggcgctcacaggggtgccc
gccctgctgcagctgctgaccctgcccttcttccccgaaagcccccgctactccctgatt
cagaaaggagatgaagccacagcgcgacaagctctgaggaggctgagaggccacacggac
atggaggccgagctggaggacatgcgtgcggaggcccgggccgagcgcgccgagggccac
ctgtctgtgctgcacctctgtgccctgcggtccctgcgctggcagctcctctccatcatc
gtgctcatggccggccagcagctgtcgggcatcaatgcgatcaactactatgcggacacc
atctacacatctgcgggcgtggaggccgctcactcccaatatgtaacggtgggctctggc
gtcgtcaacatagtgatgaccatcacctcggctgtccttgtggagcggctgggacggcgg
cacctcctgctggccggctacggcatctgcggctctgcctgcctggtgctgacggtggtg
ctcctattccagaacagggtccccgagctgtcctacctcggcatcatctgtgtctttgcc
tacatcgcgggacattccattgggcccagtcctgtcccctcggtggtgaggaccgagatc
ttcctgcagtcctcccggcgggcagctttcatggtggacggggcagtgcactggctcacc
aacttcatcataggcttcctgttcccatccatccaggaggccatcggtgcctacagtttc
atcatctttgccggaatctgcctcctcactgcgatttacatctacgtggttattccggag
accaagggcaaaacatttgtggagataaaccgcatttttgccaagagaaacagggtgaag
cttccagaggagaaagaagaaaccattgatgctgggcctcccacagcctctcctgccaag
gaaacttccttttag

>gi568815597r:8903303_9126345|GENSCAN_predicted_peptide_4|618_aa
MTASRHAPVPALGSLQGGSTGRALVTPWLCLRRPVTGRNRRLTLVLALATLIAAFGSSFQ
YGYNVAAVNSPALLMQQFYNETYYGRTGEFMEDFPLTLLWSVTVSMFPFGGFIGSLLVGP
LVNKFGRCAVSISGSHVWQVSQCLGMLELHSLTASSIMRVLIMRAPLREAPPEKFALLLS
GSPGKGALLFNNIFSIVPAILMGCSRVATSFELIIISRLLVGICAGVSSNVVPMYLGELA
PKNLRGALGVVPQLFITVGILVAQIFGLRNLLANVDGWPILLGLTGVPAALQLLLLPFFP
ESPRYLLIQKKDEAAAKKALQTLRGWDSVDREVAEIRQEDEAEKAAGFISVLKLFRMRSL
RWQLLSIIVLMGGQQLSGVNAIYYYADQIYLSAGVPEEHVQYVTAGTGAVNVVMTFCAVF
VVELLGRRLLLLLGFSICLIACCVLTAALALQLIAHVQLISQWLIEVALLGEEPGAILHP
PGQDTVSWMPYISIVCVISYVIGHALGPSPIPALLITEIFLQSSRPSAFMVGGSVHWLSN
FTVGLIFPFIQEGLGPYSFIVFAVICLLTTIYIFLIVPETKAKTFIEINQIFTKMNKVSE
VYPEKEELKELPPVTSEQ

>gi568815597r:8903303_9126345|GENSCAN_predicted_CDS_4|1857_bp
atgacggcctcccgccacgcccccgtccccgcgctcggctccctccagggcggaagcacg
ggtcgagcgttggtgacgccatggctgtgcttgcgacgccctgtcactggcaggaaccgg
aggctgacgcttgtgcttgccctggcaaccctgatagctgcctttgggtcatccttccag
tatgggtacaacgtggctgctgtcaactccccagcactgctcatgcaacaattttacaat
gagacttactatggtaggaccggtgaattcatggaagacttccccttgacgttgctgtgg
tctgtaaccgtgtccatgtttccatttggagggtttatcggatccctcctggtcggcccc
ttggtgaataaatttggcaggtgtgctgttagtataagcggctctcacgtgtggcaagtt
tctcagtgtttaggaatgctggagctccactcgctgacagcttcgtccatcatgagggtc
ctcatcatgagggctccattgcgagaggcccctccagagaagtttgctttgcttctgtca
ggatctccaggaaaaggggccttgctgttcaacaacatattttctatcgtgcctgcgatc
ttaatgggatgcagcagagtcgccacatcatttgagcttatcattatttccagacttttg
gtgggaatatgtgcaggtgtatcttccaacgtggtccccatgtacttaggggagctggcc
cctaaaaacctgcggggggctctcggggtggtgccccagctcttcatcactgttggcatc
cttgtggcccagatctttggtcttcggaatctccttgcaaacgtagatggctggccgatc
ctgctggggctgaccggggtccccgcggcgctgcagctccttctgctgcccttcttcccc
gagagccccaggtacctgctgattcagaagaaagacgaagcggccgccaagaaagcccta
cagacgctgcgcggctgggactctgtggacagggaggtggccgagatccggcaggaggat
gaggcagagaaggccgcgggcttcatctccgtgctgaagctgttccggatgcgctcgctg
cgctggcagctgctgtccatcatcgtcctcatgggcggccagcagctgtcgggcgtcaac
gctatctactactacgcggaccagatctacctgagcgccggcgtgccggaggagcacgtg
cagtacgtgacggccggcaccggggccgtgaacgtggtcatgaccttctgcgccgtgttc
gtggtggagctcctgggtcggaggctgctgctgctgctgggcttctccatctgcctcata
gcctgctgcgtgctcactgcagctctggcactgcagctcatagcccacgttcagctgatt
tcccagtggctcatcgaggtggcactgctgggggaggagccaggtgccatcctccaccca
ccagggcaggacacagtgtcctggatgccatacatcagcatcgtctgtgtcatctcctac
gtcataggacatgccctcgggcccagtcccatacccgcgctgctcatcactgagatcttc
ctgcagtcctctcggccatctgccttcatggtggggggcagtgtgcactggctctccaac
ttcaccgtgggcttgatcttcccgttcatccaggagggcctcggcccgtacagcttcatt
gtcttcgccgtgatctgcctcctcaccaccatctacatcttcttgattgtcccggagacc
aaggccaagacgttcatagagatcaaccagattttcaccaagatgaataaggtgtctgaa
gtgtacccggaaaaggaggaactgaaagagcttccacctgtcacttcggaacagtga

>gi568815597r:8903303_9126345|GENSCAN_predicted_peptide_5|240_aa
XWGVPLVITVAAVALKKIGYDASDVSVGWCWIDLEAKDHVLWMLLTGKLWEMLAYVLLPL
LYLLVRKHINRAHTALSEYRPILSQEHRLLRHSSMADKKLVLIPLIFIGLRVWSTVRFVL
TLCGSPAVQTPVLVVLHRRHSLIPSRSTVAAPQCRPRHSCREELGPTVLCLPLCLRQDLE
ERRQSGISVVRGDPQAASSTGTSVLAHGWAFICIRVAVRFGKPVAHSCGHSAPTQWTLMS

>gi568815597r:8903303_9126345|GENSCAN_predicted_CDS_5|723_bp
nnctggggggtcccgttggtcatcactgtggcagccgtcgccctgaagaagattggctat
gacgcctcggacgtgtctgtgggctggtgctggatcgacctggaggccaaggaccatgtc
ctgtggatgctgctgacggggaagctgtgggagatgctggcatatgtgctgctgcctctg
ctgtacctcctggtccggaagcacatcaacagagcgcacacggcactctctgagtaccgg
cccatcctctcccaggagcaccgcctgctgcgccactcctccatggcggacaagaagctg
gtgctcatcccgctcatcttcatcggcctcagggtctggagcaccgtgcggttcgtgctg
accctctgtggctccccggccgtgcagacgccggtgctggtggttctgcataggaggcac
agcctgattccttcccgcagcacagtggctgcaccccagtgtcggccaaggcacagctgc
agggaggagctcggccccactgtgctgtgccttcctctctgcctgagacaggaccttgaa
gagagaaggcagagtgggatcagtgtggtccggggtgatcctcaggcagcaagttccacc
ggtaccagcgtcctggcccacgggtgggcgttcatctgcataagggtagcagtgagattc
gggaagccggtggcccacagctgtggccacagtgcccccacccagtggaccttgatgtcc
tga