GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:36:26 Sequence gi568815597r:150712024_150958461 : 246438 bp : 42.21% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.35 PlyA - 70 65 6 1.05 1.34 Term - 11667 11424 244 2 1 27 49 126 0.000 -3.01 1.33 Intr - 35856 35754 103 1 1 78 90 117 0.873 9.21 1.32 Intr - 38148 37983 166 0 1 103 43 170 0.978 12.61 1.31 Intr - 39985 39758 228 1 0 85 74 290 0.994 24.44 1.30 Intr - 43127 42978 150 2 0 87 89 128 0.991 12.24 1.29 Intr - 45957 45835 123 0 0 82 78 57 0.915 3.96 1.28 Intr - 52741 52615 127 0 1 80 71 48 0.441 2.06 1.27 Intr - 87250 87145 106 2 1 52 95 84 0.926 3.85 1.26 Intr - 87686 87521 166 2 1 88 116 165 0.992 17.91 1.25 Intr - 92216 91998 219 2 0 71 94 271 0.985 23.58 1.24 Intr - 93993 93838 156 0 0 107 78 154 0.957 15.59 1.23 Intr - 96302 96246 57 2 0 62 54 79 0.100 0.06 1.22 Intr - 101315 101149 167 0 2 76 110 199 0.996 19.66 1.21 Intr - 102213 102054 160 0 1 89 97 146 0.738 14.24 1.20 Intr - 104383 104236 148 0 1 95 98 150 0.999 16.02 1.19 Intr - 104867 104765 103 0 1 114 99 -15 0.998 0.51 1.18 Intr - 105179 105059 121 2 1 106 79 103 0.999 10.35 1.17 Intr - 105410 105338 73 1 1 26 109 73 0.993 1.69 1.16 Intr - 106007 105897 111 1 0 96 98 97 0.999 10.08 1.15 Intr - 111322 111171 152 1 2 53 47 106 0.813 1.04 1.14 Intr - 114594 114520 75 0 0 91 93 34 0.885 2.99 1.13 Intr - 117204 117070 135 0 0 109 94 114 0.731 13.94 1.12 Intr - 119091 119078 14 1 2 44 110 9 0.416 -7.72 1.11 Intr - 119880 119795 86 2 2 81 110 43 0.677 4.34 1.10 Intr - 120376 120311 66 0 0 92 94 24 0.434 0.50 1.09 Intr - 122617 122515 103 2 1 79 105 50 0.830 4.11 1.08 Intr - 124470 124257 214 0 1 110 94 169 0.807 17.07 1.07 Intr - 127631 127418 214 1 1 101 20 218 0.790 14.10 1.06 Intr - 130445 130401 45 1 0 69 91 87 0.927 3.71 1.05 Intr - 130869 130702 168 2 0 60 89 46 0.502 0.04 1.04 Intr - 134284 134240 45 0 0 118 90 82 0.995 8.01 1.03 Intr - 140783 140739 45 1 0 74 94 42 0.499 0.01 1.02 Intr - 146437 146326 112 2 1 85 96 181 0.256 17.12 1.01 Init - 164544 164520 25 0 1 117 93 35 0.059 6.66 1.00 Prom - 170257 170218 40 -3.35 2.00 Prom + 173422 173461 40 -5.75 2.01 Sngl + 191873 192190 318 1 0 48 38 339 0.974 20.62 2.02 PlyA + 199809 199814 6 1.05 3.00 Prom + 210040 210079 40 -5.35 3.01 Init + 215692 215951 260 0 2 51 94 362 0.305 29.76 3.02 Intr + 217944 218095 152 0 2 64 110 179 0.975 16.59 3.03 Intr + 227917 227951 35 2 2 121 115 -9 0.980 2.12 3.04 Intr + 229306 229405 100 0 1 95 72 149 0.997 12.76 3.05 Intr + 230540 230665 126 0 0 68 76 134 0.999 9.93 3.06 Intr + 230829 231030 202 1 1 77 82 207 0.990 16.42 3.07 Intr + 231897 231970 74 0 2 109 94 80 0.999 8.83 3.08 Intr + 232895 233118 224 0 2 77 73 139 0.433 8.22 3.09 Intr + 234863 234989 127 1 1 105 71 129 0.999 12.23 3.10 Intr + 237099 237255 157 1 1 109 61 102 0.994 7.75 3.11 Intr + 237344 237502 159 2 0 85 66 174 0.606 13.08 3.12 Intr + 238435 239067 633 1 0 97 89 489 0.347 40.22 3.13 Intr + 239342 239458 117 2 0 81 110 64 0.400 6.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 164586 164747 162 2 0 72 39 175 0.851 7.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:150712024_150958461|GENSCAN_predicted_peptide_1|1408_aa MAATTANPEMTSDVPSLGPAIASGNSGPGIQGGGAIVQRAIKRRPGLDFDDDGEGNSKFL RCDDDQMSNDKERFARLGAVHENGIPEFPWLVLVQTLYIFGPYRDRLAKCFLCLDHVNLL YIGFAFMLTFILSDDEQSSADKERLARENHSEIERRRRNKMTAYITELSDMVPTCSALAR KPDKLTILRMAVSHMKSLRGTGNTSTDGSYKPSFLTDQELKHLILEAADGFLFIVSCETG RVVYVSDSVTPVLNQPQSEWFGSTLYDQVHPDDVDKLREQLSTSENALTGRILDLKTGTV KKEGQQSSMRMCMGSRRSFICRMRCGSSSVDPVSVNRLSFVRNRCRNGLGSVKDGEPHFV VVHCTGYIKAWPPAVASPRVTSSPNCTDMSNVCQPTEFISRHNIEGIFTFVDHRCVATVG YQPQELLGKNIVEFCHPEDQQLLRDSFQQVVKLKGQVLSVMFRFRSKNQEWLWMRTSSFT FQNPYSDEIEYIICTNTNVKNSSQEPRPTLSNTIQRPQLGPTANLPLEMGSGQLAPRQQQ QQTELDMVPGRDGLASYNHSQVVQPVTTTGPEHSKPLEKSDGLFAQDRDPRFSEIYHNIN ADQSKGISSSTVPATQQLFSQGNTFPPTPRPAENFRNSGLAPPVTIVQPSASAGQMLAQI SRHSNPTQGATPTWTPTTRSGFSAQVATQATAKTRTSQFGVGSFQTPSSFSSMSLPGAPT ASPGAAAYPSLTNRGSNFAPETGQTAGQFQTRTAEGVGVWPQWQGQQPHHRSSSSEQHVQ QPPAQQPGQPEVFQPITGADFRNPDGINLAPLMTSEEVVQKMTGLKVPLSHSRSNDTLYI PEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLV DCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIP EGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKG NKHWIIKNRMKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLI WEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNP NRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS TEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELP YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGK EYWLVKNRALMEVNMEINVVFVSANKKTILQPMDQEVILTFKPYKLRNTFCKGCSCHSDS SDGCRQRKLKTFWKGVTILDAIKNICDS >gi568815597r:150712024_150958461|GENSCAN_predicted_CDS_1|4227_bp atggcggcgactactgccaaccccgaaatgacatcagatgtaccatcactgggtccagcc attgcctctggaaactctggacctggaattcaaggtggaggagccattgtccagagggct attaagcggcgaccagggctggattttgatgatgatggagaagggaacagtaaatttttg aggtgtgatgatgatcagatgtctaacgataaggagcggtttgccagattgggagcagtg catgaaaatggtattcctgaattcccttggttggttcttgttcagactctgtatatcttt ggtccctacagagatcgattggcaaaatgctttctgtgtttagatcatgttaatttacta tatattggctttgcttttatgttgacctttatcttgtcggatgatgagcagagctctgcg gataaagagagacttgccagggaaaatcacagtgaaattgaacggcggcgacggaacaag atgacagcctacatcacagaactgtcagatatggtacccacctgtagtgccctggctcga aaaccagacaagctaaccatcttacgcatggcagtttctcacatgaagtccttgcgggga actggcaacacatccactgatggctcctataagccgtctttcctcactgatcaggaactg aaacatttgatcttggaggcagcagatggctttctgtttattgtctcatgtgagacaggc agggtggtgtatgtgtctgactccgtgactcctgttttgaaccagccacagtctgaatgg tttggcagcacactctatgatcaggtgcacccagatgatgtggataaacttcgtgagcag ctttccacttcagaaaatgccctgacagggcgtatcctggatctaaagactggaacagtg aaaaaggaaggtcagcagtcttccatgagaatgtgtatgggctcaaggagatcgtttatt tgccgaatgaggtgtggcagtagctctgtggacccagtttctgtgaataggctgagcttt gtgaggaacagatgcaggaatggacttggctctgtaaaggatggggaacctcacttcgtg gtggtccactgcacaggctacatcaaggcctggcccccagcagtggcatcacctagggta actagttctcccaactgtacagacatgagtaatgtttgtcaaccaacagagttcatctcc cgacacaacattgagggtatcttcacttttgtggatcaccgctgtgtggctactgttggc taccagccacaggaactcttaggaaagaatattgtagaattctgtcatcctgaagaccag cagcttctaagagacagcttccaacaggtagtgaaattaaaaggccaagtgctgtctgtc atgttccggttccggtctaagaaccaagaatggctctggatgagaaccagctcctttact ttccagaacccttactcagatgaaattgagtacatcatctgtaccaacaccaatgtgaag aactctagccaagaaccacggcctacactctccaacacaatccagaggccacaactaggt cccacagctaatttacccctggagatgggctcaggacagctggcacccaggcagcagcaa cagcaaacagaattggacatggtaccaggaagagatggactggccagctacaatcattcc caggtggttcagcctgtgacaaccacaggaccagaacacagcaagccccttgagaagtca gatggtttatttgcccaggatagagatccaagattttcagaaatctatcacaacatcaat gcggatcagagtaaaggcatctcctccagcactgtccctgccacccaacagctattctcc cagggcaacacattccctcctaccccccggccggcagagaatttcaggaatagtggccta gcccctcctgtaaccattgtccagccatcagcttctgcaggacagatgttggcccagatt tcccgccactccaaccccacccaaggagcaaccccaacttggacccctactacccgctca ggcttttctgcccaggtggctacccaggctactgctaagactcgtacttcccagtttggt gtgggcagctttcagactccatcctccttcagctccatgtccctccctggtgccccaact gcatcgcctggtgctgctgcctaccctagtctcaccaatcgtggatctaactttgctcct gagactggacagactgcaggacaattccagacacggacagcagagggtgtgggtgtctgg ccacagtggcagggccagcagcctcatcatcgttcaagttctagtgagcaacatgttcaa caaccgccagcacagcaacctggccagcctgaggtcttccagccgatcactggagctgac ttccgcaatcccgatggaataaatctagcacccctgatgaccagtgaagaggtggttcag aagatgactggactcaaagtacccctgtctcattcccgcagtaatgacaccctttatatc ccagaatgggaaggtagagccccagactctgtcgactatcgaaagaaaggatatgttact cctgtcaaaaatcagggtcagtgtggttcctgttgggcttttagctctgtgggtgccctg gagggccaactcaagaagaaaactggcaaactcttaaatctgagtccccagaacctagtg gattgtgtgtctgagaatgatggctgtggagggggctacatgaccaatgccttccaatat gtgcagaagaaccggggtattgactctgaagatgcctacccatatgtgggacaggaagag agttgtatgtacaacccaacaggcaaggcagctaaatgcagagggtacagagagatcccc gaggggaatgagaaagccctgaagagggcagtggcccgagtgggacctgtctctgtggcc attgatgcaagcctgacctccttccagttttacagcaaaggtgtgtattatgatgaaagc tgcaatagcgataatctgaaccatgcagttttggcagtgggatatggaatccagaaggga aacaagcactggataattaaaaacagaatgaaacggctggtttgtgtgctcttggtgtgc tcctctgcagtggcacagttgcataaagatcctaccctggatcaccactggcatctctgg aagaaaacctatggcaaacaatacaaggaaaagaatgaagaagcagtacgacgtctcatc tgggaaaagaatctaaagtttgtgatgcttcacaacctggagcattcaatgggaatgcac tcatacgatctgggcatgaaccacctgggagacatgaccagtgaagaagtgatgtctttg atgagttccctgagagttcccagccagtggcagagaaatatcacatataagtcaaaccct aatcggatattgcctgattctgtggactggagagagaaagggtgtgttactgaagtgaaa tatcaaggttcttgtggtgcttgctgggctttcagtgctgtgggggccctggaagcacag ctgaagctgaaaacaggaaagctggtgtctctcagtgcccagaacctggtggattgctca actgaaaaatatggaaacaaaggctgcaatggtggcttcatgacaacggctttccagtac atcattgataacaagggcatcgactcagacgcttcctatccctacaaagccatggatcag aaatgtcaatatgactcaaaatatcgtgctgccacatgttcaaagtacactgaacttcct tatggcagagaagatgtcctgaaagaagctgtggccaataaaggcccagtgtctgttggt gtagatgcgcgtcatccttctttcttcctctacagaagtggtgtctactatgaaccatcc tgtactcagaatgtgaatcatggtgtacttgtggttggctatggtgatcttaatgggaaa gaatactggcttgtgaaaaacagagctctaatggaggtgaacatggagattaatgttgtt ttcgtgtctgctaacaaaaagaccattctgcagcccatggatcaagaagtaattttgact ttcaagccttataagttaagaaacacattttgtaagggctgtagctgccatagtgattcc tctgatggatgtaggcaaagaaaattgaaaaccttctggaaaggcgtcactattctcgat gccattaagaatatttgtgattcatag >gi568815597r:150712024_150958461|GENSCAN_predicted_peptide_2|105_aa MGDVEKGKKIFVQKCAQCHTVEKGGKHKTGPNLHGLFSQKTGQAVGFSYTDANKNKGIIW GEDTLMEYLENPKKYIPGTKMIFAGIKKKAEKADLTAYLKKATNE >gi568815597r:150712024_150958461|GENSCAN_predicted_CDS_2|318_bp atgggtgatgttgagaaaggcaagaagatttttgttcagaagtgtgcccagtgtcacacc gtggaaaagggaggcaagcacaagactgggcctaatctccatggtctcttcagtcagaag acaggtcaggctgttggattctcttacacagatgccaataagaacaaaggcatcatctgg ggagaggatacgctgatggagtatttggaaaatcccaagaagtacatccctggaacaaaa atgatctttgccggcattaaaaagaaggcagaaaaggccgacttgacagcttatctcaaa aaagctactaatgagtaa >gi568815597r:150712024_150958461|GENSCAN_predicted_peptide_3|789_aa MSSLPGCIGLDAATATVESEEIAELQQAVVEELGISMEELRHFIDEELEKMDCVQQRKKQ LAELETWVIQKESEVAHVDQLFDDASRAVTNCESLVKDFYSKLGLQYRDSSSEDESSRPT EIIEIPDEDDDVLSIDSGDAGSRTPKDQKLREAMAALRKSAQDVQKFMDAVNKKSSSQDL HKGTLSQMSGELSKDGDLIVSMRILGKKRTKTWHKGTLIAIQTVGPGKKYKVKFDNKGKS LLSGNHIAYDYHPPADKLYVGSRVVAKYKDGNQVWLYAGIVAETPNVKNKLRFLIFFDDG YASYVTQSELYPICRPLKKTWEDIEDISCRDFIEEYVTAYPNRPMVLLKSGQLIKTEWEG TWWKSRVEEVDGSLVRILFLVLFFSTILEAEDDKRCEWIYRGSTRLEPMFSMKTSSASAL EKKQGQLRTRPNMGAVRSKGPVVQYTQDLTGTGTQFKPVEPPQPTAPPAPPFPPAPPLSP QAGDSESLESQLAQSRKQVAKKSTSFRPGSVGSGHSSPTSPALSENVSGGKPGINQTYRS PLGSTASAPAPSALPAPPAPPVFHGMLERAPAEPSYRAPMEKLFYLPHVCSYTCLSRVRP MRNEQYRGKNPLLVPLLYDFRRMTARRRVNRKMGFHVIYKTPCGLCLRTMQEIERYLFET GCDFLFLEMFCLDPYVLVDRKFQPYKPFYYILDITYGKEDVPLSCVNEIDTTPPPQVAYS KERIPGKGVFINTGPEFLVGCDCKDGCRDKSKCACHQLTIQATACTPGGQINPNSGYQYK RLEECLPTG >gi568815597r:150712024_150958461|GENSCAN_predicted_CDS_3|2367_bp atgtcttcccttcctgggtgcattggtttggatgcagcaacagctacagtggagtctgaa gagattgcagagctgcaacaggcagtggttgaggaactgggtatctctatggaggaactt cggcatttcatcgatgaggaactggagaagatggattgtgtacagcaacgcaagaagcag ctagcagagttagagacatgggtaatacagaaagaatctgaggtggctcacgttgaccaa ctctttgatgatgcatccagggcagtgactaattgtgagtctttggtgaaggacttctac tccaagctgggactacaataccgggacagtagctctgaggacgaatcttcccggcctaca gaaataattgagattcctgatgaagatgatgatgtcctcagtattgattcaggtgatgct gggagcagaactccaaaagaccagaagctccgtgaagctatggctgccttaagaaagtca gctcaagatgttcagaagttcatggatgctgtcaacaagaagagcagttcccaggatctg cataaaggaaccttgagtcagatgtctggagaactaagcaaagatggtgacctgatagtc agcatgcgaattctgggcaagaagagaactaagacttggcacaaaggcacccttattgcc atccagacagttgggccagggaagaaatacaaggtgaaatttgacaacaaaggaaagagt ctactgtcggggaaccatattgcctatgattaccaccctcctgctgacaagctgtatgtg ggcagtcgggtggtcgccaaatacaaagatgggaatcaggtctggctctatgctggcatt gtagctgagacaccaaacgtcaaaaacaagctcaggtttctcattttctttgatgatggc tatgcttcctatgtcacacagtcggaactgtatcccatttgccggccactgaaaaagact tgggaggacatagaagacatctcctgccgtgacttcatagaggagtatgtcactgcctac cccaaccgccccatggtactgctcaagagtggccagcttatcaagactgagtgggaaggc acgtggtggaagtcccgagttgaggaggtggatggcagcctagtcaggatcctcttcctg gtactgttcttctctacaattttggaggcagaggatgacaaaagatgtgagtggatctat cgaggctctacacggctggagcccatgttcagcatgaaaacatcctcagcctctgcactg gagaagaagcaaggacagctcaggacacgtccaaatatgggtgctgtgaggagcaaaggc cctgttgtccagtacacacaggatctgaccggtactggaacccagttcaagccagtggaa cccccacagcctacagctccacctgccccacctttcccacctgctccacctctatccccc caagcaggtgacagtgaaagcttggaaagccagcttgcccagtcacggaagcaggtagcc aaaaagagcacgtcctttcgaccaggatctgtgggctctggtcattcctcccctacatct cctgcactcagtgaaaatgtctctggtgggaaacctgggatcaaccagacatatagatca cctttaggctccacagcctctgccccagcaccctcagcactcccggcccctccagcaccc ccagtcttccatggcatgctggagcgggccccagcagagccctcctaccgtgctcccatg gagaagcttttctacttacctcatgtctgcagctatacctgtctgtctcgagtcagacct atgaggaatgagcagtaccggggcaagaaccctctgctggtcccgttactatatgacttc cggcggatgacagcccggcgtcgagttaaccgcaagatgggctttcatgttatctataag acaccttgtggtctctgccttcggacaatgcaggagatagaacgctaccttttcgagact ggctgtgacttcctcttcctggagatgttctgtttggatccatatgttcttgtggaccga aagtttcagccctataagcctttttactatattttggacatcacttatgggaaggaagat gttcccctatcctgtgtcaatgagattgacacaacccctccaccccaggtggcctacagc aaggaacgtatcccgggcaagggtgttttcattaacacaggccctgaatttctggttggc tgtgactgcaaggatgggtgtcgggacaagtccaagtgtgcctgccatcaactaactatc caggctacagcctgtaccccaggaggccaaatcaaccctaactctggctaccagtacaag agactagaagagtgtctacccacaggn