GENSCAN 1.0	Date run: 3-Nov-116	Time: 17:12:02

Sequence gi568815576f:32375003_32598527 : 223525 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1680   1719   40  2  1  105  110    70 0.792   9.29
 1.02 Intr +   7288   7394  107  2  2   44   87    27 0.220  -1.87
 1.03 Term +  10454  10633  180  1  0   88   42   125 0.472   5.51
 1.04 PlyA +  11900  11905    6                               1.05

 2.13 PlyA -  12814  12809    6                              -3.24
 2.12 Term -  13097  12990  108  2  0   88   42   120 0.958   5.91
 2.11 Intr -  17357  17238  120  2  0   99  107    70 0.998  10.69
 2.10 Intr -  19000  18890  111  1  0   71   91    78 0.991   6.98
 2.09 Intr -  20212  20024  189  1  0  116   70   155 0.970  16.28
 2.08 Intr -  21287  21072  216  2  0   28   92   218 0.628  14.90
 2.07 Intr -  23098  22994  105  1  0   57   89   153 0.803  12.71
 2.06 Intr -  24757  24601  157  0  1   53   93   129 0.900   9.91
 2.05 Intr -  26901  26745  157  1  1  111   71   121 0.882  11.77
 2.04 Intr -  31759  31660  100  1  1   73  110    96 0.983   9.98
 2.03 Intr -  33240  33173   68  1  2  107   84    74 0.983   7.52
 2.02 Intr -  33831  33753   79  0  1   80  103    66 0.994   6.42
 2.01 Init -  37154  37062   93  2  0  104  110   149 0.989  19.08
 2.00 Prom -  37842  37803   40                              -3.66

 3.09 PlyA -  38492  38487    6                               1.05
 3.08 Term -  49786  49550  237  1  0  -20   49   571 0.684  38.57
 3.07 Intr -  57541  57371  171  1  0   26   88   167 0.872  10.64
 3.06 Intr -  58770  58717   54  0  0   77  114    27 0.140   3.38
 3.05 Intr -  60878  60702  177  2  0  131   78   122 0.096  15.62
 3.04 Intr -  62849  62758   92  0  2   47   88   113 0.103   6.91
 3.03 Intr -  67729  67669   61  1  1   64   96    60 0.565   2.71
 3.02 Intr -  70696  70633   64  0  1   92  101    27 0.776   3.12
 3.01 Init -  71838  71729  110  0  2   70   75    27 0.379  -0.71
 3.00 Prom -  72341  72302   40                              -1.06

 4.00 Prom +  74111  74150   40                              -1.76
 4.01 Init + 100001 100122  122  1  2   97  101   117 0.935  12.29
 4.02 Intr + 103979 104273  295  2  1   70   64   139 0.978   6.71
 4.03 Intr + 108895 109122  228  0  0  124  111    -6 0.905   3.37
 4.04 Intr + 110066 110207  142  1  1  113   46   127 0.940  10.93
 4.05 Intr + 112743 112826   84  1  0  102   58    19 0.489   0.09
 4.06 Intr + 116084 116179   96  0  0   28  116    49 0.648   1.68
 4.07 Intr + 118103 118279  177  0  0  124   45    47 0.840   3.89
 4.08 Intr + 122529 122707  179  1  2   44   70    88 0.547   2.24
 4.09 Term + 130774 131082  309  0  0   84   38   128 0.051   2.46
 4.10 PlyA + 132668 132673    6                               1.05

 5.11 PlyA - 133041 133036    6                               1.05
 5.10 Term - 138822 138690  133  2  1  113   42   135 0.994   8.96
 5.09 Intr - 143332 143041  292  2  1   87  108   106 0.745   8.69
 5.08 Intr - 153003 152916   88  0  1   98   86   -18 0.299  -1.46
 5.07 Intr - 154006 153872  135  1  0  135   86   113 0.998  16.56
 5.06 Intr - 157701 157618   84  0  0   96   75    26 0.573   2.12
 5.05 Intr - 158893 158791  103  0  1  129   78   163 0.971  19.58
 5.04 Intr - 161201 161154   48  1  0   99   56    47 0.559   0.40
 5.03 Intr - 163108 163034   75  0  0   90   94    91 0.996   8.53
 5.02 Intr - 166711 166569  143  1  2   93   84   391 0.995  38.35
 5.01 Init - 178996 178928   69  1  0   72   66   116 0.457   6.93
 5.00 Prom - 187236 187197   40                              -5.76

 6.00 Prom + 188487 188526   40                              -6.36
 6.01 Sngl + 189624 190205  582  2  0   68   41   192 0.170   8.69
 6.02 PlyA + 190372 190377    6                              -0.45

 7.00 Prom + 194033 194072   40                              -3.76
 7.01 Init + 201439 201519   81  0  0   57   63    62 0.125   1.57
 7.02 Term + 207699 207812  114  2  0   53   41   148 0.535   5.17
 7.03 PlyA + 209918 209923    6                               1.05

 8.03 PlyA - 210433 210428    6                               1.05
 8.02 Term - 212804 212718   87  2  0   36   42   106 0.348  -1.54
 8.01 Intr - 221734 221672   63  1  0  127   86    16 0.295   4.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 130280 130232   49  2  1   72  131    36 0.831   7.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_1|108_aa
MAPVLLLLLSHFPASSPTTGYTFKASEILMFKRGICLPMVTETLVTTAKYENEETYRVRT
FQIGTGEKLVKALVPAFLGRTISSYATCLGPSWDFITVPDFMEELFTR

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_1|327_bp
atggctcctgtcctcctgctgctcctctctcacttcccagctagcagtcctaccactggg
tatacctttaaagcaagtgaaatcctcatgttcaaaagaggtatctgccttcccatggtg
actgaaacactagtcacgacagccaagtatgaaaatgaggagacttacagggtgcgaaca
ttccagataggtacaggggagaaactggtgaaggccctggttccagcctttctgggtaga
accatctcctcctatgccacctgtttgggcccctcctgggactttatcaccgtgccagac
ttcatggaggaactgtttaccaggtga

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_2|500_aa
MSRSYNDELQFLEKINKNCWRIKKGFVPNMQVEGVFYVNDALEKLMFEELRNACRGGGVG
GFLPAMKQIGNVAALPGIVHRSIGLPDVHSGYGFAIGNMAAFDMNDPEAVVSPGGVGFDI
NCGVRLLRTNLDESDVQPVKEQLAQAMFDHIPVGVGSKGVIPMNAKDLEEALEMGVDWSL
REGYAWAEDKEHCEEYGRMLQADPNKVSARAKKRGLPQLGTLGAGNHYAEIQVVDEIFNE
YAAKKMGIDHKGQGKLTGLDFVALVPDALVAMEKAMKRDKIIVNDRQLACARIASPEGQD
YLKGMAAAGNYAWVNRSSMTFLTRQAFAKVFNTTPDDLDLHVIYDVSHNIAKVEQHVVDG
KERTLLVHRKGSTRAFPPHHPLIAVDYQLTGQPVLIGGTMGTCSYVLTGTEQGMTETFGT
TCHGAGRALSRAKSRRNLDFQDVLDKLADMGIAIRVASPKLVMEEAPESYKNVTDVVNTC
HDAGISKKAIKLRPIAVIKG

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_2|1503_bp
atgagtcgcagctataatgatgagctgcagttcttggagaagatcaataaaaactgctgg
aggatcaagaagggcttcgtgcccaacatgcaggttgaaggtgttttctatgtgaatgat
gctctggagaaattgatgtttgaggaattaaggaatgcctgtcgaggtggtggtgttggt
ggcttcctgccagccatgaaacagattggcaatgtggcagccctgcctggaattgttcat
cgatctattgggcttcctgatgtccattcaggatatgggtttgctattgggaacatggca
gcctttgatatgaatgaccctgaagcagtagtatccccaggtggtgtcgggtttgacatc
aactgtggtgtccgcttgctaagaaccaatttagatgaaagtgatgtccagcctgtgaag
gagcaacttgcccaagctatgtttgaccacattcctgttggggtggggtcaaaaggtgtc
atcccaatgaatgccaaagacttggaggaggccttggagatgggggtggactggtcctta
agagaagggtatgcctgggctgaagacaaggagcactgcgaggagtacggaaggatgctg
caggctgaccccaataaagtttctgcaagggcgaagaaaagaggccttcctcagttgggg
accctgggagcaggcaaccattatgcagaaatccaggttgtggatgagattttcaatgag
tatgctgctaaaaaaatgggcatcgaccataagggacagggaaagctaactggtttggac
tttgtggctcttgttccagatgcgctggtagctatggagaaggccatgaagagagacaag
attatagtcaatgatcggcagttggcttgtgctcgaatcgcttccccagagggtcaagac
tatctgaagggaatggcagctgctgggaactatgcctgggtcaaccgctcttccatgacc
ttcttaacccgtcaggctttcgccaaggtcttcaacacaacccctgatgacttggaccta
catgtgatctatgatgtttctcacaacattgccaaagtggagcagcatgtggtggacgga
aaggaacggacactgttagtacacaggaagggatccacccgcgctttccctcctcaccat
cccctcattgctgttgattaccaactcactggacagccagtgctcattggtggcaccatg
ggaacctgtagttatgttcttactggcactgaacagggcatgactgagacctttggaaca
acctgtcatggagcgggccgtgcattgtcccgagcaaaatctcgacgtaatttagatttc
caggatgtcttagacaaattggcagatatgggaattgcgatccgtgttgcctcacccaaa
ctggttatggaagaggctcctgagtcctataagaatgtgacagatgtggtaaatacctgc
catgatgctggaatcagcaagaaagccattaaactgagaccaattgctgtgatcaaagga
tag

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_3|321_aa
MESFSLTQRLKNGSQLKPEPNWKAFAPSCRRRKSDSSVLYNSFAEPMEKPILKNLNEMLC
PIIASEVKALNANLSTLEVLTKIDNYTLLDYSLISSPEITENYLDLNLKGVFYPLENLTD
PPFSPVPFVLPERSNSMLYIGIAEYFFKSASFAHFTAGVFNVTLSTEEISNHFVQNSQGL
GNVLSRIAEIYILSQPFMVRIMATEPPIINLQPGNFTLDIPASIMMLTQPKNSTVETIVS
MDFEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKKKKRRRR
RRRRRRRRRRRRRRKKITLKI

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_3|966_bp
atggagtctttctccctcacacagaggttgaagaatggtagtcagcttaagcctgagccc
aactggaaggcctttgcaccctcgtgtagaaggaggaaatcagattcaagtgttctgtat
aactcctttgctgagcccatggagaaacccattttaaagaacttaaatgaaatgctctgt
cccattattgcaagtgaagtcaaagcgctaaatgccaacctcagcacactggaggtttta
accaagattgacaactacactctgctggattactccctaatcagttctccagaaattact
gagaactaccttgacctgaacttgaagggtgtattctacccactggaaaacctcaccgac
ccccccttctcaccagttccttttgtgctcccagaacgcagcaactccatgctctacatt
ggaatcgccgagtatttctttaaatctgcgtcctttgctcatttcacagctggggttttc
aatgtcactctctccaccgaagagatttccaaccattttgttcaaaactctcaaggcctt
ggcaacgtgctctcccggattgcagagatctacatcttgtcccagcccttcatggtgagg
atcatggccacagagcctcccataatcaatctacaaccaggcaatttcaccctggacatc
cctgcctccatcatgatgctcacccaacccaagaactccacagttgaaaccatcgtttcc
atggacttcgaagaagaagaagaagaagaagaagaagaagaagaggaagaagaagaggaa
gaagaagaggaagaagaagaggaagaagaagaggaagaagaagaggaagaagaagaagaa
gaagaagaagaagaagaagaagaagaagaagaagagaagaagaagaagaggaggaggagg
aggaggaggaggaggagaagaagaagaagaagaagaagaagaaagaaaataacacttaag
atttag

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_4|543_aa
MRLRVRLLKRTWPLEVPETEPTLGHLRSHLRQSLLCTWGYSSNTRFTITLNYKDPLTGDE
ETLASYGIVSGDLICLILQDDIPAPNIPSSTDSEHSSLQNNEQPSLATSSNQTSMQDEQP
SDSFQGQAAQSGVWNDDSMLGPSQNFEAESIQDNAHMAEGTGFYPSEPMLCSESVEGQVP
HSLETLYQSADCSDANDALIVLIHLLMLESGYIPQGTEAKALSMPEKWKLSGVYKLQYMH
PLCEGSSATLTCVPLGNLIVVNATLKINNEIRSVKRLQLLPESFICKEKLGENVANIYKD
LQKLSRLFKDQLVYPLLAFTRQALNLPDVFGLVVLPLELKLRIFRLLDVRSVLSLSAVCR
DLFTASNDPLLWRFLYLRDFRVVSWNGIYSWTTKDFEYYINLVDKTEARFEKVDSDFESF
TVGKMPSNSIACTEKSFMKGRHALKGLKPVITRLLQHGLLKPINSPYNSPILPVLKPDKP
YKLVQDLHRINQIVLPIPPVGPNPYTLLSSIPPSTTHYSVLDRKDAFFTIPLHPSSQPLF
AFT

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_4|1632_bp
atgaggctgcgggtgcggcttctgaagcggacctggccgctggaggtgcccgagacggag
ccgacgctggggcatttgcgctcgcacctgaggcagtccctgctgtgcacctgggggtac
agttctaatacccgatttacaattacattgaactacaaggatcccctcactggagatgaa
gagaccttggcttcatatgggattgtttctggggacttgatatgtttgattcttcaagat
gacattccagcgcctaatataccttcatccacagattcagagcattcttcactccagaat
aatgagcaaccctctttggccaccagctccaatcagactagcatgcaggatgaacaacca
agtgattcattccaaggacaggcagcccagtctggtgtttggaatgacgacagtatgtta
gggcctagtcaaaattttgaagctgagtcaattcaagataatgcgcatatggcagagggc
acaggtttctatccctcagaacccatgctctgtagtgaatcggtggaagggcaagtgcca
cattcattagagaccttgtatcaatcagctgactgttctgatgccaatgatgccttgata
gtgttgatacatcttctcatgttggagtcaggttacatacctcagggcaccgaagccaaa
gcactgtccatgccggagaagtggaagttgagcggggtgtataagctgcagtacatgcat
cctctctgcgagggcagctccgctactctcacctgtgtgcctttgggaaacctgattgtt
gtaaatgctacactaaaaatcaacaatgagattagaagtgtgaaaagattgcagctgcta
ccagaatcttttatttgcaaagagaaactaggggaaaatgtagccaacatatacaaagat
cttcagaaactctctcgcctctttaaagaccagctggtgtatcctcttctggcttttacc
cgacaagcactgaacctaccagatgtatttgggttggtcgtcctcccattggaactgaaa
ctacggatcttccgacttctggatgttcgttccgtcttgtctttgtctgcggtttgtcgt
gacctctttactgcttcaaatgacccactcctgtggaggtttttatatctgcgtgatttt
cgagtggtttcttggaatggaatctattcctggacaacaaaggatttcgaatattacata
aatttagttgacaaaacagaagcaaggtttgagaaggttgactctgattttgaaagtttt
actgtgggtaaaatgccatcaaacagcatcgcatgtacagagaaatctttcatgaaagga
agacacgctttaaaaggattaaagcctgttatcactcgcctgctacagcatggcctttta
aagcctataaactctccttacaattcccctattttacctgtcctaaaaccagacaagcct
tacaagttagttcaggatctgcaccgtatcaaccaaattgttttgcctatcccccctgtg
gggcccaacccgtacactcttttgtcctcaatacctccctccacaactcactattccgtg
cttgatcgtaaagatgctttcttcactattcccctgcacccctcgtcccagcctctcttt
gctttcacttag

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_5|389_aa
MMVWTVVVAVEVAMSLLQLLLPHIKVENQLDFQDITSVVAMAKTYATTEAFIDSKYDIRI
QKIGSNYKAYMRTSISGNWKANTGSAMLEQVAMTERCSRDSRGHLLYGPSGMYRLWVDSC
SEMFGGLDICAVKAVHSKDGRDYIIEGPTAKQDQVTQWVGAQSCQEGVHTQALEVMDSSM
PLIGEHVEEDRQLMADLVVSKMSQLPMPGGTAPSPLRPWAPQIKSAKSPGQAQLGPQLGQ
PQPRPPPQGGPRQAQSPQPQRSGSPSQQRLSPQGQQPLSPQSGSPQQQRSPGSPQLSRAS
SGSSPNQASKPGATLASQPRPPVQGRSTSQQGEESKKPAPPHPHLNKSQSLTNSLSTSDT
SQRGTPSEDEAKAETIRNLRKSFASLFSD

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_5|1170_bp
atgatggtgtggaccgtggtggtggcagtggaggtggcaatgtctctccttcagctgctg
ctcccccatatcaaagtggaaaaccagcttgacttccaggacatcaccagcgtggtcgcc
atggccaaaacctacgccaccaccgaggccttcatcgactccaagtacgacatccgcatc
cagaaaattggatccaactacaaggcttacatgagaacctccatctctgggaactggaag
gccaacacaggctctgccatgctggagcaggtggccatgacagagaggtgttcacgggac
agtcgtggccatcttctctatggcccttccggaatgtacaggctgtgggtggacagctgc
tcggaaatgtttggcggcctggacatctgtgccgtcaaggctgtccacagcaaggatggc
agagattacatcatcgaggggcccacggccaagcaggaccaggtgactcagtgggtggga
gcccagtcatgccaggagggtgttcacacccaggccttggaggtaatggacagctcaatg
ccgctgattggagagcatgtggaagaggacagacagctgatggccgaccttgttgtctcc
aaaatgagccagctcccgatgccaggaggcacagcgccctcccccctcagaccttgggct
ccacagattaaatcagcgaaatccccagggcaagcccagctggggcctcagctaggccag
ccccagccacgcccacctccgcaaggaggccctcgccaagctcagtctcctcagccccag
agatctggaagcccctcccaacagaggctctccccacaaggccagcagcccctgagcccc
cagtccggatctccacagcagcaaaggtcaccaggctctccgcagctatcccgggcatcc
agtggcagctccccaaaccaggcctccaagccaggtgccaccctcgcctcacagccccgg
ccccctgtgcagggccgtagtacctcccagcagggtgaagagtccaagaagccagcacca
ccccatccgcatctcaacaaatctcagtccctgactaacagcctcagcacatccgacacc
tcccagcgtgggaccccaagtgaagacgaggccaaggctgaaaccatccgcaacctgagg
aagtcttttgccagcctgttctctgactaa

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_6|193_aa
MHPNGAPGLYTNSAPRLHPKSALGLHPNSAPGLHPNSAPGLHPVLPHCTQTVLLGCTQTV
LPHCTQTVLPDYTQTVLPDCTLTALPDCTLTVLPDYTQTVLPDCTQTVLLDCTQTVLLDC
TQTVLPDCTQTVLPDCTQTVISDCTQTVLPDCTQTVLPDCTQTVLPDCTQTVLLGCTQTV
LSDCTQTVLSDCT

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_6|582_bp
atgcacccaaacggtgctcccggactgtacacaaacagtgctcccagactgcacccaaag
agtgctctcggactgcacccaaacagtgctcccggactgcacccaaacagtgctcctggg
ctgcacccagtgctcccgcattgtacccaaacagtgctcctgggctgcacccaaacagtg
ctcccgcattgtacccaaacagtgctcccggactacacccaaacagtgctcccggactgc
accctaacagcgctcccggactgcaccctaacagtgctcccggattacacccaaacagtg
ctcccggattgtacccaaacagtgctcctggactgcacccaaacagtgctcctggactgc
acccaaacagtgctcccggactgcacccaaacagtgctcccggactgcacccaaacagtg
atctcagactgcacccaaacagtgctccctgactgtacacaaacagtgctcccggactgc
acccagacagtgctcccggactgcacccaaacagtgctcctgggctgcacccaaacagtg
ctctcggactgcacccaaacagtgctctcggactgcacctaa

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_7|64_aa
MNYEAQGLAPSMRSINTRFMMVTIFRKGLPGDTDAASTSIGHTEQQGYRRAQDRLTSGPI
AQII

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_7|195_bp
atgaactacgaagcacagggcctggcacctagtatgcggtcaataaatacacggtttatg
atggtgactattttccgtaaggggctcccaggtgatacggatgctgccagtacttccata
gggcacactgagcagcaaggctaccgaagagctcaggaccgtctgacttcagggcccatt
gcccagatcatctga

>gi568815576f:32375003_32598527|GENSCAN_predicted_peptide_8|49_aa
VTAPHFPVVVKLGHAHAGMGKTPLSVPGDERAAPPGDVSSSASILHTLA

>gi568815576f:32375003_32598527|GENSCAN_predicted_CDS_8|150_bp
gtcacagccccacacttcccggtggtagtcaagctgggacatgcccacgctggaatggga
aagacacccctgtctgttccgggtgatgagagggccgctcctccaggcgatgtttccagc
agcgccagcatcttgcacacgctggcttag