GENSCAN 1.0	Date run: 3-Nov-116	Time: 12:24:57

Sequence gi568815593r:156819460_157063198 : 243739 bp : 41.28% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 PlyA -   1661   1656    6                               1.05
 1.02 Term -  10649  10480  170  0  2   73   33   135 0.494   3.56
 1.01 Init -  12664  12601   64  1  1   54   98    34 0.392   2.37
 1.00 Prom -  16965  16926   40                              -4.85

 2.00 Prom +  30438  30477   40                              -3.85
 2.01 Sngl +  31104  31721  618  2  0   91   43   917 0.816  83.44
 2.02 PlyA +  32344  32349    6                               1.05

 3.00 Prom +  33341  33380   40                              -5.75
 3.01 Init +  41533  41621   89  0  2   83   41   108 0.505   5.76
 3.02 Intr +  59064  59261  198  0  0  -40   42   247 0.271   4.84
 3.03 Term +  59385  59547  163  0  1   30   47   185 0.378   5.03
 3.04 PlyA +  60100  60105    6                               1.05

 4.00 Prom +  64733  64772   40                              -5.15
 4.01 Init +  70269  70410  142  2  1   52   98   137 0.822  11.44
 4.02 Term +  91042  91193  152  2  2   93   49    62 0.040  -0.11
 4.03 PlyA +  91683  91688    6                              -0.45

 5.09 PlyA -  92587  92582    6                               1.05
 5.08 Term -  94957  94740  218  2  2   69   54   117 0.278   2.72
 5.07 Intr - 102757 102640  118  1  1   78   94    77 0.716   6.42
 5.06 Intr - 106853 106804   50  0  2  115   98    49 0.983   6.08
 5.05 Intr - 108328 108281   48  2  0   97   81    55 0.254   3.43
 5.04 Intr - 130272 130192   81  1  0  124   75    81 0.871   9.09
 5.03 Intr - 132331 132053  279  2  0   50  101   299 0.999  23.93
 5.02 Intr - 135297 134956  342  1  0  110   99   141 0.649  11.88
 5.01 Init - 143739 143682   58  0  1   72   80    24 0.482   1.53
 5.00 Prom - 143806 143767   40                              -7.45

 6.00 Prom + 161330 161369   40                              -6.75
 6.01 Init + 166023 166168  146  2  2   81   58   116 0.757   7.54
 6.02 Intr + 175050 175089   40  0  1   93   84    22 0.009  -0.39
 6.03 Term + 188154 188432  279  2  0   71   41   250 0.813  13.06
 6.04 PlyA + 189980 189985    6                               1.05

 7.00 Prom + 193894 193933   40                              -3.65
 7.01 Init + 195715 195794   80  0  2   83   47    55 0.148   1.48
 7.02 Intr + 204257 204365  109  2  1  109   91    25 0.628   4.27
 7.03 Term + 205961 206116  156  1  0   65   48    77 0.553  -1.65
 7.04 PlyA + 206521 206526    6                               1.05

 8.11 PlyA - 209156 209151    6                               1.05
 8.10 Term - 210382 210274  109  0  1   76   47   119 0.913   3.60
 8.09 Intr - 213428 213395   34  0  1   82  121    16 0.935   0.66
 8.08 Intr - 217902 217788  115  0  1   88  106    84 0.976   9.30
 8.07 Intr - 218079 217980  100  2  1   36  106    39 0.149  -0.31
 8.06 Intr - 228629 228445  185  2  2   55  110    39 0.065   0.46
 8.05 Intr - 229125 228995  131  1  2   89   98    -9 0.059  -0.31
 8.04 Intr - 233195 232902  294  0  0   70   98   407 0.073  35.96
 8.03 Intr - 236074 235742  333  2  0  130  113   122 0.990  13.52
 8.02 Intr - 238496 238439   58  2  1  101  111     2 0.624   1.34
 8.01 Init - 240950 240840  111  2  0   58   61    93 0.483   3.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 233190 232907  284  1  2   10   46   566 0.906  39.20

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_1|77_aa
MPVFALCPGDPRPGVACGFWEEWSSIVMMEKGVPVIIFTLQGPGGWVWCEVRRNILSFKE
HFPEIAQNQLTIRWVVA

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_1|234_bp
atgccagtttttgccctgtgtcctggagacccacgacctggcgtggcatgtgggttttgg
gaagaatggagctccattgtcatgatggaaaagggtgttccagtcatcatattcacgttg
caaggaccaggtggatgggtctggtgcgaggtgaggagaaacatcctctcctttaaagaa
cactttccagaaattgcacaaaatcaacttacaatccgttgggttgtggcatag

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_2|205_aa
MAASTASHRPIKGILKNKTSTTSSMVASAEQPRRSVDEELSKKSQKWDEINILATYHPAD
KGYGLMKIDEPSPPYHSMMGDDEDACRDTETTEAMAPDILAKKLAAAEGLEPKYRIQEQE
SSGEEDSDLSPEEREKKRQFEMRRKLHYNEGLNIKLARQLISKDLHDDDEDEEMLETADG
ESMNTEESNQGSTPSDQQQNKLRSS

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_2|618_bp
atggcggcctcgacggcctcccaccggcccatcaaggggatcttgaagaacaagacctct
acgacttcctctatggtggcgtcggccgaacagccccgcaggagtgtcgacgaggagctg
agcaaaaaatcccagaagtgggatgaaattaacatcttggcgacctatcatccagcagac
aaaggctatggtttaatgaaaatagatgaaccaagccctccttaccatagtatgatgggt
gatgatgaagatgcgtgtagggacaccgagaccactgaagccatggcgccagacatccta
gccaagaaattagctgctgctgaaggcttggagccaaagtaccggattcaggaacaagaa
agcagtggagaggaggatagtgacctctcacctgaagaacgagaaaaaaagcgacaattt
gaaatgagaaggaagcttcactacaatgaaggactcaatatcaaactagccagacaatta
atttcaaaagacctacatgatgatgatgaagatgaagaaatgttagagactgcagatgga
gaaagcatgaatacggaagaatcaaatcaaggatctactccaagtgaccaacagcaaaac
aaattacgaagttcatag

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_3|149_aa
MATQNVMCSQQQQQQQQNPRDCSKCKILAGNETAQCCGELQGQWCGHDGQSPPRPPVIEA
RLSKTDSKPGERSKKLDGAVNLPQDMQQRVSDLHEDLIVFRDVKFSPSTRAASQRRASCG
SPPLVNLKRLKGDAPGDKKLSLQELTNSG

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_3|450_bp
atggctactcagaatgtaatgtgtagccagcagcagcagcagcagcagcaaaatccaaga
gattgttcaaaatgcaaaatcttagccggaaatgaaacagcccagtgttgtggagaattg
cagggccagtggtgtggtcatgatggacagtctcctccaaggccacctgtgattgaagcc
cgattgtcaaaaacagacagtaaaccaggagagagaagcaagaagttagatggtgctgtc
aacttgccacaggatatgcagcaaagggtatcggacttgcatgaagatctcattgtcttc
cgggatgtgaagttctctcctagtaccagggctgcaagccagagaagagccagctgtggg
tccccacctctagtgaacctcaagagactcaaaggggatgctccaggagacaagaagctc
tctcttcaggagctgaccaacagtgggtag

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_4|97_aa
MWEVGYEKAWKETLDNWTLSLKHAGFQVPVSVRVEMASRLLEILREPGTSEIIQIIGIMK
WVIKIEMALQTVSCLGSGTGTLLFLTMPQLPTGPFGT

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_4|294_bp
atgtgggaggtcggatatgagaaagcctggaaggagacgctagacaactggacgctgtct
ttgaaacatgctggcttccaagtgccagtgagtgtgagggtggaaatggctagcaggctg
ttggaaatactcagagaaccaggcacatcagaaattatccaaataattggaatcatgaaa
tgggtgataaagatagaaatggctttgcagactgtaagctgcttaggatcagggactggg
actctcttgttcctcacaatgccccagctccccacagggccctttggcacatag

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_5|397_aa
MSKEPLILWLMIEFWWLYLTPVTSETVVTEVLGHRVTLPCLYSSWSHNSNSMCWGKDQCP
YSGCKEALIRTDGMRVTSRKSAKYRLQGTIPRGDVSLTILNPSESDSGVYCCRIEVPGWF
NDVKINVRLNLQRASTTTHRTATTTTRRTTTTSPTTTRQMTTTPAALPTTVVTTPDLTTG
TPLQMTTIAVFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLPSDSWSSV
ESTSADTVLLTSKVLVDSSGHGQVSQVGTASDTAVPEQNKTTKTGQMDGIPMSMKNEMPI
SQLLMIIAPSLGFVLFALFVAFLLREEAVKGEPGRKLNEARWDTFKWECVELTHEELKPA
VSRLNERLTGIQQWVAGETLCLSKENRKLRSGLPGWR

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_5|1194_bp
atgtccaaagaacctctcattctctggctgatgattgagttttggtggctttacctgaca
ccagtcacttcagagactgttgtgacggaggttttgggtcaccgggtgactttgccctgt
ctgtactcatcctggtctcacaacagcaacagcatgtgctgggggaaagaccagtgcccc
tactccggttgcaaggaggcgctcatccgcactgatggaatgagggtgacctcaagaaag
tcagcaaaatatagacttcaggggactatcccgagaggtgatgtctccttgaccatctta
aaccccagtgaaagtgacagcggtgtgtactgctgccgcatagaagtgcctggctggttc
aacgatgtaaagataaacgtgcgcctgaatctacagagagcctcaacaaccacgcacaga
acagcaaccaccaccacacgcagaacaacaacaacaagccccaccaccacccgacaaatg
acaacaaccccagctgcacttccaacaacagtcgtgaccacacccgatctcacaaccgga
acaccactccagatgacaaccattgccgtcttcacaacagcaaacacgtgcctttcacta
accccaagcacccttccggaggaagccacaggtcttctgactcccgagccttctaaggaa
gggcccatcctcactgcagaatcagaaactgtcctccccagtgattcctggagtagtgtt
gagtctacttctgctgacactgtcctgctgacatccaaagtcttagtggacagctcaggc
catggccaggtgtctcaagtaggaactgcatctgatacagcagttcctgagcagaacaaa
acaacaaaaacaggacagatggatggaatacccatgtcaatgaagaatgaaatgcccatc
tcccaactactgatgatcatcgccccctccttgggatttgtgctcttcgcattgtttgtg
gcgtttctcctgagagaggaagctgtaaagggagaacctggcaggaaacttaatgaagcc
aggtgggacacatttaaatgggaatgtgttgaactaactcatgaagagctaaagccagct
gtcagcaggcttaatgaaaggttaacaggaatccagcaatgggtggctggggagactctt
tgtctctctaaagaaaacaggaagttgagaagtgggctccctgggtggaggtga

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_6|154_aa
MKKNQSKKAENSQNQNGSSPKDHNSSPAREQNWTVNEFDELTEEVSEDRLYKTNPTPHIM
PSGCRLLVVPPCWNLDSGSLIPTTPLGSAQLGTLYEGSNPTFHLGIVTVEALAGKGSTPQ
QASAWAPRLSHSTSETQVEAAKPSSLLHSVHLQA

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_6|465_bp
atgaagaaaaaccagagcaaaaaggctgaaaattcccaaaaccagaatggctcttctcca
aaggatcacaactcctctccagcaagggaacaaaactggacggtgaatgaatttgatgaa
ttgacagaggaggtttcagaagatagattgtataaaacaaatcctacacctcatatcatg
ccatctggttgccggctgctggtggtgccaccatgctggaatctggacagtggcagtctt
attccaacaactccactaggcagtgcccaactggggactctgtatgagggctccaacccc
acatttcaccttggtattgtgacagtagaggctcttgcagggaagggctccaccccgcag
caggcttctgcctgggcacccaggctttcccattcaacctctgaaacccaggtagaagct
gccaagccttcttcactcttgcattctgtacacctacaggcctaa

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_7|114_aa
MDEAGNHHSQQTITRTENQTLDVLTHREHYEPVTNIHKLDHENQSVNKLPLHFLGLPIFW
RLQKQPLTNQLKLHHFLPPNKHPLPQWFSTVPAGTLSVVTTGGCDWHLVGGGQG

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_7|345_bp
atggatgaagctggaaaccatcattctcagcaaactatcacaagaacagaaaaccaaaca
ctggatgttctcactcatagagagcactacgaacctgtaacaaacatccacaaactagac
catgaaaaccagagtgtaaataagcttcccctacattttttgggtcttcctatattttgg
agactccagaagcagccgctgaccaaccagctaaagctacaccacttcctaccccccaac
aaacaccctctacctcagtggttctcaactgtgcccgcagggacattgtcagttgtcacc
actggaggatgtgactggcatctagtgggtggaggccagggatga

>gi568815593r:156819460_157063198|GENSCAN_predicted_peptide_8|489_aa
MENTATEKALGLRQVGKKALVAGAESAGERVVEEIYGADPIMHPQVVILSLILHLADSVA
GSVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHVTYRKDTRYKL
LGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKITVSLEIVPPKVTTTPIVTTVP
TVTTVRTSTTVPTTTTVPMTTVPTTTVPTTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTT
TSVPVTTTVSTFVPPMPLPRQNHEPVLGLQASATVPSCFVICKKDVIPRSCRMLVRIKWD
ERFGTWLQGGNSQTGFSVSSHTTSTTIINSDEDFCDRMCGVFSPYIKQWTPAVSLIQFLH
CLPGERVISHSCPRFSPYGYKLRSFGLTLMFQKSPSTNTGKCGWQLFLEHSLLTANTTKG
IYAGVCISVLVLLALLGVIIAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIY
IENSLYATD

>gi568815593r:156819460_157063198|GENSCAN_predicted_CDS_8|1470_bp
atggagaacacagccacggaaaaggccttagggttgaggcaagttggaaagaaagctcta
gtagctggggctgagtcagcaggggagagagtggtagaagaaatctatggggctgatccc
ataatgcatcctcaagtggtcatcttaagcctcatcctacatctggcagattctgtagct
ggttctgtaaaggttggtggagaggcaggtccatctgtcacactaccctgccactacagt
ggagctgtcacatccatgtgctggaatagaggctcatgttctctattcacatgccaaaat
ggcattgtctggaccaatggaacccacgtcacctatcggaaggacacacgctataagcta
ttgggggacctttcaagaagggatgtctctttgaccatagaaaatacagctgtgtctgac
agtggcgtatattgttgccgtgttgagcaccgtgggtggttcaatgacatgaaaatcacc
gtatcattggagattgtgccacccaaggtcacgactactccaattgtcacaactgttcca
accgtcacgactgttcgaacgagcaccactgttccaacgacaacgactgttccaatgacg
actgttccaacgacaactgttccaacaacaatgagcattccaacgacaacgactgttctg
acgacaatgactgtttcaacgacaacgagcgttccaacgacaacgagcattccaacaaca
acaagtgttccagtgacaacaactgtctctacctttgttcctccaatgcctttgcccagg
cagaaccatgaaccagtgctgggattacaggcatcagccaccgtgcccagctgttttgtc
atttgtaaaaaggatgtaataccaagatcttgcagaatgcttgtgaggatcaaatgggat
gaaagatttgggacatggttacaggggggaaactcccaaactgggttttctgtctcttct
cacaccacgtcaacaacaatcatcaactcagatgaagacttctgtgaccgaatgtgtgga
gttttttccccatacatcaagcagtggacaccggctgtgtctttaattcagttcttacac
tgtctacccggagagagggtcatatcccacagttgtcctagattttcaccgtatggttat
aaactgagatcatttggtcttacccttatgttccagaagtccccaagcaccaatacaggt
aaatgtggctggcaactgttcctagaacatagtctactgacggccaataccactaaagga
atctatgctggagtctgtatttctgtcttggtgcttcttgctcttttgggtgtcatcatt
gccaaaaagtatttcttcaaaaaggaggttcaacaactaagtgtttcatttagcagcctt
caaattaaagctttgcaaaatgcagttgaaaaggaagtccaagcagaagacaatatctac
attgagaatagtctttatgccacggactaa