GENSCAN 1.0	Date run: 4-Nov-116	Time: 07:47:31

Sequence gi568815591r:135086154_135311502 : 225349 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    163    158    6                               1.05
 1.03 Term -   4031   3976   56  0  2   82   47    39 0.530  -3.08
 1.02 Intr -   4865   4748  118  2  1  103   84    91 0.777  10.24
 1.01 Init -  20818  20414  405  1  0   76   32   112 0.052   1.01
 1.00 Prom -  22484  22445   40                              -3.46

 2.00 Prom +  41697  41736   40                              -2.56
 2.01 Init +  42703  43291  589  0  1   39   78   391 0.041  28.59
 2.02 Intr +  57864  57942   79  1  1   66   55    99 0.022   3.01
 2.03 Intr +  69062  69083   22  2  1   78   96    14 0.002  -1.25
 2.04 Term +  78265  78846  582  0  0   81   43   318 0.081  21.10
 2.05 PlyA +  79122  79127    6                              -1.75

 3.03 PlyA -  79697  79692    6                               1.05
 3.02 Term -  80718  80458  261  0  0   84   49   144 0.992   5.63
 3.01 Init -  82725  82633   93  0  0   64   86   171 0.173  14.88
 3.00 Prom -  90305  90266   40                              -3.46

 4.14 PlyA -  90840  90835    6                               1.05
 4.13 Term - 100162  99998  165  1  0  134   45   244 0.999  22.62
 4.12 Intr - 101016 100819  198  0  0   55   88   242 0.985  20.45
 4.11 Intr - 102392 102280  113  0  2   57  102   201 0.875  18.50
 4.10 Intr - 103299 103191  109  0  1   75   64   225 0.989  18.66
 4.09 Intr - 107246 107078  169  1  1   84   96   196 0.937  19.95
 4.08 Intr - 107519 107425   95  2  2   93   66   123 0.999   9.36
 4.07 Intr - 108473 108417   57  2  0  119   81    -8 0.509   0.68
 4.06 Intr - 108931 108781  151  0  1   98  105   109 0.998  13.76
 4.05 Intr - 110184 109991  194  0  2   83   94   136 0.999  11.99
 4.04 Intr - 111998 111840  159  2  0   70   80   140 0.999  11.58
 4.03 Intr - 118280 118115  166  1  1   83  115    87 0.517  10.86
 4.02 Intr - 119905 119775  131  1  2   33   74   193 0.997  11.89
 4.01 Init - 120035 119949   87  2  0   63   72    60 0.912   1.23
 4.00 Prom - 120596 120557   40                              -6.66

 5.04 PlyA - 120729 120724    6                              -1.75
 5.03 Term - 122845 122612  234  1  0  105   49   208 0.973  14.92
 5.02 Intr - 123602 123423  180  2  0   99   51   178 0.979  15.26
 5.01 Init - 125349 125227  123  0  0   95   80   269 0.930  26.97
 5.00 Prom - 127037 126998   40                              -5.36

 6.00 Prom + 134736 134775   40                              -4.26
 6.01 Init + 145826 145885   60  1  0   83   99    63 0.890   8.15
 6.02 Intr + 154366 154563  198  0  0   -4   86   131 0.101   3.25
 6.03 Intr + 156628 156703   76  0  1   79   90    53 0.897   3.69
 6.04 Intr + 157173 157257   85  1  1   57  110   118 0.981   9.78
 6.05 Intr + 159135 159374  240  0  0   83   58   330 0.538  26.16
 6.06 Intr + 160264 160549  286  1  1   57   83   411 0.993  34.84
 6.07 Intr + 165643 165716   74  0  2   75   94    61 0.790   3.60
 6.08 Intr + 168961 169072  112  1  1   81   91    63 0.891   6.28
 6.09 Term + 172265 172339   75  1  0  125   38   114 0.993   7.94
 6.10 PlyA + 172834 172839    6                               1.05

 7.10 PlyA - 174211 174206    6                               1.05
 7.09 Term - 175378 175361   18  1  0  111   47     9 0.016  -2.58
 7.08 Intr - 181326 181226  101  1  2  114   78    -1 0.086   1.33
 7.07 Intr - 183961 183866   96  2  0   92   74    78 0.676   6.78
 7.06 Intr - 184167 184059  109  0  1  106   62   137 0.936  12.76
 7.05 Intr - 185287 185256   32  2  2   54   86    16 0.014  -4.15
 7.04 Intr - 188503 188448   56  0  2   94   94    90 0.503   8.82
 7.03 Intr - 191244 191069  176  0  2   88  103   149 0.474  15.14
 7.02 Intr - 202244 202085  160  1  1  133   77    23 0.128   5.69
 7.01 Intr - 221157 221018  140  0  2  103  -24   129 0.001   2.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  42703  43326  624  0  0   39   47   423 0.925  29.50
S.002 Sngl +  78289  78846  558  0  0   90   43   306 0.805  22.43
S.003 Init +  84159  84267  109  2  1   86  100   101 0.868  10.03
S.004 Init + 154372 154563  192  0  0   45   86   131 0.853   7.57
S.005 Init - 184465 184418   48  1  0   86   94    55 0.914   7.09
S.006 Term - 221157 221008  150  0  0  103   50   128 0.901   8.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_1|192_aa
MDKFLDIYTLPRLNQEEIDSLNSSIVTSEIESVINSLSSKKIPGQNRFTAEFNQMYKDEL
VPFLLKLFQKAEEEGLLPNSFYEASIILIPKPGRATTIKENFRPISLVNIDAKILNKILA
NQIQQHIKKLIHHDQAQSFLTAEGAAWTPLAVLPPEKELSSWNWNYSRSMSCTRGSKVTT
TQCFIHWNLGYH

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_1|579_bp
atggataaattcctggatatatacaccctcccaagactgaaccaggaagaaatcgattcc
ctgaacagttcaatagtgacctctgaaatagaatcagtaataaatagcttatcatccaaa
aaaatcccaggacaaaacagattcacagctgaattcaatcagatgtacaaagacgagctg
gtaccattcctactgaaactatttcaaaaagctgaggaggagggactcctccccaactca
ttctatgaagccagcattatcctgataccaaaacctggcagagccacaacaataaaagaa
aacttcaggccaatatccttggtgaacattgatgcaaaaatcctcaacaaaatacttgca
aaccaaatccagcagcacatcaaaaagctaatccaccatgatcaagcacagtccttcctc
acagctgaaggagctgcctggactccactagctgtgctccccccagagaaggagctgtca
tcttggaactggaactacagcagaagtatgtcttgcaccagggggtctaaagtcactact
acacagtgcttcatccactggaatctgggttatcactga

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_2|423_aa
MIQEILDFDQAQQIKCFNSNLFLCNICFCKKLGSECMYFLECRHVYCKDCLKDYFEIQIR
DGQVQCLNCPEPKCPSVATTGQVKELVEAELFAYYDCLLLQSTLDLMADVVYCPHACCQL
PVMQEPGYIMGICSSCNFAFCTLCRLTYHGVSPCKVTAEKLIDLLNEYLQADKANKRLLE
QRYDKRVIQKALEEMERGEGCQRFSSPYRCEAETTFEANEAKSLYVFMGKVPRQRAVEMA
GPRPRWRDQLLFMSIIVLVIVVICLMFYALLWEAGNLTDLPNLRIGFYNFCLWNEDTSTL
QCHQFPELEALGVPRVGLGLARLGVYGSLVLTLFAPQPLLLAQCNSDERAWRLAVGFLAV
SSVLLAGGLGLFLSYVWKWVRLSLPGPGFLALGSAQALLILLLIAMAVFPLRAERAESKL
ESC

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_2|1272_bp
atgatccaggaaatcttggactttgatcaagctcagcagataaaatgctttaatagtaat
ttgttcctgtgcaatatctgtttctgtaagaagctgggtagtgaatgcatgtacttcttg
gagtgcaggcatgtgtactgcaaagactgtctgaaggactactttgaaatccagatcaga
gatggccaggttcaatgcctcaactgcccagaaccaaagtgcccttcagtggccactact
ggtcaggtcaaagagctagtggaagcagagttatttgcctattatgactgccttctcctc
cagtccaccttggacctgatggcagatgtggtgtactgcccccacgcatgctgccagctg
cctgtgatgcaggagcctggctacatcatgggtatctgctccagctgcaattttgccttc
tgtaccttgtgcaggttgacctaccatggggtctctccatgtaaggtgactgcagagaaa
ttaatagacttactaaatgaatacctgcaagcagataaggccaataaaagacttttggaa
caaaggtatgataagagggtgattcagaaggcactggaagagatggaaagaggagaaggc
tgccagcgtttctcatctccctaccgctgtgaagcagaaacaacatttgaggctaatgag
gccaagagtctatatgtctttatgggtaaggtcccccggcagagggcagtagagatggcc
ggcccaaggcctcggtggcgcgaccagctgctgttcatgagcatcatagtcctcgtgatt
gtggtcatctgcctgatgttttacgctcttctctgggaggctggcaacctcactgacctg
cccaacctgagaatcggcttctataacttctgcctgtggaatgaggacaccagcacccta
cagtgtcaccagttccctgagctggaagccctgggggtgcctcgggttggcctgggcctg
gccaggcttggcgtgtacgggtccctggtcctcaccctctttgccccccagcctctcctc
ctagcccagtgcaacagtgatgagagagcgtggcggctggcagtgggcttcctggctgtg
tcctctgtgctgctggcaggcggcctgggcctcttcctctcctatgtgtggaagtgggtc
aggctctccctcccggggcctgggtttctagctctgggcagcgcccaggccttactcatc
ctcttgcttatagccatggctgtgttccctctgagggctgagagggctgagagcaagctt
gagagctgctaa

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_3|117_aa
MADSPGGYKECGTNEGPQEDENGSSASGSSKSRKQEKACEQPALAGADNPEHSPPCSVSP
HTSSGSSSEEEDSGKQALAPGLSPSQRPGGSSSACSRSPEEEEEEDVLKYVREIFFS

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_3|354_bp
atggctgacagcccaggtggctacaaagaatgtggcaccaatgaaggcccccaagaggat
gagaatggcagcagtgccagtggcagcagcaagagccgcaaacaggaaaaggcctgcgag
cagccggccctggcgggggctgataacccagagcactcccctccctgctccgtgtcgcct
cacacaagttctgggagcagcagtgaggaagaggacagtgggaaacaggcactggctcca
ggcctcagcccttcccagaggccggggggttccagctctgcctgtagcaggagccctgag
gaggaggaggaagaggatgtgctgaaatacgtccgggagatctttttcagctag

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_4|597_aa
MGPLTLRRVALRPVGLDAMCLEIFLRRGKLFALQAEIHRLKKEEQQPEEEEALVQHKLPP
YVSNMDRLGDSELAMVCSQRNASLSQSPRVGFLSSLLPQSKKSPSRLSPAQGPPQPQSSA
KKESFGGQGTKGKDPTSGAKDGKSLLSGLATGESGWSQHRQRRLQDHGKERKELFSTTTS
QCAEKKPEASGPEAEPCPELHTEPVEPLTRASSAGPEGGGVRPEQPFIVLGQEEYGEHHS
SIMHCRVDCSGRRVASLDVDGVIKVWSFNPIMQTKASSISKSPLLSLEWATKRDRLLQFQ
WAPSRARPHGKWLPLLLLGSGVGTVRLYDTEAKKNLCEININDNMPRILSLACSPNGASF
VCSAAAPSLTSQVDFSAPDIGSKGMNQVPGRLLLWDTKTMKQQLQFSLDPEPIAINCTAF
NHNGNLLVTGAADGVIRLFDMQQHECAMSWRAHYGEVYSVEFSYDENTVYSIGEDGKFIQ
WNIHKSGLKVSEYSLPSDATGPFVLSGYSGYKQVQVPRGRLFAFDSEGNYMLTCSATGGV
IYKLGGDEKVLESCLSLGGHRAPVVTVDWSTAMDCGTCLTASMDGKIKLTTLLAHKA

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_4|1794_bp
atgggccccttgacgctgaggcgagtggcactcagaccagtgggcttggatgcgatgtgt
ctagaaatcttcctccgcagagggaagctttttgcattgcaagctgaaatccaccgactg
aagaaagaggagcaacagccagaagaggaagaggccttggtccaacacaaattgcctcct
tatgtctccaacatggaccgcctgggggactcggaacttgccatggtgtgcagccaaagg
aatgcctccctctcccagtcacctcgtgtgggcttcctgtcctcgctgctgcctcagagt
aagaagagcccctcaaggttgtcgcctgctcagggccctcctcaacctcagagctcggcc
aagaaagagtccttcggtggtcagggcaccaagggaaaggacccgacgtccggagccaag
gatgggaagagcctcctcagcgggctggccactggggagtccggttggtcacagcaccgg
cagcggcgcctgcaggaccatggcaaggagaggaaggagcttttctccacaaccacttcc
cagtgtgcagagaagaaaccagaagccagtggcccagaggctgagccctgcccagagctc
cacacggagccagtggagccactgactcgggcatcctcggcaggccctgagggtggagga
gtccgccccgagcagccctttattgtgctgggacaggaggagtacggggaacaccactca
tccatcatgcactgcagagtggactgctctgggaggagagtcgccagcttagacgtagat
ggggtcatcaaagtgtggtccttcaaccccatcatgcagaccaaagcatcctccatttcc
aaatcaccgctgctgtctttggaatgggccaccaaacgggacagactgctgcagtttcag
tgggcgccctcccgtgccaggccccacgggaagtggcttccgctgctcttgctgggcagt
ggtgtgggaacagtgcgtctctatgacacggaagccaagaagaatctctgtgaaatcaat
atcaacgacaacatgcccagaatcctgtctcttgcgtgcagccccaacggggcctctttc
gtctgttcggcagcagctccgagcctcacttcccaggtggacttctcagcaccagacatc
ggcagcaagggcatgaaccaggttcctggcaggctgctgctgtgggacacgaaaaccatg
aagcagcagctccagttctccctggatccagaacccattgctatcaactgtacagccttc
aatcacaacgggaacctgctggtcacaggggcagctgatggcgtcatccggctgtttgac
atgcagcagcatgagtgcgcgatgagctggagggcccactacggggaggtctactctgtg
gagttcagctatgatgagaacaccgtgtacagcatcggcgaggacgggaagttcatccag
tggaacatccacaagagtggcctcaaggtatccgagtacagcctcccctcagatgccacg
ggcccctttgtgctgtctggatacagcggctacaagcaggttcaagtccccaggggccga
ctcttcgcttttgactcggagggaaattacatgctgacatgttctgccacaggcggcgtc
atctacaagctgggtggcgatgagaaggttctggagagctgcttgagcctaggtggccac
cgagcccctgtggtcaccgtggactggagcactgccatggactgtgggacctgcctcacc
gcctccatggatggcaagatcaagctgaccaccctcctggcccataaagcctga

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_5|178_aa
MAEAVERTDELVREYLLFRGFTHTLRQLDAEIKADKEKGFRVDKIVDQLQQLMQVYDLAA
LRDYWSYLERRLFSRLEDIYRPTIHKLKTSLFRFYLVYTIQTNRNDKAQEFFAKQATELQ
NQAEWKDWFVLPFLPSPDTNPTFATYFSRQWADTFIVSLHNFLSVLFQCMHILSVAWG

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_5|537_bp
atggcggaggccgtggagcgcactgacgagctggtccgggagtacctgctcttccgcggg
ttcacgcacacactgcggcagctggacgccgagatcaaggcggacaaggagaaggggttc
cgggtggataagattgtggaccagctgcagcagttaatgcaggtgtatgacttggctgcc
cttcgggattattggagctacttggagcgtcggctcttcagccgcttggaggatatatac
agacccacaatccacaagctgaaaaccagcctgtttcgattttatcttgtctacacaatc
cagacaaacagaaatgacaaggctcaggagttctttgcaaagcaggccacggaactccag
aaccaggctgagtggaaggattggtttgtcctgcccttcctgccatccccggacaccaac
cccacctttgctacctacttttctcgacagtgggctgacaccttcattgtgtccctgcac
aacttcctgagcgtcctgtttcagtgcatgcatatcctttcagttgcctggggctga

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_6|401_aa
MGKIDVDKILFFNQEIRLWQLIMATPEENSNPHDRATPQLPAQLQELEHRVARRRLSQAR
HRATLAALFNNLRKTVYSQSDLIASKWQVLNKAKSHIPELEQTLDNLLKLKASFNLEDGH
ASSLEEVKKEYASMYSGNDSLLSNSFPQNGSSPWCPTEAVRKDAEEEEDEEEEDQEEEEE
EEEEEEEEEEEEEEEEEEEEKKVILYSPGTLSPDLMEFERYLNFYKQTMDLLTGSGIITP
QEAALPIVSAAISHLWQNLSEERKASLRQAWAQKHRGPATLAEACREPACAEGSVKDSGV
DSQGASCSLVSTPEEILFEDAFDVASFLDKSEVPSTSSSSSVLASCNPENPEEKFQLYMQ
IINFFKGLSCANTQVKQEASFPVDEEMIMLQCTETFDDEDL

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_6|1206_bp
atggggaagattgatgtggacaagatcctctttttcaatcaagaaatcaggctgtggcag
cttataatggcaacccctgaagaaaacagcaatccccatgacagagcaacaccccagctg
ccagcacagctgcaggagcttgagcatcgggtggcccggagacggctgtcccaggcccgc
caccgagccaccctggcagcgctcttcaacaacctcaggaagacagtgtactctcagtct
gatctcatagcctcaaagtggcaggttctgaataaggcaaagagtcatattccagaactg
gagcaaaccctggataatttgctgaagctgaaagcatccttcaacctggaagatgggcat
gcaagcagcttagaggaggtcaagaaagaatatgccagcatgtattctggaaatgacagc
ctgctttcaaacagttttcctcagaatggttcctccccttggtgcccaactgaggcagtc
aggaaggatgctgaggaggaggaagatgaggaagaggaagatcaagaagaagaggaggag
gaagaagaagaggaggaggaggaagaggaggaagaggaagaggaggaggaggaagaggag
aaaaaagtgatcttatactccccaggaactttgtcgcctgacctcatggaatttgaacgg
tatctcaacttttacaaacagacgatggaccttctgactggcagcgggatcattaccccg
caggaggcggcgctgcccatcgtctccgcggccatctcccacctgtggcagaacctctcg
gaggagaggaaggccagcctccggcaggcctgggcgcagaagcaccgcggccctgcgacc
ctggcggaggcctgccgagagccggcctgtgccgagggcagcgtgaaggacagcggcgtg
gacagccagggggccagctgctcgctggtctccacgcccgaggagatcctttttgaggat
gcctttgatgtggcaagcttcctggacaaaagtgaggttccgagtacatctagctccagt
tcagtgcttgccagctgcaacccagaaaacccagaggagaagtttcagctctatatgcag
atcatcaacttttttaaaggccttagctgtgcaaacactcaagtaaagcaggaagcatcc
tttcccgttgatgaagagatgatcatgttgcagtgcacagagacctttgacgatgaagat
ttgtaa

>gi568815591r:135086154_135311502|GENSCAN_predicted_peptide_7|295_aa
HFLTALGGLMAVPFILAKDLCLQQDPLTQSYLISTIFFAPASACSCKLPIPQGGTFAFVV
ISLAMLSLPSWNCPEWTLSASQVNTNFPEFTEKWQKRIQEGAIMVTSCVRMLVGFSGLTG
FLMGFICSLAVAPTNCLVALPLLDSAGNNAGIQWGISAMYCFVLRLRKDELWPFGSPRAY
GHRSVVVKYVEMNLSRSLFAFGFSIYCGLTIPNRVSKNPEMLQTGVLQPAQVVQMLLTMG
MFISGFLGFLLDNTIPAEDDALLAFHCHCKGKRKTQPSIGSTRNIPGRTMAAKAG

>gi568815591r:135086154_135311502|GENSCAN_predicted_CDS_7|888_bp
cacttcctcacagccctggggggcctcatggcggtgccattcatcctggccaaggacctg
tgcctgcagcaggaccccctgacacagagctacctcatcagcaccattttctttgctcca
gcatctgcatgctcctgcaagctgcccattccccagggaggtacgtttgcttttgtggta
atttctctggccatgctctcccttccctcctggaattgccctgagtggacactcagtgcc
agccaggtgaacaccaactttccagaattcactgagaaatggcagaagaggatccaagag
ggtgctatcatggtcacttcctgtgtccggatgctggtgggcttctcaggcctgactggc
tttctcatgggtttcatctgctccttggccgttgctccaactaactgcctagtggccctg
cccctcttggattctgcaggcaataatgccgggatccagtgggggatttctgccatgtat
tgcttcgtgttgcgtcttcgcaaggatgagctctggccatttggttctccacgtgcgtat
ggccacaggagtgttgtggtcaagtacgtggagatgaacttgtccaggagcctcttcgcc
tttggcttctccatctactgtgggctcaccattcccaaccgggtgagcaaaaaccccgag
atgctccagacaggggttctccagccggcccaggttgttcagatgctgctgaccatgggc
atgttcatcagtggatttctgggttttcttctagacaacaccatccccgctgaggatgat
gccctcctagccttccactgtcattgcaaaggcaaaagaaaaacacagccttccattggc
tctaccagaaacatcccagggaggacaatggcagccaaagcaggatga