GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:00:58 Sequence gi568815591f:135131979_135358489 : 226511 bp : 45.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 12039 12117 79 1 1 66 55 99 0.019 3.01 1.02 Intr + 23237 23258 22 2 1 78 96 14 0.002 -1.25 1.03 Term + 32440 33021 582 0 0 81 43 318 0.081 21.10 1.04 PlyA + 33297 33302 6 -1.75 2.03 PlyA - 33872 33867 6 1.05 2.02 Term - 34893 34633 261 0 0 84 49 144 0.992 5.63 2.01 Init - 36900 36808 93 0 0 64 86 171 0.173 14.88 2.00 Prom - 44480 44441 40 -3.46 3.14 PlyA - 45015 45010 6 1.05 3.13 Term - 54337 54173 165 1 0 134 45 244 0.999 22.62 3.12 Intr - 55191 54994 198 0 0 55 88 242 0.985 20.45 3.11 Intr - 56567 56455 113 0 2 57 102 201 0.875 18.50 3.10 Intr - 57474 57366 109 0 1 75 64 225 0.989 18.66 3.09 Intr - 61421 61253 169 1 1 84 96 196 0.937 19.95 3.08 Intr - 61694 61600 95 2 2 93 66 123 0.999 9.36 3.07 Intr - 62648 62592 57 2 0 119 81 -8 0.509 0.68 3.06 Intr - 63106 62956 151 0 1 98 105 109 0.998 13.76 3.05 Intr - 64359 64166 194 0 2 83 94 136 0.999 11.99 3.04 Intr - 66173 66015 159 2 0 70 80 140 0.999 11.58 3.03 Intr - 72455 72290 166 1 1 83 115 87 0.517 10.86 3.02 Intr - 74080 73950 131 1 2 33 74 193 0.997 11.89 3.01 Init - 74210 74124 87 2 0 63 72 60 0.912 1.23 3.00 Prom - 74771 74732 40 -6.66 4.04 PlyA - 74904 74899 6 -1.75 4.03 Term - 77020 76787 234 1 0 105 49 208 0.973 14.92 4.02 Intr - 77777 77598 180 2 0 99 51 178 0.979 15.26 4.01 Init - 79524 79402 123 0 0 95 80 269 0.930 26.97 4.00 Prom - 81212 81173 40 -5.36 5.00 Prom + 88911 88950 40 -4.26 5.01 Init + 100001 100060 60 1 0 83 99 63 0.890 8.15 5.02 Intr + 108541 108738 198 0 0 -4 86 131 0.101 3.25 5.03 Intr + 110803 110878 76 0 1 79 90 53 0.897 3.69 5.04 Intr + 111348 111432 85 1 1 57 110 118 0.981 9.78 5.05 Intr + 113310 113549 240 0 0 83 58 330 0.538 26.16 5.06 Intr + 114439 114724 286 1 1 57 83 411 0.993 34.84 5.07 Intr + 119818 119891 74 0 2 75 94 61 0.790 3.60 5.08 Intr + 123136 123247 112 1 1 81 91 63 0.891 6.28 5.09 Term + 126440 126514 75 1 0 125 38 114 0.993 7.94 5.10 PlyA + 127009 127014 6 1.05 6.13 PlyA - 128386 128381 6 1.05 6.12 Term - 129553 129536 18 1 0 111 47 9 0.016 -2.58 6.11 Intr - 135501 135401 101 1 2 114 78 -1 0.086 1.33 6.10 Intr - 138136 138041 96 2 0 92 74 78 0.676 6.78 6.09 Intr - 138342 138234 109 0 1 106 62 137 0.936 12.76 6.08 Intr - 139462 139431 32 2 2 54 86 16 0.014 -4.15 6.07 Intr - 142678 142623 56 0 2 94 94 90 0.503 8.82 6.06 Intr - 145419 145244 176 0 2 88 103 149 0.474 15.14 6.05 Intr - 156419 156260 160 1 1 133 77 23 0.129 5.69 6.04 Intr - 175332 175193 140 0 2 103 -24 129 0.001 2.46 6.03 Intr - 185731 185662 70 0 1 73 99 68 0.030 5.58 6.02 Intr - 211895 211694 202 0 1 127 92 -3 0.017 2.34 6.01 Init - 214354 214249 106 1 1 52 70 97 0.025 4.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 32464 33021 558 0 0 90 43 306 0.805 22.43 S.002 Init + 38334 38442 109 2 1 86 100 101 0.868 10.03 S.003 Init + 108547 108738 192 0 0 45 86 131 0.853 7.57 S.004 Init - 138640 138593 48 1 0 86 94 55 0.914 7.09 S.005 Term - 214745 214638 108 2 0 15 44 159 0.948 2.71 S.006 Intr - 217444 217286 159 1 0 51 116 75 0.902 6.78 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_1|227_aa XGEGCQRFSSPYRCEAETTFEANEAKSLYVFMGKVPRQRAVEMAGPRPRWRDQLLFMSII VLVIVVICLMFYALLWEAGNLTDLPNLRIGFYNFCLWNEDTSTLQCHQFPELEALGVPRV GLGLARLGVYGSLVLTLFAPQPLLLAQCNSDERAWRLAVGFLAVSSVLLAGGLGLFLSYV WKWVRLSLPGPGFLALGSAQALLILLLIAMAVFPLRAERAESKLESC >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_1|684_bp ngaggagaaggctgccagcgtttctcatctccctaccgctgtgaagcagaaacaacattt gaggctaatgaggccaagagtctatatgtctttatgggtaaggtcccccggcagagggca gtagagatggccggcccaaggcctcggtggcgcgaccagctgctgttcatgagcatcata gtcctcgtgattgtggtcatctgcctgatgttttacgctcttctctgggaggctggcaac ctcactgacctgcccaacctgagaatcggcttctataacttctgcctgtggaatgaggac accagcaccctacagtgtcaccagttccctgagctggaagccctgggggtgcctcgggtt ggcctgggcctggccaggcttggcgtgtacgggtccctggtcctcaccctctttgccccc cagcctctcctcctagcccagtgcaacagtgatgagagagcgtggcggctggcagtgggc ttcctggctgtgtcctctgtgctgctggcaggcggcctgggcctcttcctctcctatgtg tggaagtgggtcaggctctccctcccggggcctgggtttctagctctgggcagcgcccag gccttactcatcctcttgcttatagccatggctgtgttccctctgagggctgagagggct gagagcaagcttgagagctgctaa >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_2|117_aa MADSPGGYKECGTNEGPQEDENGSSASGSSKSRKQEKACEQPALAGADNPEHSPPCSVSP HTSSGSSSEEEDSGKQALAPGLSPSQRPGGSSSACSRSPEEEEEEDVLKYVREIFFS >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_2|354_bp atggctgacagcccaggtggctacaaagaatgtggcaccaatgaaggcccccaagaggat gagaatggcagcagtgccagtggcagcagcaagagccgcaaacaggaaaaggcctgcgag cagccggccctggcgggggctgataacccagagcactcccctccctgctccgtgtcgcct cacacaagttctgggagcagcagtgaggaagaggacagtgggaaacaggcactggctcca ggcctcagcccttcccagaggccggggggttccagctctgcctgtagcaggagccctgag gaggaggaggaagaggatgtgctgaaatacgtccgggagatctttttcagctag >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_3|597_aa MGPLTLRRVALRPVGLDAMCLEIFLRRGKLFALQAEIHRLKKEEQQPEEEEALVQHKLPP YVSNMDRLGDSELAMVCSQRNASLSQSPRVGFLSSLLPQSKKSPSRLSPAQGPPQPQSSA KKESFGGQGTKGKDPTSGAKDGKSLLSGLATGESGWSQHRQRRLQDHGKERKELFSTTTS QCAEKKPEASGPEAEPCPELHTEPVEPLTRASSAGPEGGGVRPEQPFIVLGQEEYGEHHS SIMHCRVDCSGRRVASLDVDGVIKVWSFNPIMQTKASSISKSPLLSLEWATKRDRLLQFQ WAPSRARPHGKWLPLLLLGSGVGTVRLYDTEAKKNLCEININDNMPRILSLACSPNGASF VCSAAAPSLTSQVDFSAPDIGSKGMNQVPGRLLLWDTKTMKQQLQFSLDPEPIAINCTAF NHNGNLLVTGAADGVIRLFDMQQHECAMSWRAHYGEVYSVEFSYDENTVYSIGEDGKFIQ WNIHKSGLKVSEYSLPSDATGPFVLSGYSGYKQVQVPRGRLFAFDSEGNYMLTCSATGGV IYKLGGDEKVLESCLSLGGHRAPVVTVDWSTAMDCGTCLTASMDGKIKLTTLLAHKA >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_3|1794_bp atgggccccttgacgctgaggcgagtggcactcagaccagtgggcttggatgcgatgtgt ctagaaatcttcctccgcagagggaagctttttgcattgcaagctgaaatccaccgactg aagaaagaggagcaacagccagaagaggaagaggccttggtccaacacaaattgcctcct tatgtctccaacatggaccgcctgggggactcggaacttgccatggtgtgcagccaaagg aatgcctccctctcccagtcacctcgtgtgggcttcctgtcctcgctgctgcctcagagt aagaagagcccctcaaggttgtcgcctgctcagggccctcctcaacctcagagctcggcc aagaaagagtccttcggtggtcagggcaccaagggaaaggacccgacgtccggagccaag gatgggaagagcctcctcagcgggctggccactggggagtccggttggtcacagcaccgg cagcggcgcctgcaggaccatggcaaggagaggaaggagcttttctccacaaccacttcc cagtgtgcagagaagaaaccagaagccagtggcccagaggctgagccctgcccagagctc cacacggagccagtggagccactgactcgggcatcctcggcaggccctgagggtggagga gtccgccccgagcagccctttattgtgctgggacaggaggagtacggggaacaccactca tccatcatgcactgcagagtggactgctctgggaggagagtcgccagcttagacgtagat ggggtcatcaaagtgtggtccttcaaccccatcatgcagaccaaagcatcctccatttcc aaatcaccgctgctgtctttggaatgggccaccaaacgggacagactgctgcagtttcag tgggcgccctcccgtgccaggccccacgggaagtggcttccgctgctcttgctgggcagt ggtgtgggaacagtgcgtctctatgacacggaagccaagaagaatctctgtgaaatcaat atcaacgacaacatgcccagaatcctgtctcttgcgtgcagccccaacggggcctctttc gtctgttcggcagcagctccgagcctcacttcccaggtggacttctcagcaccagacatc ggcagcaagggcatgaaccaggttcctggcaggctgctgctgtgggacacgaaaaccatg aagcagcagctccagttctccctggatccagaacccattgctatcaactgtacagccttc aatcacaacgggaacctgctggtcacaggggcagctgatggcgtcatccggctgtttgac atgcagcagcatgagtgcgcgatgagctggagggcccactacggggaggtctactctgtg gagttcagctatgatgagaacaccgtgtacagcatcggcgaggacgggaagttcatccag tggaacatccacaagagtggcctcaaggtatccgagtacagcctcccctcagatgccacg ggcccctttgtgctgtctggatacagcggctacaagcaggttcaagtccccaggggccga ctcttcgcttttgactcggagggaaattacatgctgacatgttctgccacaggcggcgtc atctacaagctgggtggcgatgagaaggttctggagagctgcttgagcctaggtggccac cgagcccctgtggtcaccgtggactggagcactgccatggactgtgggacctgcctcacc gcctccatggatggcaagatcaagctgaccaccctcctggcccataaagcctga >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_4|178_aa MAEAVERTDELVREYLLFRGFTHTLRQLDAEIKADKEKGFRVDKIVDQLQQLMQVYDLAA LRDYWSYLERRLFSRLEDIYRPTIHKLKTSLFRFYLVYTIQTNRNDKAQEFFAKQATELQ NQAEWKDWFVLPFLPSPDTNPTFATYFSRQWADTFIVSLHNFLSVLFQCMHILSVAWG >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_4|537_bp atggcggaggccgtggagcgcactgacgagctggtccgggagtacctgctcttccgcggg ttcacgcacacactgcggcagctggacgccgagatcaaggcggacaaggagaaggggttc cgggtggataagattgtggaccagctgcagcagttaatgcaggtgtatgacttggctgcc cttcgggattattggagctacttggagcgtcggctcttcagccgcttggaggatatatac agacccacaatccacaagctgaaaaccagcctgtttcgattttatcttgtctacacaatc cagacaaacagaaatgacaaggctcaggagttctttgcaaagcaggccacggaactccag aaccaggctgagtggaaggattggtttgtcctgcccttcctgccatccccggacaccaac cccacctttgctacctacttttctcgacagtgggctgacaccttcattgtgtccctgcac aacttcctgagcgtcctgtttcagtgcatgcatatcctttcagttgcctggggctga >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_5|401_aa MGKIDVDKILFFNQEIRLWQLIMATPEENSNPHDRATPQLPAQLQELEHRVARRRLSQAR HRATLAALFNNLRKTVYSQSDLIASKWQVLNKAKSHIPELEQTLDNLLKLKASFNLEDGH ASSLEEVKKEYASMYSGNDSLLSNSFPQNGSSPWCPTEAVRKDAEEEEDEEEEDQEEEEE EEEEEEEEEEEEEEEEEEEEKKVILYSPGTLSPDLMEFERYLNFYKQTMDLLTGSGIITP QEAALPIVSAAISHLWQNLSEERKASLRQAWAQKHRGPATLAEACREPACAEGSVKDSGV DSQGASCSLVSTPEEILFEDAFDVASFLDKSEVPSTSSSSSVLASCNPENPEEKFQLYMQ IINFFKGLSCANTQVKQEASFPVDEEMIMLQCTETFDDEDL >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_5|1206_bp atggggaagattgatgtggacaagatcctctttttcaatcaagaaatcaggctgtggcag cttataatggcaacccctgaagaaaacagcaatccccatgacagagcaacaccccagctg ccagcacagctgcaggagcttgagcatcgggtggcccggagacggctgtcccaggcccgc caccgagccaccctggcagcgctcttcaacaacctcaggaagacagtgtactctcagtct gatctcatagcctcaaagtggcaggttctgaataaggcaaagagtcatattccagaactg gagcaaaccctggataatttgctgaagctgaaagcatccttcaacctggaagatgggcat gcaagcagcttagaggaggtcaagaaagaatatgccagcatgtattctggaaatgacagc ctgctttcaaacagttttcctcagaatggttcctccccttggtgcccaactgaggcagtc aggaaggatgctgaggaggaggaagatgaggaagaggaagatcaagaagaagaggaggag gaagaagaagaggaggaggaggaagaggaggaagaggaagaggaggaggaggaagaggag aaaaaagtgatcttatactccccaggaactttgtcgcctgacctcatggaatttgaacgg tatctcaacttttacaaacagacgatggaccttctgactggcagcgggatcattaccccg caggaggcggcgctgcccatcgtctccgcggccatctcccacctgtggcagaacctctcg gaggagaggaaggccagcctccggcaggcctgggcgcagaagcaccgcggccctgcgacc ctggcggaggcctgccgagagccggcctgtgccgagggcagcgtgaaggacagcggcgtg gacagccagggggccagctgctcgctggtctccacgcccgaggagatcctttttgaggat gcctttgatgtggcaagcttcctggacaaaagtgaggttccgagtacatctagctccagt tcagtgcttgccagctgcaacccagaaaacccagaggagaagtttcagctctatatgcag atcatcaacttttttaaaggccttagctgtgcaaacactcaagtaaagcaggaagcatcc tttcccgttgatgaagagatgatcatgttgcagtgcacagagacctttgacgatgaagat ttgtaa >gi568815591f:135131979_135358489|GENSCAN_predicted_peptide_6|421_aa MVNYVKRTGGHMGQRLKAIHRFGNLEVTDHLFQREGEDGFLDQPSGLKRYQWLGYARLTC DTLDVRKPGPHFCGTSTTCVTTEYLCCFALWVSFLIYKTRDLQGSQKKTPVAMNSAVTSR EHSTYSHFLTALGGLMAVPFILAKDLCLQQDPLTQSYLISTIFFAPASACSCKLPIPQGG TFAFVVISLAMLSLPSWNCPEWTLSASQVNTNFPEFTEKWQKRIQEGAIMVTSCVRMLVG FSGLTGFLMGFICSLAVAPTNCLVALPLLDSAGNNAGIQWGISAMYCFVLRLRKDELWPF GSPRAYGHRSVVVKYVEMNLSRSLFAFGFSIYCGLTIPNRVSKNPEMLQTGVLQPAQVVQ MLLTMGMFISGFLGFLLDNTIPAEDDALLAFHCHCKGKRKTQPSIGSTRNIPGRTMAAKA G >gi568815591f:135131979_135358489|GENSCAN_predicted_CDS_6|1266_bp atggtcaactatgtcaagcgcactggaggccacatgggacaacgactgaaagcaattcat agatttggaaatttggaggtcactgatcaccttttccagagggaaggtgaggatggtttt cttgaccagccgtcaggacttaaacgttaccagtggttgggatacgcacggttgacatgt gacacactcgatgttaggaaacctggaccccacttctgtgggacctctaccacctgtgtg accactgaatacctatgttgctttgctttgtgggtcagtttcctcatctataaaacgagg gatttgcaaggaagccagaagaagaccccagttgccatgaactccgccgtcactagccgt gagcactccacttactcgcacttcctcacagccctggggggcctcatggcggtgccattc atcctggccaaggacctgtgcctgcagcaggaccccctgacacagagctacctcatcagc accattttctttgctccagcatctgcatgctcctgcaagctgcccattccccagggaggt acgtttgcttttgtggtaatttctctggccatgctctcccttccctcctggaattgccct gagtggacactcagtgccagccaggtgaacaccaactttccagaattcactgagaaatgg cagaagaggatccaagagggtgctatcatggtcacttcctgtgtccggatgctggtgggc ttctcaggcctgactggctttctcatgggtttcatctgctccttggccgttgctccaact aactgcctagtggccctgcccctcttggattctgcaggcaataatgccgggatccagtgg gggatttctgccatgtattgcttcgtgttgcgtcttcgcaaggatgagctctggccattt ggttctccacgtgcgtatggccacaggagtgttgtggtcaagtacgtggagatgaacttg tccaggagcctcttcgcctttggcttctccatctactgtgggctcaccattcccaaccgg gtgagcaaaaaccccgagatgctccagacaggggttctccagccggcccaggttgttcag atgctgctgaccatgggcatgttcatcagtggatttctgggttttcttctagacaacacc atccccgctgaggatgatgccctcctagccttccactgtcattgcaaaggcaaaagaaaa acacagccttccattggctctaccagaaacatcccagggaggacaatggcagccaaagca ggatga