GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:37:09 Sequence gi568815588r:86337546_86621730 : 284185 bp : 44.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1533 1745 213 2 0 56 64 139 0.298 7.21 1.02 Term + 3613 3828 216 0 0 96 54 19 0.110 -3.56 1.03 PlyA + 5709 5714 6 1.05 2.08 PlyA - 7456 7451 6 -0.45 2.07 Term - 9192 9036 157 2 1 55 43 87 0.121 -1.69 2.06 Intr - 11274 11194 81 2 0 -21 61 190 0.281 4.15 2.05 Intr - 12396 12348 49 1 1 110 72 54 0.204 3.74 2.04 Intr - 20296 20165 132 2 0 56 38 93 0.162 1.72 2.03 Intr - 25998 25645 354 1 0 30 80 135 0.464 2.16 2.02 Intr - 26551 26396 156 2 0 111 113 379 0.998 42.68 2.01 Init - 28847 28769 79 2 1 87 111 109 0.970 14.41 2.00 Prom - 36777 36738 40 -6.66 3.03 PlyA - 37599 37594 6 1.05 3.02 Term - 39865 39732 134 2 2 67 42 122 0.031 3.75 3.01 Init - 52442 52373 70 2 1 54 41 126 0.089 3.71 3.00 Prom - 58993 58954 40 -2.66 4.02 PlyA - 60385 60380 6 1.05 4.01 Sngl - 62852 62523 330 2 0 69 44 318 0.954 19.52 4.00 Prom - 66740 66701 40 -5.06 5.00 Prom + 66811 66850 40 -1.36 5.01 Init + 74065 74201 137 0 2 76 88 37 0.855 2.11 5.02 Intr + 78199 78354 156 1 0 47 39 100 0.464 0.13 5.03 Term + 86005 86152 148 1 1 10 42 186 0.837 3.57 5.04 PlyA + 87381 87386 6 1.05 6.16 PlyA - 87713 87708 6 1.05 6.15 Term - 98455 98372 84 1 0 67 42 63 0.501 -2.75 6.14 Intr - 100470 100375 96 0 0 118 93 -2 0.695 3.51 6.13 Intr - 105818 105730 89 0 2 71 100 38 0.912 2.99 6.12 Intr - 108904 108697 208 1 1 38 71 198 0.982 11.75 6.11 Intr - 114586 114422 165 1 0 49 115 55 0.956 4.46 6.10 Intr - 115790 115675 116 0 2 83 119 75 0.999 10.27 6.09 Intr - 116286 116111 176 2 2 70 95 119 0.990 10.38 6.08 Intr - 121520 121444 77 2 2 110 97 1 0.950 1.51 6.07 Intr - 129917 129734 184 1 1 -9 76 190 0.090 7.89 6.06 Intr - 134799 134663 137 0 2 70 82 131 0.454 10.07 6.05 Intr - 136428 136333 96 0 0 64 98 40 0.636 2.81 6.04 Intr - 159774 159656 119 1 2 35 114 79 0.922 5.38 6.03 Intr - 163198 162173 1026 2 0 97 110 413 0.823 34.96 6.02 Intr - 180546 180026 521 2 2 85 111 290 0.665 23.60 6.01 Init - 198985 198852 134 1 2 62 90 133 0.216 8.52 6.00 Prom - 227530 227491 40 -3.46 7.06 PlyA - 228820 228815 6 1.05 7.05 Term - 229174 229042 133 0 1 54 48 143 0.115 4.46 7.04 Intr - 260870 260772 99 1 0 88 84 16 0.106 0.43 7.03 Intr - 274677 274537 141 2 0 -15 101 123 0.109 2.77 7.02 Intr - 281348 280256 1093 0 1 15 53 379 0.215 16.17 7.01 Intr - 282264 282242 23 2 2 104 97 44 0.625 4.09 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 39637 39986 350 0 2 42 44 210 0.801 8.78 S.002 Sngl + 243655 243927 273 0 0 114 48 104 0.829 4.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_1|142_aa ASSTYHVHPEPQHPTQCLVPGPNKTPVNICEVNKCSLPCHTWTSPRSLPGILHFRETSPE LQGKAGEEEEEVSFKKPPAPGTVPDPPGPSFQLSPLKHKNKGSRVLLEIARKPYHRQGAL DAPEPALPTKQSRRMEEKMGLG >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_1|429_bp gcaagctccacgtaccatgtccaccctgaaccccagcacccaacacagtgcctggtgcct ggcccaaacaagacaccagtgaacatctgtgaagtgaacaaatgctctctgccctgccac acctggacatctcctaggagcctcccaggaatcctgcactttcgagaaacatctcctgag ctgcaagggaaggcaggggaggaggaggaggaggtttcatttaagaaaccccctgctcca gggactgtccctgatcccccaggtccatcctttcaacttagtcccctaaaacacaaaaac aaagggagcagagtgcttctagagatagcaagaaaaccctaccaccgacagggagctctg gatgccccagagcctgcactgcccactaaacagtcccggaggatggaggagaagatggga ctaggatga >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_2|335_aa MEALTLWLLPWICQCVSVRADSIIHIGAIFEENAAKDDRVFQLAVSDLSLNDDILQSEKI TYSIKVIEANNPFQAVQEVARCFPRFEVRSAQRLEPDLSPSPGHNLVGAAFRAPRPPEPQ PGASLQRSSTRFGGAGFSLAVRAAGGGAAAWAGGGRCSPRKGAELGRSGCGEILAYAGAA LVHSHCLRSGPAAGATGLLPVITAVDEAAGIDSLELGPGSPLWIPHKPVLNSRCQGAELT GVLVGNDVINLFVRAIRQDSEVCDESQYALIGTGECFRELLDVSLCLSFCLVGPMPGAAV DFGKLGCCWEVAALSAKAWLLPPGQRLLRSERLIE >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_2|1008_bp atggaagcgctgacgctgtggcttctcccctggatatgccagtgcgtgtcggtgcgggcc gactccatcatccacatcggtgccatcttcgaggagaacgcggccaaggacgacagggtg ttccagttggcggtatccgacctgagcctcaacgatgacatcctgcagagcgagaagatc acctactccatcaaggtcatcgaggccaacaacccattccaggctgtgcaggaagtggcc cggtgttttcctcgcttcgaggtgaggagcgcccagcggctggagcccgacctgtctcct tctcccgggcataacctggtaggcgcggcgtttcgcgccccgcggcccccagagcctcag cccggcgcttctctgcagcgctcctccaccaggtttggaggagccggtttctctctcgct gtccgcgccgcgggcggaggcgccgccgcctgggccggaggtgggcgctgttcgcccagg aaaggagcggagctgggccggtccggctgcggggaaatcctggcctacgccggggccgcc cttgtccacagccactgcctccggtcgggccccgcggcaggcgccacagggctactgcca gtgataacagctgtggatgaggctgcaggcattgatagtctggagttaggccctgggtcc cctctctggattccccacaagcccgtgctgaactcacggtgtcagggtgctgaactcaca ggcgtcctcgtcggcaatgatgttattaatttatttgtgagggccatcaggcaggacagc gaggtctgtgatgagtctcagtacgcactgatcggcactggtgaatgcttccgggagctc ctggatgtcagcctctgtttatccttctgcctagtggggccaatgcctggggctgccgtg gacttcgggaagctgggctgctgctgggaggtggctgccttatcagcaaaggcctggctc ctgcctcctgggcagcgtctgctccgctctgaacgattaattgaatag >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_3|67_aa MARRGSRALSAGWAAVMGQTRKGRGSIALRLLLGGAGDWAWKPRLAAACALAASLALSPA ESADQPV >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_3|204_bp atggctcgccgcgggtctcgggcgctgtcagccgggtgggccgcagtgatgggtcagacc aggaaggggcggggcagcattgcactgcggctgctcctgggcggcgccggcgactgggct tggaaaccgcgcctcgcggcggcctgcgccctcgctgcgtccctcgcgctgtcacctgcg gaatcggcagaccagcccgtgtag >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_4|109_aa MRWDCRPPAAPPAASALPLLSCQSSGFCRSSAVRLGSGFPAPLAVSARVSRLAPDRMVQL PQGRSRGSSLIASSAPTAPLRGRLGLIARKSAWSLPWAGWARSSRAVVL >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_4|330_bp atgcgctgggattgcaggcctccagcggccccgcccgccgcctccgccctcccgctcctg agctgccaatccagcgggttctgccggtcctcggccgtgcgtctggggtctggattcccg gctccactcgccgtgagcgcgcgtgtcagccgactcgccccggaccgcatggtgcagctc ccgcagggccgcagccgcgggtcgtcgctgattgccagctccgcgcccactgcaccgctg cgcggccgcctagggctcattgcgcggaagtcggcctggtccctcccctgggctggatgg gcccgctcctcccgagcggtcgtgctctga >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_5|146_aa MHQPLTQQIISGNTHSAQTGNSVHFRAVSSQMQGALLLQSRLEELRKNHETEKFGNAYTE ATGVATQEGQHGVSEPPAAVGIKQTWFQILILPACDLSLNGYYQKAKTDDAGKPVSENWY TAGGNVNSYSHYGKQVPRKTKYRSAI >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_5|441_bp atgcatcagcctctgacccagcagattatttcaggaaacacccacagtgctcagacagga aattctgtgcacttcagagctgtgagctcacagatgcagggagcattgctgttacagtcc agactagaagaattgaggaaaaaccatgaaacagaaaaattcgggaatgcgtacacagag gctacaggagtggccacccaggaaggacagcatggggtctcggaacccccagctgctgtg ggaatcaagcagacctggtttcaaattctgatcctgcctgcatgtgacctcagtttaaat ggctattatcaaaaagccaaaactgacgatgccggcaagcctgtctcggagaactggtac actgctggtgggaatgtaaactcgtacagccactatggaaaacaggttcctcgaaaaact aaatacagaagtgcaatatga >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_6|1075_aa MRSRAGPGSALAPAFGARTVPQSSRDTGPAGKVTCATKAAQAAPREYETGVKMTSRFGKT YSRKGGNGSSKFDEVFSNKRTTLSTKWGETTFMAKLGQKRPNFKPDIQEIPKKPKVEEES TGDPFGFDSDDESLPVSSKNLAQVKCSSYSESSEAAQLEEVTSVLEANSKISHVVVEDTV VSDKCFPLEDTLLGKEKSTNRIVEDDASISSCNKLITSDKVENFHEEHEKNSHHIHKNAD DSTKKPNAETTVASEIKETNDTWNSQFGKRPESPSEISPIKGSVRTGLFEWDNDFEDIRS EDCILSLDSDPLLEMKDDDFKNRLENLNEAIEEDIVQSVLRPTNCRTYCRANKTKSSQGA SNFDKLMDGTSQALAKANSESSKDGLNQAKKGGVSCGTSFRGTVGRTRDYTVLHPSCLSV CNVTIQDTMERSMDEFTASTPADLGEAGRLRKKADIATSKTTTRFRPSNTKSKKDVKLEF FGFEDHETGGDEGGSGSSNYKIKYFGFDDLSESEDDEDDDCQVERKTSKKRTKTAPSPSL QPPPESNDNSQDSQSGTNNAENLDFTEDLPGVPESVKKPINKQGDKSKENTRKIFSGPKR SPTKAVYNARHWNHPDSEELPGPPVVKPQSVTLYTVVQHVKHFNDVVEFGENQEFTDDIE YLLSGLKSTQPLNTRCLRDRLNMDLDRASLDLMIRLLELEQDASSAKLLNEKDMNKIKEK IRRLCETVHNKHLDLENITVTVHNPENQSYLIAYKDSQLIVSSAKALQHCEELIQQYNRA EDSICLADSKPLPHQNVTNHVGKAVEDCMRAIIGVLLNLTNDNEWGSTKTGEQDGLIGTA LNCVLQVPKYLPQEQRFDIRVLGLGLLINLVEYSARNRHCLVNMETSCSFDSSICSGEGD DSLRIGGQVHAVQALVQLFLERERAAQLAESKTDELIKDAPTTQHDKSGEWQETSGEIQW VSTEKTDGTEEKHKKEEEDEELDLNKALQHAGKHMEDCIVASYTALLLGCLCQESPINVT TVREYLPEGDFSIMTEMLKKFLSFMNLTNIVRQCNVGTIEIVFALSIKASLAKYF >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_6|3228_bp atgcgatctcgagcgggccctggctcagcgctggcgcccgctttcggagccaggactgtc cctcagagcagccgggacactggcccggccgggaaggtcacctgcgcaacaaaggcggcc caggcagccccaagagaatatgaaactggtgtcaaaatgacatccagatttgggaaaaca tacagtaggaaaggtggaaatggcagttcaaaattcgatgaagtcttttccaacaaacgg actacccttagcacaaaatggggagagaccacatttatggctaaattagggcagaagagg cccaatttcaaaccagatatccaagaaattccgaagaaacctaaagtggaagaagaaagt actggagatccttttggatttgatagtgatgatgagtctctaccagtttcttcaaagaat ttagcccaggttaagtgttcctcttattcagaatctagtgaagctgctcagttggaagag gtcacttcagtacttgaagctaatagcaaaattagtcatgtggtcgttgaagacactgtc gtttctgataaatgcttccctttggaggacactttacttgggaaagaaaagagcacaaac cgaattgtagaagatgatgcaagcataagtagctgtaataaattaataacttcagataaa gtggagaattttcatgaagaacatgaaaagaatagtcaccatattcacaaaaatgctgat gacagtactaagaaacccaatgcagaaactacagtggcttctgaaatcaaggaaacaaat gatacttggaactcccagtttgggaaaaggccagaatcaccatcagaaatatctccaatc aagggatctgttagaactggtttgtttgaatgggataatgattttgaagatatcagatca gaagactgtattttaagtttggatagtgatccccttttggagatgaaggatgacgatttt aaaaatcgattggaaaatctgaatgaagccattgaggaagatattgtacaaagtgttctt aggccaaccaactgtaggacgtactgtagggccaataaaacgaaatcctcccaaggagca tcaaattttgataagctgatggacggcaccagtcaggccttagccaaagcaaacagtgaa tcgagtaaagatggcctgaatcaggcaaagaaagggggtgtaagttgtgggaccagtttt agagggacagttggacggactagagattacactgttttacatccatcttgcttgtcagtt tgtaatgttaccatacaggatactatggaacgcagcatggatgagttcactgcatccact cctgcagatttgggagaagctggtcgtctcagaaaaaaggcagatattgcaacttctaag actactactagatttcgacctagtaatactaaatccaaaaaggatgttaaacttgaattt tttggttttgaagatcatgagacaggaggtgatgaaggaggttctggaagttctaattac aaaattaagtattttggctttgatgatctcagtgaaagcgaagatgatgaagatgatgac tgtcaagtagaaagaaagacaagcaaaaaaagaactaaaacagctccatcaccctccttg cagcctcccccagaaagcaatgataattcccaggacagtcagtctggtactaacaatgca gaaaacttggattttacagaggacttgcctggtgtgcctgaaagtgtgaagaagcccata aataaacaaggagataaatcaaaggaaaataccagaaagatttttagtggccccaaacgg tcacccacaaaagctgtatataatgccagacattggaatcatccagattcagaagaactg cctgggccaccagtagtaaaacctcagagtgtcacattatatactgttgttcagcacgtg aagcacttcaacgatgttgtagaatttggtgaaaatcaagagttcactgatgacattgag tacttgttaagtggcttaaagagcactcagcctctaaacacacgttgccttagagatcgt ttgaacatggatcttgatagagctagcttagatctaatgattcgacttttggaactggaa caagatgcttcatcagccaagctactgaatgaaaaagacatgaacaaaattaaagaaaaa atccgaaggctctgtgaaactgtacacaacaagcatcttgatctagaaaatataacggta actgtgcataatcccgaaaatcaaagctacttgatagcatataaagattcccaacttatt gtttcatcagctaaagcattacagcattgtgaagaactgattcagcagtacaaccgtgct gaggacagcatatgcttagctgacagtaagcctctgcctcaccagaatgtaactaaccat gtaggcaaagcagtggaggactgcatgagggccatcatcggggtgttgcttaatttaact aatgataatgagtggggcagcaccaaaacaggagagcaggacggtctcataggcacagcg ctgaactgtgtgcttcaggttccaaagtacctacctcaggagcagagatttgatattcga gtgctgggcttaggtctgctgataaatctagtggagtatagtgctcggaatcggcactgt cttgtcaacatggaaacatcgtgctcttttgattcttccatctgtagtggagaaggggat gatagtttaaggataggtggacaagttcatgctgtccaggctttagtgcagctattcctt gagcgagagcgggcagcccagctagcagaaagtaaaacagatgagttgatcaaagatgct cccaccactcagcatgataagagtggagagtggcaagaaacaagtggagaaatacagtgg gtgtcaactgaaaagactgatggtacagaagagaaacataagaaggaggaggaggatgaa gaacttgacctcaataaagcccttcagcatgccggcaaacacatggaggattgcattgtg gcctcctacacagcactacttcttgggtgtctctgccaggaaagtccaatcaatgtaacc actgtgcgggaatatctgccagaaggagacttttcaataatgacagagatgctcaaaaaa tttttgagttttatgaatctcactaacatagtacgccagtgtaacgtgggaaccattgag attgtatttgccctgagtattaaagctagcttagcaaaatacttttaa >gi568815588r:86337546_86621730|GENSCAN_predicted_peptide_7|496_aa XKKEPIMLENEIPGIQLTSDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIGKMA ILPKVIYRFNAIPIKLPMIFFTELEKTTLKFIWNQKRARIAKSMLSQKNKAGGIMLPDFK LYYKATVTKTAWYWYQNRNIDQWNRTEPSEIMPCIYNYLIFDKPEKNKQWGKDSLFNKWC WENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDFMS KTPKAMATKAKIDKWDVIKLNSFCTAKETTIRVNRQATEWEKIFAIYSSDKGIISRIYNE LQQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHLTP VRMAIIKKSGNNRNNGKPLARLGAVMDALLDQKHSGHPAGTGGMEVSVRPAKAANSRGGR TLTKAKPKSLWIYITQKSHHWGCLDDASISEFLKISPLCNKSKVQKPESKETVCQASELK LSHHIPCDLHVYIQMA >gi568815588r:86337546_86621730|GENSCAN_predicted_CDS_7|1491_bp nngaaaaaggagcccatcatgcttgagaatgaaataccaggaattcaacttacaagcgat gtgaaggacctcttcaaggagaactataaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaacatcgggaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgattttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatc gccaagtcaatgctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaa ctatactacaaggccacagtaaccaaaacagcatggtactggtaccaaaacagaaatata gaccaatggaacagaacagagccctcagaaattatgccgtgtatctacaactatctgatc tttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgc tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaattaattcgagatggattaaagacttaaatgttagacctaaaactataaaaacc ctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatgtaattaaacta aatagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaagctacagaatgg gagaaaatttttgcaatctactcatctgacaaagggataatatccagaatctacaatgaa ctccaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaaggatatg aacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctca tcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctcacacca gttagaatggcaatcattaaaaagtcaggaaacaacaggaacaatggcaagcctttagcc agattgggagcagtaatggacgccttgctggatcagaagcacagcggacaccctgccgga accggagggatggaagtcagcgtcaggcctgcaaaggcggcaaacagccgtggtggacgg accctgactaaagcaaagcctaaaagtttgtggatttacatcacccagaaatctcatcat tggggatgtcttgatgatgccagcatttctgaatttctgaaaatatcacctctgtgcaac aagtctaaagttcagaaacctgaatccaaggaaactgtatgtcaggcctctgagctcaag ctaagccatcatatcccctgtgacctgcacgtgtacatccagatggcctga