GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:23:05 Sequence gi568815578f:31414552_31669216 : 254665 bp : 48.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5118 5272 155 0 2 45 74 120 0.565 6.02 1.02 Term + 16677 16702 26 1 2 122 49 4 0.125 -1.71 1.03 PlyA + 17321 17326 6 1.05 2.06 PlyA - 17377 17372 6 1.05 2.05 Term - 25624 25566 59 2 2 93 43 73 0.817 1.15 2.04 Intr - 25923 25819 105 1 0 85 76 18 0.490 0.49 2.03 Intr - 26444 26336 109 2 1 79 81 45 0.973 2.76 2.02 Intr - 29272 29039 234 1 0 69 97 77 0.821 4.59 2.01 Init - 34647 34426 222 0 0 63 98 108 0.759 7.86 2.00 Prom - 42541 42502 40 -5.46 3.00 Prom + 49159 49198 40 -3.86 3.01 Init + 61895 62234 340 1 1 73 75 183 0.635 12.94 3.02 Intr + 63277 63359 83 2 2 83 60 122 0.658 8.26 3.03 Intr + 67736 67937 202 1 1 95 105 254 0.997 26.66 3.04 Term + 69608 69879 272 0 2 69 48 355 0.999 25.25 3.05 PlyA + 70182 70187 6 1.05 4.00 Prom + 73608 73647 40 -3.26 4.01 Init + 82598 82891 294 1 0 48 86 145 0.392 7.49 4.02 Intr + 90844 90946 103 0 1 61 32 108 0.394 2.25 4.03 Intr + 93340 93392 53 2 2 128 45 16 0.407 -0.07 4.04 Intr + 99908 100183 276 1 0 63 97 426 0.801 38.71 4.05 Intr + 112933 113031 99 0 0 115 106 137 0.920 18.51 4.06 Intr + 123628 123710 83 0 2 95 71 106 0.899 8.04 4.07 Intr + 127877 128032 156 2 0 98 70 43 0.809 2.63 4.08 Intr + 130396 130484 89 1 2 134 86 65 0.993 10.51 4.09 Intr + 132888 133141 254 1 2 68 103 111 0.502 7.75 4.10 Intr + 134414 134563 150 1 0 81 113 108 0.128 12.96 4.11 Intr + 134656 134781 126 0 0 63 92 207 0.999 19.48 4.12 Intr + 135513 135570 58 2 1 146 101 98 0.999 15.36 4.13 Intr + 138988 139005 18 2 0 102 111 1 0.566 0.38 4.14 Intr + 140195 140278 84 0 0 104 99 91 0.977 11.59 4.15 Intr + 145060 145096 37 2 1 113 109 92 0.998 11.12 4.16 Intr + 147083 147185 103 2 1 88 80 200 0.991 19.38 4.17 Intr + 151659 151744 86 2 2 76 110 73 0.857 6.92 4.18 Term + 153527 153677 151 2 1 101 38 64 0.565 0.08 4.19 PlyA + 157628 157633 6 1.05 5.02 PlyA - 158976 158971 6 1.05 5.01 Sngl - 164325 163900 426 0 0 81 37 296 0.959 18.10 5.00 Prom - 168483 168444 40 -6.76 6.03 PlyA - 171745 171740 6 1.05 6.02 Term - 180384 180194 191 1 2 61 41 98 0.185 0.01 6.01 Init - 190381 190123 259 1 1 30 94 252 0.954 17.30 6.00 Prom - 190457 190418 40 -10.94 7.00 Prom + 190699 190738 40 -5.76 7.01 Init + 190837 191262 426 0 0 63 78 822 0.971 73.00 7.02 Term + 192033 192113 81 2 0 93 39 75 0.788 0.79 7.03 PlyA + 193323 193328 6 1.05 8.00 Prom + 195289 195328 40 -4.26 8.01 Init + 196130 196151 22 1 1 76 105 21 0.124 2.69 8.02 Intr + 218658 218885 228 1 0 78 58 69 0.386 0.74 8.03 Intr + 225382 225546 165 2 0 109 95 267 0.926 29.43 8.04 Intr + 228853 228984 132 2 0 47 113 163 0.998 15.32 8.05 Term + 230217 230353 137 1 2 81 38 257 0.998 18.18 8.06 PlyA + 232050 232055 6 1.05 9.02 PlyA - 233148 233143 6 1.05 9.01 Term - 251535 251398 138 0 0 150 47 254 0.999 25.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 51112 50955 158 2 2 103 37 69 0.848 1.40 S.002 Intr - 205236 205128 109 2 1 71 103 40 0.903 4.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_1|60_aa XEFAQDLMVFESVWQFPLVPSLSPAAVEDVPCFLFIFCPDYKFPEASPAMQNYVYGATRF >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_1|183_bp nntgagttcgcacaagatctgatggtttttgaaagtgtttggcagttccccctggtgccc tctctctcacctgctgccgtggaagatgtgccttgcttcctcttcatcttctgccctgat tataagtttcctgaggcctccccagccatgcagaactatgtttatggtgccactagattc tga >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_2|242_aa MTHNQEKNKSTNTEMTMMMKLADKDDKTAMINMLQVVKKVEEAMSMMRRNKDVKNIHGRA RWVKPMIPALWEAKEVEGKYSAHNFWGSWKFKSSSRSSSSGPESPARTHASFCQPDGGPT NKLGTKAFRVSPASSLLVDLNTQEVEIINVRKATPTCSLELGRKRRDGAAERAALDVVVV IYQLAPAAAPNCLNPVTSRRRHKHRLRKVREDGRVLKNKYKATGSSFRFLRDLRTNHSES FF >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_2|729_bp atgacccataaccaggagaaaaacaaatcaaccaacacagaaatgacaatgatgatgaaa ctagcagacaaggatgataaaacagctatgataaatatgctccaagtagtaaaaaaggtg gaggaagccatgagcatgatgagaagaaataaagatgtaaaaaatattcatggccgggca cgatgggtcaagcctatgatcccagcactttgggaggccaaggaagtagaggggaaatac tctgcccataatttctggggaagctggaagtttaaatccagctcccgcagcagcagcagt gggcctgagagccctgctagaacccatgcctcattctgccagcctgatggtggacccacc aacaagctagggacaaaggcctttcgtgtgtctcctgctagcagtctactggttgacctg aacactcaggaagtagagataattaatgtgagaaaggccacccccacatgctccctggag ctgggcaggaagaggagggatggagccgctgagagagcagctctggatgttgtagttgtc atctatcagctggcacctgctgctgctccaaactgcctgaacccagtgaccagcaggagg cgccacaaacaccggctccggaaggtcagggaagacggcagagttttgaagaataaatac aaagcaacagggtcatccttccgcttcctcagagatctgagaaccaaccactcagaatcc ttcttctga >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_3|298_aa MTLNTEQEAKTPLHRRASTPLPLSPRGHQPGRLSTVPSTQSQHPRLGQSASLNPPTQKPS PAPDDWSSESSDSEGSWEALYRVVLLGDPGVGKTSLASLFAGKQERDLHEQLGEDVYERT LTVDGEDTTLVVVDTWEAEKLDKSWSQESCLQGGSAYVIVYSIADRGSFESASELRIQLR RTHQADHVPIILVGNKADLARCREVSVEEGRACAVVFDCKFIETSATLQHNVAELFEGVV RQLRLRRRDSAAKEPPAPRRPASLAQRARRFLARLTARSARRRALKARSKSCHNLAVL >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_3|897_bp atgacactcaacaccgagcaggaagcaaagacccctctgcaccggcgagccagcacccca ctgcccctgtccccacggggccaccagcctggccgcctgagcacagtgccttccactcaa tcccagcatccccggctgggccaatcagcctccctcaaccctcccacccagaaaccttca cctgccccagatgattggtcttctgaatccagcgactctgaaggctcctgggaggctctc taccgtgtggtgctacttggagatcctggagtggggaagaccagcttggccagcctcttt gcagggaagcaagagagggacctccatgaacagctgggagaagatgtatatgagaggacc ctcacggtggatggagaagacaccacactggtggtcgtggacacctgggaggccgagaaa ctggataaaagctggagccaggagtcatgcctgcaggggggcagtgcctatgtcatcgta tactccatcgcagaccgaggcagctttgagagtgcctctgagctccgcatccagctgcgg cgcacacatcaggcagaccatgtgcccatcatcctcgtgggcaacaaggcagacttggcc cgctgccgagaagtctctgtggaagagggccgcgcctgcgctgtggtgttcgactgtaaa ttcatcgagacatccgccacgctgcagcacaatgtggccgagctcttcgagggcgtggtg cgccaactgcgcttgcgccgccgggacagtgcggccaaggaacccccagcaccccgacgg ccggccagcctagcccagcgcgctcgtcgcttcctggcacgcctgacagcccgcagcgca cgccgccgggcactcaaggcccgctccaagtcctgccacaatctggccgtgctctga >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_4|739_aa MNLGINHSWNGDAGKLDSLGMRTADSGRVRWPGRRIDPILRDLRQGWLRSESHQPHQMHP DLYDISAVPLPGKSPLATHNAHNSTIMDTTKALVQYTKQAWFSVKENEDKADKNRKEMQR EDPGSMDILASRDYNLNKVSRALASFLDHEGNVAFPAEPVSPPASLLQQPELESDPERTL AMDSALSDPHNGSAEAGGPTNSTTRPPSTPEGIALAYGSLLLMALLPIFFGALRSVRCAR GKNASDMPETITSRDAARFPIIASCTLLGLYLFFKIFSQEYINLLLSMYFFVLGILALSH TIRGEGHQDVACQWEAMTFTFDMSHQGPGRISACYKSLHSESFSKHFSRCLAGTSPFMNK FFPASFPNRQYQLLFTQGSGENKEGDSTGALPIPFVSLLSPASPWIMFKKFDEKESVSNC IQLKTSVIKGIKSQLVEQFPGIEPWLNQIMPKKDPVKIVRCHEHTEILTGLTGGRGSPAS GSGLTWPLCSEIINYEFDTKDLVCLGLSSIVGVWYLLRKHWIANNLFGLAFSLNGVELLH LNNVSTGCILLGGLFIYDVFWVFGTNVMVTVAKSFEAPIKCDKTKAVVFPQDLLEKGLEA NNFAMLGLGDVVIPGIFIALLLRFDISLKKNTHTYFYTSFAAYIFGLGLTIFIMHIFKHA QPALLYLVPACIGFPVLVALAKGEVTEMFSYESSAEILPHTPRLTHFPTVSGSPASLADS MQQKLAGPRRRRPQNPSAM >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_4|2220_bp atgaatctgggcatcaaccactcctggaatggagatgctggcaaactcgactctctgggg atgcgcacagctgactcagggagggtgcggtggccaggaagaaggatcgatcctattctc agagatcttcgccagggctggttaagatctgagagccatcagccacatcaaatgcatcct gatctctatgacatcagtgcggtccctctgcctggtaaatctcctcttgctactcacaat gcccataattccaccattatggatacaaccaaggctcttgttcagtatacaaagcaagcc tggttttctgtgaaagagaatgaagacaaagcagataaaaacagaaaggagatgcagaga gaggatcctggcagcatggacatcctggcttccagggactacaatttgaacaaggtgtcc agagctttagcatccttcttagatcatgaggggaacgtggctttccctgcagagccggtg tctccgcctgcgtccctgctgcagcaaccggagctggagtcggatcccgaacgcaccctc gccatggactcggccctcagcgatccgcataacggcagtgccgaggcaggcggccccacc aacagcactacgcggccgccttccacgcccgagggcatcgcgctggcctacggcagcctc ctgctcatggcgctgctgcccatcttcttcggcgccctgcgctccgtacgctgcgcccgc ggcaagaatgcttcagacatgcctgaaacaatcaccagccgggatgccgcccgcttcccc atcatcgccagctgcacactcttggggctctacctctttttcaaaatattctcccaggag tacatcaacctcctgctgtccatgtatttcttcgtgctgggaatcctggccctgtcccac accatcaggggagaagggcaccaggatgtggcttgtcagtgggaagcaatgacctttacg tttgacatgagccatcaagggccaggaagaatctctgcctgctacaaaagtctgcattca gagtcattcagcaagcattttagcagatgtctagcaggcaccagccccttcatgaataag ttttttccagccagctttccaaatcgacagtaccagctgctcttcacacagggttctggg gaaaacaaggaaggggactccaccggagccttgccaattccgtttgtttccctgttgtcg cccgcttcaccctggatcatgttcaagaagtttgatgaaaaggaaagtgtgtccaactgc atccagttgaaaacgtcagttattaagggcattaagagccaactggtagagcaatttcca ggtattgaaccatggcttaatcaaatcatgcctaagaaagatcctgtcaaaatagtccga tgccacgaacatacagaaatccttaccgggctgacaggtgggaggggtagccctgcctca gggagtggacttacctggcctctctgctcagagatcatcaattatgaatttgacaccaag gacctggtgtgcctgggcctgagcagcatcgttggcgtctggtacctgctgaggaagcac tggattgccaacaacctttttggcctggccttctcccttaatggagtagagctcctgcac ctcaacaatgtcagcactggctgcatcctgctgggcggactcttcatctacgatgtcttc tgggtatttggcaccaatgtgatggtgacagtggccaagtccttcgaggcaccaataaaa tgtgacaaaactaaggcagtggtgtttccccaggatctgctggagaaaggcctcgaagca aacaactttgccatgctgggacttggagatgtcgtcattccagggatcttcattgccttg ctgctgcgctttgacatcagcttgaagaagaatacccacacctacttctacaccagcttt gcagcctacatcttcggcctgggccttaccatcttcatcatgcacatcttcaagcatgct cagcctgccctcctatacctggtccccgcctgcatcggttttcctgtcctggtggcgctg gccaagggagaagtgacagagatgttcagctacgagtcctcggcggaaatcctgcctcat accccgaggctcacccacttccccacagtctcgggctccccagccagcctggccgactcc atgcagcagaagctagctggccctcgccgccggcgcccgcagaatcccagcgccatgtaa >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_5|141_aa MGLAGPALGAAGRPAAPGSEGLSTRASSCRGCTGSPSSAGPPALRWISCRALAASPWDTA GDLQPAMPPRPMGSCAAQASPTSAAPSSMAPGPIDPPRAEECGRHTAWDWQAAPPAAPVR DPQGEASRAPESSGDLENLYV >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_5|426_bp atgggcttggcgggccccgcacttggagcggctggccggcccgccgccccaggcagtgag gggcttagcacccgggccagcagctgcagagggtgcaccgggtccccaagcagtgctggc ccaccggcgctgcgctggatttcttgccgggccttagctgcctccccgtgggacacggct ggggacctgcagcccgccatgcccccccgccccatgggctcctgtgcagcccaagcctcc cctacgagcgccgccccctcctccatggcgcccggtcccatcgaccccccaagggctgag gagtgcggtcggcacacggcctgggattggcaggcagctccacctgcagcccctgtgcgg gatccacagggtgaagccagccgggctcctgagtctagtggggacttggagaacctttat gtctag >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_6|149_aa MREIEARTEKRLALDSPESLGFRTLLEDASRTLIYKYQDRNLKRALLYICSPGLSQFLNN VAVRVPETRRAYPDPSRVGAINSGKSKQYTNAHTHTPNVAFICTTDTTDKPNSHTPIDTT CQMHNAQQYTIIYNQHTDLRAHYRNTKSA >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_6|450_bp atgagagaaattgaggcccgaacggagaagcggctggctctggatagcccagagagcctg ggtttccggaccctcttggaagatgcttctcggaccctgatctacaaatatcaggaccga aatttaaagcgggcgctcctttacatctgctcccctgggctttctcaattcctaaataat gttgctgttcgtgttcctgagacccggagggcctacccagatccctcccgggtgggagca attaattcgggaaagtctaagcaatatacaaatgcacacacccacacgcccaacgtggca tttatatgtacgactgacacaacagacaaacccaacagccacacaccaatagacacaacc tgccaaatgcacaacgcacaacagtacaccataatatacaaccaacacactgacctccgc gcacactaccgaaacaccaaatctgcataa >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_7|168_aa MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG RGLPVRAPLSTLNGEISALTAELVLGGELEGYIWIVALTGLNECFRCL >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_7|507_bp atgaaagtcgccagtggcagcaccgccaccgccgccgcgggccccagctgcgcgctgaag gccggcaagacagcgagcggtgcgggcgaggtggtgcgctgtctgtctgagcagagcgtg gccatctcgcgctgcgccgggggcgccggggcgcgcctgcctgccctgctggacgagcag caggtaaacgtgctgctctacgacatgaacggctgttactcacgcctcaaggagctggtg cccaccctgccccagaaccgcaaggtgagcaaggtggagattctccagcacgtcatcgac tacatcagggaccttcagttggagctgaactcggaatccgaagttggaacccccgggggc cgagggctgccggtccgggctccgctcagcaccctcaacggcgagatcagcgccctgacg gccgagctggttctgggaggagaattggagggctacatctggattgttgctcttaccggc ctgaatgagtgtttccggtgtctttaa >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_8|227_aa MAAGKKGGLSLSQSSQHVGPVTTSGLNAVSGVPSTLGPPAVPGEDPYSSALGPRVACLKG QSVSSQVQDLERRLRSCPAGYIHPRGGGKMSPYTNCYAQRYYPMPEEPFCTELNAEEQAL KEKEKGSWTQLTHAEKVALYRLQFNETFAEMNRRSNEWKTVMGCVFFFIGFAALVIWWQR VYVFPPKPITLTDERKAQQLQRMLDMKVNPVQGLASRWDYEKKQWKK >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_8|684_bp atggctgccggaaagaaaggaggtctcagcctatcccagagctctcagcatgtcgggcct gtcacgacaagtggcctgaatgccgtgtccggagtcccttccacccttggaccccccgcg gttccaggagaagacccttattcctcggctctgggaccccgagtggcctgccttaaggga cagtccgtctcttcccaggttcaggaccttgaaaggaggctccgcagttgtcctgcaggc tatattcacccccgtggtggggggaagatgtccccctacaccaactgctatgcccagcgc tactaccccatgccagaagagcccttctgcacagaactcaacgctgaggagcaggccctg aaggagaaggagaagggaagctggacccagctgacccacgccgaaaaggtggccttgtac cggctccagttcaatgagacctttgcggagatgaaccgtcgctccaatgagtggaagaca gtgatgggttgtgtcttcttcttcattggattcgcagctctggtgatttggtggcagcgg gtctacgtatttcctccaaagccgatcaccttgacggacgagcggaaagcccagcagctg cagcgcatgctggacatgaaggtgaatcctgtgcagggcctggcctcccgctgggactat gagaagaagcagtggaagaagtga >gi568815578f:31414552_31669216|GENSCAN_predicted_peptide_9|45_aa DTFVELYGNNAAAESRKGQERFNRWFLTGMTVAGVVLLGSLFSRK >gi568815578f:31414552_31669216|GENSCAN_predicted_CDS_9|138_bp gatacttttgtggaactctatgggaacaatgcagcagccgagagccgaaagggccaggaa cgcttcaaccgctggttcctgacgggcatgactgtggccggcgtggttctgctgggctca ctcttcagtcggaaatga