GENSCAN 1.0    Date run: 4-Nov-116    Time: 14:49:20

Sequence gi568815596r:101177241_101408588 : 231348 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -   5361   5356    6                               1.05
 1.03 Term -  19390  19298   93  1  0   66   45    78 0.260  -0.87
 1.02 Intr -  24966  24944   23  1  2  108  115    16 0.479   3.56
 1.01 Init -  43352  43313   40  2  1  105   94    30 0.788   5.65
 1.00 Prom -  53781  53742   40                              -0.86

 2.00 Prom +  56822  56861   40                              -4.16
 2.01 Init +  75725  76238  514  1  1   32   82   729 0.626  59.77
 2.02 Intr +  76609  76655   47  2  2   53   48   104 0.829   1.03
 2.03 Intr +  80598  80715  118  2  1   80  107   148 0.860  15.94
 2.04 Intr +  85299  85451  153  1  0   77   56   118 0.953   7.54
 2.05 Intr +  87605  87807  203  0  2   61   89   107 0.985   7.10
 2.06 Intr +  89437  89639  203  0  2   50  123   136 0.969  11.38
 2.07 Term +  91230  91338  109  0  1   50   48    82 0.861  -1.62
 2.08 PlyA +  92753  92758    6                               1.05

 3.03 PlyA -  94005  94000    6                               1.05
 3.02 Term -  94504  94440   65  2  2  118   39    22 0.357  -1.65
 3.01 Init -  96043  95887  157  1  1   80   90    99 0.875   7.99
 3.00 Prom -  96705  96666   40                              -4.76

 4.07 PlyA -  96848  96843    6                               1.05
 4.06 Term -  97037  96861  177  2  0    9   42   125 0.871  -2.31
 4.05 Intr -  97498  97352  147  1  0   31   94   165 0.203  11.83
 4.04 Intr - 117941 117691  251  0  2  117   95   128 0.731  13.56
 4.03 Intr - 126910 126826   85  1  1  107  100   -31 0.460  -0.61
 4.02 Intr - 130805 130702  104  0  2   31  107    49 0.480   0.99
 4.01 Init - 131348 130889  460  2  1  105   88   965 0.909  92.12
 4.00 Prom - 135261 135222   40                              -5.76

 5.00 Prom + 136715 136754   40                              -7.06
 5.01 Sngl + 139141 139476  336  0  0   75   37   251 0.793  14.73
 5.02 PlyA + 140025 140030    6                               1.05

 6.00 Prom + 140872 140911   40                              -2.16
 6.01 Sngl + 142482 142886  405  2  0   60   38   228 0.982  11.28
 6.02 PlyA + 143019 143024    6                               1.05

 7.00 Prom + 143778 143817   40                              -2.46
 7.01 Init + 148388 148441   54  1  0   74  103    52 0.715   6.58
 7.02 Term + 152604 152663   60  2  0   74   41    37 0.085  -4.60
 7.03 PlyA + 152961 152966    6                               1.05

 8.03 PlyA - 154472 154467    6                               1.05
 8.02 Term - 164804 164658  147  2  0  121   54    48 0.805   2.60
 8.01 Init - 165959 165894   66  2  0   81  105    54 0.956   7.47
 8.00 Prom - 169178 169139   40                              -6.06

 9.00 Prom + 170082 170121   40                               0.84
 9.01 Init + 174155 174226   72  1  0   68   70    30 0.241   0.27
 9.02 Intr + 178509 178546   38  2  2   83   84    47 0.558   0.86
 9.03 Intr + 178724 178773   50  2  2   73  105    54 0.349   3.92
 9.04 Intr + 187112 187235  124  0  1   31   99    66 0.028   1.64
 9.05 Intr + 187794 187905  112  0  1   14   77    52 0.572  -2.92
 9.06 Intr + 187951 188130  180  0  0  110   83    53 0.815   7.06
 9.07 Intr + 194413 194660  248  0  2   73   85    57 0.101   0.16
 9.08 Term + 198647 198818  172  2  1   89   39   139 0.845   6.40
 9.09 PlyA + 200384 200389    6                               1.05

10.06 PlyA - 201967 201962    6                               1.05
10.05 Term - 203314 203275   40  0  1   63   42    37 0.240  -6.74
10.04 Intr - 204251 203812  440  2  2   63   53   234 0.813   9.71
10.03 Intr - 204891 204714  178  2  1   72   36    76 0.981   0.72
10.02 Intr - 206462 206293  170  2  2   52   74   204 0.998  14.14
10.01 Init - 210442 209777  666  1  0   67  105   766 0.724  69.43
10.00 Prom - 212666 212627   40                              -2.56

11.03 PlyA - 213041 213036    6                               1.05
11.02 Term - 220484 220308  177  2  0   78   50    94 0.860   2.29
11.01 Init - 225312 225196  117  0  0   99   97   120 0.352  14.35
11.00 Prom - 228227 228188   40                              -1.46

12.02 PlyA - 229518 229513    6                               1.05
12.01 Term - 230769 230549  221  1  2   40   38   185 0.766   5.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 140654 141190  537  1  0   49   48   196 0.855   7.78

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_1|51_aa
MAIALSNDNTVKKEIHVIDGKRVQDTASGLSDNTCSNGQSGQHTDCHSPMV

>gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_1|156_bp
atggccattgcactatcaaatgacaacacagtcaaaaaagaaatacatgttattgatggc
aagcgggtacaggatacagcttcaggcttgtctgacaacacctgttccaatggtcagagt
ggacagcacactgactgccacagtcctatggtctga

>gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_2|448_aa
MPGGGASAASGRLLTAAEQRGSREAAGSASRSGFGGSGGGRGGASGPGSGSGGPGGPAGR
MSLTPKELSSLLSIISEEAGGGSTFEGLSTAFHHYFSKADHFRLGSVLVMLLQQPDLLPS
AAQRLTALYLLWEMYRTEPLAANPFAASFAHLLNPAPPARGGQEPDRPPLSVSKEYVEVS
IKKLKKLLMLAPPRELFKKTPRQIALMDVGNMGQSVDISGLQLALAERQSELPTQSKASF
PSILSDPDPDSSNSGFDSSVASQITEALVSGPKPPIESHFRPEFIRPPPPLHICEDELAW
LNPTEPDHAIQWDKSMCVKNSTGVEIKRIMAKAFKSPLSSPQQTQLLGELEKDPKLVYHI
GLTPAKLPDLVENNPLVAIEMLLKLMQSSQITEYFSVLVNMDMSLHSMEVVNRASSLNVK
HVFWRRTNLKRLQQLMTVSLEHYILMPY

>gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_2|1347_bp
atgcccggcggaggggcgagcgcggcgtctggccggcttctcaccgccgcggagcaaaga
gggtcccgggaagcggcagggtcggcgtccaggagcggcttcgggggctccggcggcggc
agaggcggagcaagcggccccgggtccgggagcggaggcccggggggccccgcgggcagg
atgagcttgaccccgaaggagctctcgagcctgctgagcatcatatcggaggaggcgggc
ggcggcagcaccttcgagggcctgtccaccgccttccaccactacttcagcaaggccgac
cacttccgcctgggctcggtgctcgtcatgctgctccagcagcccgacctgctgcctagc
gcggcgcagcgcctcacggcgctctacctgctctgggagatgtaccgcaccgagccgctg
gccgccaaccccttcgccgccagcttcgcgcacctgctcaaccccgcgccgcccgcccgc
ggcggccaggaacccgaccgccctccgctctcagtctcaaaggagtacgtggaagtcagt
attaagaagctgaagaagttgctgatgctggcacccccacgggaactcttcaaaaagacg
cctcgccagattgcactgatggacgttggaaacatgggccagtctgtggacattagtggg
cttcagttagccttggccgaacgccaatctgaattgccaacgcaaagcaaagcgagcttc
cccagtattctcagtgacccagacccggattcttctaattctggatttgacagctcagtt
gcctctcagatcacagaagctttagtcagcggaccaaagccacctattgaaagccatttt
cgaccagagtttattcgtccaccgcctccactccacatttgtgaggatgaacttgcttgg ctaaaccccacggagcctgaccacgcgatccagtgggataaatcgatgtgtgttaagaat agcactggtgtggagatcaaacgaataatggccaaagccttcaaaagccccttatcctct ccccaacaaacacagctacttggtgagttggaaaaagaccccaaacttgtctaccatatt ggcctcaccccagccaaacttcctgaccttgtggaaaacaaccctttagtcgctatagaa atgttgctgaaattaatgcagtcaagccagatcactgagtatttctctgtcctggtcaat atggacatgtctttacattcaatggaagttgtaaatcgtgcatcatccttaaacgtgaaa cacgttttttggcgacgcacaaatctcaagagactgcagcagctcatgacagtgtccttg gagcactacatcctcatgccttattga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_3|73_aa MEKEAVSVTVLLSAAPCLLSCFLGSSVSGLAFWVSQQKTKGPERCKNTHHLAGDLKILKL EIGTRKSKEFKAS >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_3|222_bp atggagaaggaggcagtgtccgtgactgtgctgctctccgcagccccctgcctgctgtcc tgtttcctcggctcctcggtgtctggactggcgttctgggtttcccagcagaaaactaaa gggccagagaggtgtaaaaacacacaccacttggcaggtgatttgaagatattaaagcta gaaattggaactagaaaatcaaaagaattcaaggcatcttaa >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_4|407_aa MAWRRREASVGARGVLALALLALALCVPGARGRALEWFSAVVNIEYVDPQTNLTVWSVSE SGRFGDSSPKEGAHGLVGVPWAPGGDLEGCAPDTRFFVPEPGGRGAAPWVALVARGGCTF KDKVLVAARRNASAVVLYNEERYGNITLPMSHAGREAGLARLSGLWLLELSGKPSFARCG SPIAVGRECWECKPPHLSKERDLMQVLQKYQSKLDQGTGNIVVIMISYPKGREILELVQK GIPVTMTIGVGTRHVQEFISGQSVVFVAIAFITMMIISLAWLIFYYIQRFLYTGSQIGSQ LMKLTFKKKAVSFADAAAAQGPLLPAMVNPTMFFHIAVDGEPLGCVSFERERFPVFYLPC QDAKTEWLDCKHVVFGKVKDGMNIVEVMEHLGSKNGKISNQQEDHHC >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_4|1224_bp atggcgtggcggcggcgcgaagccagcgtcggggctcgcggcgtgttggctctggcgttg ctcgccctggccctgtgcgtgcccggggcccggggccgggctctcgagtggttctcggcc gtggtaaacatcgagtacgtggacccgcagaccaacctgacggtgtggagcgtctcggag agtggccgcttcggcgacagctcgcccaaggagggcgcgcatggcctggtgggcgtcccg tgggcgcccggcggagacctcgagggctgcgcgcccgacacgcgcttcttcgtgcccgag cccggcggccgaggggccgcgccctgggtcgccctggtggctcgtgggggctgcaccttc aaggacaaggtgctggtggcggcgcggaggaacgcctcggccgtcgtcctctacaatgag gagcgctacgggaacatcaccttgcccatgtctcacgcgggccgcgaagcgggcctggca 
cggctctcaggcctctggctgttggaactttcagggaagccctcgtttgctcggtgcggc tccccgatcgctgttgggagggagtgctgggagtgtaagccaccacacctgtctaaggaa agggatttaatgcaggtattgcaaaagtatcaaagtaagcttgaccaaggaacaggaaat atagtggtcattatgattagctatccaaaaggaagagaaattttggagctggtgcaaaaa ggaattccagtaacgatgaccataggggttggcacccggcatgtacaggagttcatcagc ggtcagtctgtggtgtttgtggccattgccttcatcaccatgatgattatctcgttagcc tggctaatattttactatatacagcgtttcctatatactggctctcagattggaagtcag ctaatgaagctgacttttaaaaagaaggctgtgagctttgcagatgctgctgccgcccag ggccccctgcttccagccatggtcaaccccaccatgtttttccacattgctgtcgatggc gagcccttgggctgtgtctccttcgagcgtgaacgtttcccagttttttatctgccctgc caagatgccaagacagagtggttggattgcaagcatgtggtctttggcaaggtgaaagat ggcatgaatattgtggaggtcatggagcacttggggtccaagaatggcaagatcagcaat cagcaagaagatcaccattgctga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_5|111_aa MMRNQSRKDENSKNQSASSPPKERSSLPATEQSWTENDFDELREGFRRSVITNFSKLKED VQTHRKEAKNLEKRLDEWLTRINSIEKTLNDLMELKTMARELCDACTSFSS >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_5|336_bp atgatgagaaaccagagcagaaaagatgaaaattctaaaaatcagagtgcctcttctcct ccaaaggaacgcagctccttgccagcaacagaacaaagctggacggagaatgactttgat gagttgagagaaggcttcagacgatcagtaataacaaacttctccaagctaaaggaggat gttcaaacccatcgcaaagaagctaaaaaccttgagaaaagattagacgaatggctaact agaataaacagcatagagaagaccttaaatgacctgatggagctgaaaaccatggcacga gaactatgtgacgcatgcacaagcttcagtagctga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_6|134_aa MSELPFTIATKRIKYLGIQLTRDVKDLFKENYKPLLKEIREETNKWKNIPCSWIGRINFV KMAMLPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKTILSQKNKAGGITLP DFKLYYKATVTKTA >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_6|405_bp atgagtgaactcccattcacaattgctacaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaga gaggagacaaacaaatggaagaacattccatgctcatggataggaagaatcaatttcgtg aaaatggccatgctgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaactggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcacactacct 
gacttcaaactatactacaaggctacagtaaccaaaacagcatag >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_7|37_aa MVFDEKLAVNLIEDHSYVNTPLRLSDFELAVLIECIL >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_7|114_bp atggtttttgatgagaaattagcagttaatcttattgaggatcactcgtatgtgaataca cccttaaggttaagtgactttgaacttgctgtgctcatagaatgcatcttatag >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_8|70_aa MDGGGDREHGGWGEPDAAVTVKHPTLSLLSTVAVPGNGPRLREEQLQRTETRLHLHPSQA VSSLVLTATL >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_8|213_bp atggacggtggaggtgaccgggaacatggtggctggggtgagccagatgccgcagtcaca gtgaagcaccccaccctctccctgctctcgacagtggctgtccctggcaatggccccagg ctcagggaggagcagctccagcggaccgagaccaggctccatctgcatccttcacaagcc gtgtcttccctggtcctcacggcgaccctgtga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_9|331_aa MLMRLFIGLCGEAGNEPHRKHLQREGEYTGEQVQELGFLSGIQEESGHTDLKADASSTSH PNPSCNNQKGLQTLPNVPGEESLSSLFEIHWWKCSKSGVWVVRTWLEKLECRMRLPHSAE GKNKRMAAERKMGNPRTNRILLFGASLSGICRKHCVVPETAPFQAGCFQTSQLVNVPVKL WDPELPLKLQQEGKLCCFEGHTGQQPQLALALPVLQRPQGQTSRMEVAAQTKVSKAVLED QGMVWAFQREEPWTGKSQEAESQERKTMKSWGINRHWGLGQVVTDCVQGGGSQKAVKAGL SAVSAFSFAGLCCFNCHDMGICKAAVMLWKL >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_9|996_bp atgctgatgcggctcttcatagggctgtgtggggaagcaggcaatgaaccacacagaaaa cacctgcagagagagggggagtacacaggtgagcaggttcaggagctggggttcttgtct ggcatccaggaagaatcaggtcatacagacttaaaggctgatgccagcagtacctcccac cccaacccaagttgtaacaaccaaaaaggtctccagacattgccaaatgtccctggggag gagtctctcagctccctgtttgagatccactggtggaaatgcagcaaatctggagtgtgg gtggtgagaacctggctggagaagctggagtgtaggatgcgccttccacacagtgctgaa ggcaagaacaagaggatggcagcagagaggaaaatgggtaaccctagaacaaataggatt ctcctgttcggtgctagcctttccggcatctgcagaaagcactgtgttgttcctgagact gctcctttccaggctggctgctttcagacatcccagcttgtcaatgttcctgttaaatta tgggatccagaactgcccttgaaactccagcaggaggggaagctttgctgcttcgagggt cacactggccagcagccccagctggccttagctctccccgtgctgcagagacctcaagga caaacaagcaggatggaagtggctgcacaaaccaaggtcagcaaggctgtcctagaagac 
cagggaatggtatgggcttttcagagggaggagccttggactgggaagtcccaggaggct gagtcacaagagaggaagacaatgaaatcctggggaataaatcgtcactggggccttgga caagtcgttactgactgtgttcaagggggtggatctcagaaggctgtcaaggcaggcctg tcggcagtttcagctttcagctttgctgggctgtgctgtttcaactgtcacgacatgggc atctgcaaagctgctgtcatgctgtggaagctctga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_10|497_aa MRGVCVEPGLLRAWTSELRARLQGFAGTRAGDRARGSCRGGKGARAPCFRRSGALPGSPA LALLAAPAAGRAACKMSVRRGRRPARPGTRLSWLLCCSALLSPAAGYVIVSSVSWAVTNE VDEELDSASTEEAMPALLEDSGSIWQQSFPASAHKEDAHLRPRAGAARARPPPAPPGMFS YRREGGQTASAPPGPRLRAATARSLAHASVWGCLATVSTHKKIQGLPFGNCLPVSDGPFN NSTGIPFFYMTAKDPVVADLMKNPMASLMLPESEGEFCRYPESSRAGACGSQNPEHSRSG DRQQNIYLGQSCGSTPAGGGFVNWQPPGKWQQELERGAGLRLPIIGVSPDVTHLCEVRIP SGALHFRFHLQNLGNTEHRDLTSQSHVAGIEKEPRINKPSKGFLGPLEYSEVSGRSVRSY PNTWVELRRTARQGTSKVPLLPVAGPEEVSSKGRDRRHVLQSFTASPTGAGRFKKAPMSY DDNDRGKTKPELGFTGP >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_10|1494_bp atgcgcggcgtgtgcgtggaacctggtcttctgcgagcctggacttctgagcttcgcgcg cggctgcagggattcgccgggactcgcgcgggggaccgcgcccggggatcctgcaggggc gggaagggggctcgggctccttgcttccgccgcagtggggcgctgccgggctccccggca ctagcgctgctggcggccccggcggccgggcgtgctgcctgcaagatgtccgtgcgccgc ggccggcggccggcgcggccggggacccgcctctcctggctgctgtgctgcagcgccctg ctgtccccggccgcgggctacgtgatcgtgagctccgtgtcttgggccgtcaccaacgag gtggacgaggagctggacagcgcctccactgaggaggctatgcccgcgctgctagaggat tcgggcagcatctggcagcaaagcttccccgcctctgcccacaaggaggacgcgcacctg cggccccgggcgggcgccgcccgggccaggccgccccccgcgccacccgggatgttctcc taccggcgcgagggcggccagacggccagtgcgcccccgggccctagactgcgcgccgcc accgcccgctccctggcccatgccagcgtctggggctgcctggccaccgtgtccacccac aagaagatccaaggactgccatttgggaactgcctgcccgtcagtgatggccccttcaac aatagcactgggattcctttcttctacatgacagccaaggaccccgtggtggctgatctg atgaagaaccccatggcctcgctgatgctgccagaatcagaaggggagttctgcaggtat cctgaaagcagtcgtgctggagcatgtggctcccagaacccagagcacagccgctctggg gacaggcaacagaatatatacttgggacaaagctgcggcagcactccagctgggggagga tttgtcaactggcaacccccaggcaaatggcagcaggagctggagcgaggagcaggtcta 
cgacttcccataatcggcgtaagcccagatgtgactcacctgtgtgaggttcgcattcca agtggagctttacattttaggtttcaccttcagaatcttggcaatactgaacacagggac ttaacaagtcaaagccatgtggctggcattgaaaaagagccccgaataaacaagccaagc aaggggttcctagggccactggaatatagcgaggtatctggtagaagtgtccgcagctac ccaaacacctgggtagagcttcgaaggacagcccggcaaggtaccagcaaggtgcccctc ctgcctgtcgccgggcctgaagaggtctccagcaaaggcagagatcgtcgtcacgtgctt cagagcttcacagcatctcccacaggagcggggagattcaagaaagcgcccatgtcttac gatgataacgacagggggaagacaaagccagaactgggctttactggaccataa >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_11|97_aa MEPLGVMPTHMGQGRYPVGVSNMVLRILGFLVDTAMGNKLIQVLLEDETTESAVKLSLPM GQEALITLKDGQQFVIQISDVPQSSEDIYFRENNANV >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_11|294_bp atggagccactgggggtgatgcccacacacatgggccagggccgatatcccgtgggtgtg agcaacatggtcctcaggatcctgggcttcctggtggacactgccatgggcaataagctc atccaggtgctgttggaagatgaaaccactgaaagcgcagttaaactcagccttcctatg ggacaagaagccctcataaccctaaaagatggacaacaatttgtgattcagatatcagat gtaccccaaagctctgaagatatttatttcagagaaaacaatgctaatgtgtga >gi568815596r:101177241_101408588|GENSCAN_predicted_peptide_12|73_aa XWLIFTKPDYCQSSAQTIKMHTFGLVRDAPWLQLASGSQRAEGSSGPGGASTLGDMKCHT VDVTGRKLNGVHG >gi568815596r:101177241_101408588|GENSCAN_predicted_CDS_12|222_bp ncttggctgatctttacaaaaccagattattgtcaatcaagtgctcagactattaagatg cacacctttgggcttgtcagagatgccccatggcttcagttagcatctggcagtcagcga gcagagggcagttcgggccctggaggagcttctacgttgggggacatgaagtgccacact gtggacgtgacgggaagaaagctcaatggtgttcatggctga