GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:50:14 Sequence gi568815592f:36944860_37145186 : 200327 bp : 49.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9902 10072 171 1 0 108 115 336 0.969 35.74 1.02 Intr + 14286 14507 222 2 0 108 64 479 0.910 45.82 1.03 Intr + 16592 16701 110 1 2 64 75 158 0.994 11.18 1.04 Intr + 17027 17135 109 2 1 81 45 65 0.469 1.79 1.05 Intr + 17269 17377 109 0 1 38 90 82 0.474 3.26 1.06 Intr + 18076 18753 678 2 0 94 110 228 0.940 17.19 1.07 Term + 18964 19085 122 2 2 107 49 34 0.945 0.04 1.08 PlyA + 19952 19957 6 1.05 2.13 PlyA - 21504 21499 6 1.05 2.12 Term - 24115 24044 72 1 0 118 37 52 0.638 1.01 2.11 Intr - 25255 25180 76 0 1 101 99 31 0.639 5.02 2.10 Intr - 25614 25547 68 0 2 69 77 76 0.820 2.30 2.09 Intr - 25835 25788 48 2 0 107 98 80 0.998 9.78 2.08 Intr - 26197 26084 114 1 0 81 80 23 0.694 1.44 2.07 Intr - 27937 27793 145 0 1 101 77 129 0.597 13.38 2.06 Intr - 32391 32340 52 1 1 92 94 16 0.849 0.57 2.05 Intr - 32832 32775 58 0 1 83 103 41 0.823 3.56 2.04 Intr - 33296 33219 78 2 0 103 90 109 0.908 12.35 2.03 Intr - 33752 33646 107 0 2 112 105 119 0.999 15.93 2.02 Intr - 36813 36729 85 0 1 47 113 18 0.731 -0.41 2.01 Init - 41260 40994 267 1 0 93 75 307 0.845 24.88 2.00 Prom - 44101 44062 40 -7.26 3.00 Prom + 48454 48493 40 -6.96 3.01 Init + 60959 61026 68 1 2 85 80 69 0.747 6.34 3.02 Intr + 63975 64206 232 0 1 92 51 186 0.831 13.08 3.03 Intr + 66114 66195 82 2 1 119 -44 131 0.975 2.21 3.04 Intr + 66821 66995 175 0 1 81 81 261 0.982 23.70 3.05 Intr + 68750 68906 157 2 1 87 94 248 0.999 25.31 3.06 Intr + 69103 69241 139 0 1 71 42 237 0.866 17.44 3.07 Intr + 69787 69845 59 2 2 86 103 82 0.992 8.10 3.08 Intr + 70033 70179 147 0 0 57 91 306 0.999 28.23 3.09 Intr + 70909 71001 93 0 0 100 116 128 0.997 16.96 3.10 Intr + 72348 72443 96 2 0 78 52 53 0.648 0.91 3.11 Intr + 75682 75761 80 0 2 99 60 122 0.977 8.85 3.12 Intr + 75850 75880 31 1 1 82 80 9 0.968 -2.37 3.13 Intr + 76653 76745 93 2 0 112 84 146 0.999 16.76 3.14 Intr + 77380 77511 132 0 0 74 73 145 0.985 12.44 3.15 Intr + 79914 79934 21 2 0 117 81 30 0.802 2.94 3.16 Intr + 80933 81188 256 1 1 126 -3 264 0.691 18.12 3.17 Intr + 81232 81349 118 2 1 151 26 68 0.923 6.42 3.18 Intr + 81403 81466 64 1 1 90 71 28 0.604 0.02 3.19 Intr + 82570 82716 147 0 0 117 110 164 0.999 21.93 3.20 Term + 83089 83304 216 0 0 118 52 305 0.997 27.04 3.21 PlyA + 84114 84119 6 1.05 4.03 PlyA - 84794 84789 6 1.05 4.02 Term - 85509 85417 93 0 0 58 38 105 0.554 0.33 4.01 Init - 88956 88846 111 0 0 75 100 97 0.702 9.81 4.00 Prom - 91589 91550 40 -4.76 5.00 Prom + 99830 99869 40 -7.66 5.01 Sngl + 100001 100330 330 1 0 85 43 405 0.996 29.72 5.02 PlyA + 100505 100510 6 1.05 6.00 Prom + 101058 101097 40 -5.96 6.01 Init + 106051 106113 63 0 0 54 87 63 0.023 3.95 6.02 Term + 114664 114867 204 0 0 53 36 127 0.581 1.37 6.03 PlyA + 115653 115658 6 1.05 7.00 Prom + 123940 123979 40 -1.46 7.01 Sngl + 132839 133366 528 1 0 83 36 235 0.672 12.06 7.02 PlyA + 133680 133685 6 1.05 8.00 Prom + 146338 146377 40 -5.76 8.01 Sngl + 146453 146947 495 1 0 82 47 558 0.983 47.15 8.02 PlyA + 146967 146972 6 1.05 9.04 PlyA - 149250 149245 6 1.05 9.03 Term - 172640 172538 103 1 1 83 47 101 0.049 3.25 9.02 Intr - 192834 192803 32 0 2 89 96 41 0.286 1.93 9.01 Intr - 199888 199779 110 2 2 87 51 52 0.225 1.40 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 186370 186608 239 2 2 127 41 112 0.898 6.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_1|506_aa MHGSCSFLMLLLPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDMLHMRWD EELAAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDVPLAMEEWHHEREHYNLSAATC SPGQMCGHYTQVVWAKTERIGCGSHFCEKLQGVEETNIELLVCNYEPPGNVKGKRPYQEG TPCSQCPSGYHCKNSLCGESTGGWPPTRSHFGALSFQVAGFQPFKGRMLESLAASGGPAR EPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMGTPSSLATGIPAFLVTEVSGSLATKAL PAVETQAPTSLATKDPPSMATEAPPCVTTEVPSILAAHSLPSLDEEPVTFPKSTHVPIPK SADKVTDKTKVPSRSPENSLDPKMSLTGARELLPHAQEEAEAEAELPPSSEVLASVFPAQ DKPGELQATLDHTGHTSSKSLPNFPNTSATANATGGRALALQSSLPGAEGPDKPSVVSGL NSGPGHVWGPLLGLLLLPPLVLAGIF >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_1|1521_bp atgcacggctcctgcagtttcctgatgcttctgctgccgctactgctactgctggtggcc accacaggccccgttggagccctcacagatgaggagaaacgtttgatggtggagctgcac aacctctaccgggcccaggtatccccgacggcctcagacatgctgcacatgagatgggac gaggagctggccgccttcgccaaggcctacgcacggcagtgcgtgtggggccacaacaag gagcgcgggcgccgcggcgagaatctgttcgccatcacagacgagggcatggacgtgccg ctggccatggaggagtggcaccacgagcgtgagcactacaacctcagcgccgccacctgc agcccaggccagatgtgcggccactacacgcaggtggtatgggccaagacagagaggatc ggctgtggttcccacttctgtgagaagctccagggtgttgaggagaccaacatcgaatta ctggtgtgcaactatgagcctccggggaacgtgaaggggaaacggccctaccaggagggg actccgtgctcccaatgtccctctggctaccactgcaagaactccctctgtggtgagtcc acgggtggatggccccccacgcgcagccactttggcgccctgtcgttccaagtggccgga tttcaacccttcaaagggaggatgttagaaagtctggcggcttcgggggggcccgcgcga gaacccatcggaagcccggaagatgctcaggatttgccttacctggtaactgaggcccca tccttccgggcgactgaagcatcagactctaggaaaatgggtactccttcttccctagca acggggattccggctttcttggtaacagaggtctcaggctccctggcaaccaaggctctg cctgctgtggaaacccaggccccaacttccttagcaacgaaagacccgccctccatggca acagaggctccaccttgcgtaacaactgaggtcccttccattttggcagctcacagcctg ccctccttggatgaggagccagttaccttccccaaatcgacccatgttcctatcccaaaa tcagcagacaaagtgacagacaaaacaaaagtgccctctaggagcccagagaactctctg gaccccaagatgtccctgacaggggcaagggaactcctaccccatgcccaggaggaggct gaggctgaggctgagttgcctccttccagtgaggtcttggcctcagtttttccagcccag gacaagccaggtgagctgcaggccacactggaccacacggggcacacctcctccaagtcc ctgcccaatttccccaatacctctgccaccgctaatgccacgggtgggcgtgccctggct ctgcagtcgtccttgccaggtgcagagggccctgacaagcctagcgtcgtgtcagggctg aactcgggccctggtcatgtgtggggccctctcctgggactactgctcctgcctcctctg gtgttggctggaatcttctga >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_2|389_aa MAGAGAGAGARGGAAAGVEARARDPPPAHRAHPRHPRPAAQPSARRMDGGSGGLGSGDNA PTTEALFVALGAGVTALSHPLLYVKLLIQVGHEPMPPTLGTNVLGRKVLYLPSFFTYAKY IVQVDGKIGLFRGLSPRLMSNALSTVTRGSMKKVFPPDEIEQVSNKDDMKTSLKKVVKET SYEMMMQCVSRMLAHPLHVISMRCMVQFVGREAKYSGLIPHLLGDVVFLWGCNLLAHFIN AYLVDDSVSDTPGGLGNDQNPGSQARIRLWVSGCMSVLALTACQALDVDQADTGSALEAH SPFSQALAIRSYTKFVMGIAVSMLTYPFLLVGDLMAVNNCGLQAGLPPYSPVFKSWIHCW KYLSVQGQLFRGSSLLFRRVSSGSCFALE >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_2|1170_bp atggcgggagccggagctggagccggagctcgcggcggagcggcggcgggggtcgaggct cgagctcgcgatccaccgcccgcgcaccgcgcacatcctcgccaccctcggcctgcggct cagccctcggcccgcaggatggatggcgggtcagggggcctggggtctggggacaacgcc ccgaccactgaggctcttttcgtggcactgggcgcgggcgtgacggcgctcagccatccc ctgctctacgtgaagctgctcatccaggtgggtcatgagccgatgccccccacccttggg accaatgtgctggggaggaaggtcctctatctgccgagcttcttcacctacgccaagtac atcgtgcaagtggatggtaagatagggctgttccgaggcctgagtccccggctgatgtcc aacgccctctctactgtgactcggggtagcatgaagaaggttttccctccagatgagatt gagcaggtttccaacaaggatgatatgaagacttccctgaagaaagttgtgaaggagacc tcctacgagatgatgatgcagtgtgtgtcccgcatgttggcccaccccctgcatgtcatc tcaatgcgctgcatggtccagtttgtgggacgggaggccaagtacagtggattaatccct cacctcctgggcgatgtggttttcttgtggggctgtaacctgctggcccacttcatcaat gcctacctggtggatgacagcgtgagtgacaccccaggggggctgggaaacgaccagaat ccaggttcccaggccaggattcgtctttgggtttcagggtgcatgtctgtcttagcactg actgcgtgccaagccctggatgttgatcaggcagacacgggctcagcccttgaggctcac agtccgttcagccaggccctggccatccggagctataccaagttcgtgatggggattgca gtgagcatgctgacctaccccttcctgctagttggcgacctcatggctgtgaacaactgc gggctgcaagctgggctccccccttactccccagtgttcaaatcctggattcactgctgg aagtacctgagtgtgcagggccagctcttccgaggctccagcctgcttttccgccgggtg tcatcaggatcatgctttgccctggagtaa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_3|801_aa MKGASEEKLASVSNLVTVFENSRTPEAAPRGQRLEDVHHRPECRPPESPGPREKTNVGEA VGSEPRTVSRRYLNSLKNKLSSEAWRKSCQPVTLSGSGTQEPEKKIVQELLETEQAYVAR LHLLDQAMSDLSWRLQVFFQELLKTARSSKAFPEDVVRVIFSNISSIYQFHSQFFLPELQ RRLDDWTANPRIGDVIQKLAPFLKMYSEYVKNFERAAELLATWTDKSPLFQEVLTRIQSS EASGSLTLQHHMLEPVQRIPRYELLLKEYIQKLPAQAPDQADAQKALDMIFSAAQHSNAA ITEMERLQDLWEVYQRLGLEDDIVDPSNTLLREGPVLKISFRRNDPMERYLFLFNNMLLY CVPRVIQVGAQFQVRTRIDVAGMKMHRNFFNQSSAERHLDCFQLSAAANEAAVNSLVREL MDAEFPHSFLVSGKQRTLELQARSQEEMISWMQAFQAAIDQIEKRNETFKAAAQGPEGDI QEQELQSEELGLRAPQWVRDKMVTMCMRCQEPFNALTRRRHHCRACGYIFVPAEPVVCAR CSDYRAELKYDDNRPNRVCLHCYAFLTGNVLPEAKEDKRRGILEVRATVPALTIRPLFRE CVPGNHAGLTLSSAEEQASHDPRPLATAQAVWSLSPNGGTLWLQAVVHILLHFLYVPIRY PRVPRAGVTASILQMGTLTLQKGSSATPDQSLMCSFLQLIGDKWGKSGPRGWCVIPRDDP LVLYVYAAPQDMRAHTSIPLLGYQVTVGPQGDPRVFQLQQSGQLYTFKAETEELKGRWVK AMERAASGWSPSWPNDGDLSD >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_3|2406_bp atgaagggggcaagtgaggagaagctggcatctgtgtccaacctggtcactgtgtttgag aatagcaggaccccagaagcagcacccagaggccagaggctagaggacgtgcatcaccgc cctgagtgcaggcctcccgagtccccaggaccacgggagaagacgaatgtcggggaggcc gtggggtctgagcccaggacagtcagcaggaggtacctgaactccctgaagaacaagctg tccagcgaagcctggaggaaatcttgccagcctgtgaccctctcaggatcggggacgcag gagccagagaagaagatcgtccaggagctgctggagacagagcaggcctatgtggcgcgc ctccacctgctagaccaggccatgagtgacctgtcgtggcggctacaggtgtttttccag gagctgctgaagacagcccgcagcagcaaggccttcccagaggatgtggtcagggtcatc ttctccaacatctcctccatctatcagttccattctcagttcttcctcccagagctgcag cggcgcctggacgactggacagctaacccccgcatcggtgacgtgatccagaagctggcc cccttcctgaagatgtacagtgagtatgtcaagaactttgagcgagcggctgagctgctg gccacctggaccgacaagtctccactcttccaggaggttctcactcgcatccagagcagc gaggcttcgggcagcctgaccctgcagcaccacatgctggaaccagtgcagagaattcca cgttacgagctgctgctcaaggagtacatccagaagctgccagcccaggccccagaccag gccgatgcccagaaagccctggacatgatcttctcagctgcccagcactccaatgcagcc atcactgagatggagcggctgcaggacctgtgggaggtgtaccagcgcctgggcctcgag gacgacatagtagacccctctaacaccctgctccgtgagggcccggtcctcaagatctcc ttccgccgcaacgaccccatggagcgctaccttttcttgttcaacaacatgctgctctac tgtgtgcccagggtgatccaggtgggcgcccagttccaggtgaggacccgcatcgatgtg gccgggatgaagatgcaccggaatttcttcaaccagtcttctgctgagagacacttagat tgttttcagttgtcagcagctgcaaatgaggctgcagtgaatagcctggtgcgggagctg atggatgctgagtttccccactccttcctggtgtccgggaagcagcgcaccctggagctg caagcccggtcccaggaggaaatgatttcctggatgcaggccttccaagcagccattgac caaatcgagaagcggaatgaaaccttcaaggctgcggcccaggggcctgagggagacatc caggagcaggagctgcagtctgaggagctgggcctccgggcaccgcagtgggtccgggac aagatggtgaccatgtgcatgcgctgccaggagcccttcaacgctctgacgcgccgtcgc caccactgccgggcctgcggctatatcttcgtgcctgctgagccagtggtgtgtgccagg tgctccgactaccgggccgaactgaaatacgacgacaacaggcccaaccgagtctgcctc cactgctacgcattcctcactggaaatgtgctgcctgaggccaaggaggacaagaggcgg ggcatcctggaggtgagggccactgtccccgcgctcaccatccgtcctctgttcagggaa tgtgtgcccggcaaccatgctgggctgacactgtccagtgctgaggaacaagccagccat gaccccaggcctctcgccacagcccaggcagtgtggagcctgtcaccaaatggaggcacc ctttggctgcaggctgtcgtccacattcttctgcacttcctctatgtccccatccgatat cccagggtgccaagagctggtgtcaccgcttccattttacagatgggaacactgacgctc cagaaagggtcctcagccacgcctgaccagagcctgatgtgcagcttcctgcagctcatc ggggacaagtggggcaagagcggcccccggggctggtgtgtgatccctcgggatgacccc ctcgtgctctatgtctatgctgcccctcaggacatgagggctcacacctccatccccctg ctgggctaccaggtgactgttgggccccagggggaccctcgggtcttccagctacagcag tcaggccagctctacaccttcaaggccgagacggaggagctgaagggccgctgggtgaag gccatggagcgggcggccagtggctggagccccagctggcccaacgatggggacctgtcc gactga >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_4|67_aa MPIFMFANFHGGNTLIVAIFKKKQVLVSSHEGGTGSKIKPLEKLRYQCCRIDSAVTEPAA FSIIGEF >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_4|204_bp atgccaattttcatgtttgccaatttccatggtggaaatactctcattgtggccatcttc aaaaagaagcaggtcctggtatcaagccatgagggcgggacaggcagtaagattaaaccc ctggaaaagcttaggtatcagtgctgcagaatcgactctgctgtcaccgagccagctgcc ttcagcatcattggcgagttttaa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_5|109_aa MAAVGVFSVSRLLGRSRPQLGRPMSSGAHGEEGSARMWKTLTFFVALPGVAVSMLNVYLK SHHGEHERPEFIAYPHLRIRTKPFPWGDGNHTLFHNPHVNPLPTGYEDE >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_5|330_bp atggcggcagttggtgtgttctcggtttctcggctgctgggtcggtcccgcccacagctg gggcggcctatgtcgagtggcgcccatggagaagagggctcagctcgcatgtggaagacc ctcaccttcttcgtcgcgctccccggggtggcagtcagcatgctgaatgtgtacctgaag tcgcaccacggagagcacgagagacccgagttcatcgcctacccccatctccgcatcagg accaagccgtttccctggggagatggtaaccatactctattccataaccctcatgtgaat ccacttccaactggctacgaagatgaataa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_6|88_aa MPLLIPDISKGSCCGKGDSAQESNNTRGSRAILKQKCDYNAFCTGYKPSPAPESRVTLTS VLFLASWVVFLPTDLANPRRLALFSLLT >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_6|267_bp atgcctctcctcatccctgatatatccaagggcagctgctgtggcaagggggacagtgca caggagagcaacaacacccggggatcaagggccatcctcaaacaaaagtgtgattataac gcgttctgcacgggctataaaccgtccccagctccggagagcagggtgaccctcaccagc gtcttatttctggcctcctgggttgtgttcctgcctactgacttggcaaatcctaggcga ctcgccttgttttctctcctcacttaa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_7|175_aa MVLRLLPVPALPKSKSTVLGVLSTCLSARPAPFSLKGLCQPGHDSAASRGARASPASKQH TVCGQPPTTFPLPDVSEEPWLPWSPHGQPCPDMPLPLCCPALHSVSRRPDVPALVLAALN EVQMCSLARASSGCYQRRGRPCEGAARLQLAAAVLEEECSGKDSHWNVGSRDVSA >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_7|528_bp atggtcctgagactgctcccagttccagccctccccaagagcaagagcactgtgcttgga gtcctgagtacctgtctctctgccagacctgcccccttcagcctaaaaggcctctgccag ccagggcacgactctgctgcatccagaggggccagagccagccccgccagcaagcagcac accgtctgcggccagccgcccaccaccttcccacttccagatgtctcagaggagccctgg ctgccttggagcccgcatgggcagccctgtcctgacatgcccctgcccctttgctgccct gccttgcactctgtctcgcggcggccagatgtgccggctttggtgctggcggctcttaat gaggtccagatgtgcagcttggcccgggccagcagcggctgctaccaaaggagagggagg ccctgtgagggagcagccaggctgcagctggccgcggcggtgctggaggaggaatgctct gggaaggattcccactggaatgtgggctctcgggatgtctctgcctaa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_8|164_aa MWPKFNPSEIKVVYLRCTGGEVSATSALGPKIGPLDLSPKKVGDDIAKATGDWKGLRITV KLTIENRQAQIEVVPSASALIIKALKEPRDRKKQKNIKHSGNITFDEIVNIAPRMRHRSL ARDLTGTIKEILGTAQSVGCNVDGRHPHDIIDDINSGAVECPAS >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_8|495_bp atgtggccgaagttcaaccccagcgagatcaaagtcgtatacctgaggtgcactgggggt gaagtcagtgccacgtctgcgctgggccccaagatcggccccctggacctgtctccaaaa aaggttggtgatgacattgccaaggcaacgggtgactggaagggcctgaggattacagtg aaactgaccattgagaacagacaggcccagattgaggtggtgccttctgcttctgccctg atcatcaaagcccttaaggaaccaagagacagaaagaaacagaaaaacattaaacacagt gggaatatcacttttgatgagatcgtcaacattgctccacggatgcggcaccgatcttta gccagagatctcactggaaccattaaagagatcctggggactgcccagtctgtgggctgc aatgttgatggccgccaccctcatgacatcatagatgacatcaacagtggtgctgtggaa tgcccagctagttaa >gi568815592f:36944860_37145186|GENSCAN_predicted_peptide_9|81_aa XKVRIPAGALDQQHLSTEVAVFDALLSSRDKINNLPNGEPCEDEVGLRYSTILTMLILAA IILTQVFILTCELLHGSHSGY >gi568815592f:36944860_37145186|GENSCAN_predicted_CDS_9|246_bp ntaaaagtacgaattccagctggagctcttgaccaacaacatctttctacagaagttgct gtcttcgatgctttgctgagttccagagataagataaataatttaccaaatggggagccc tgtgaagatgaagttggtttaagatattctacgattttgaccatgctcattctcgctgcc atcatcctcacccaagttttcatcctcacatgtgaattattgcatggcagtcactcaggt tattaa