GENSCAN 1.0 Date run: 6-Nov-116 Time: 17:05:16 Sequence gi568815593r:149274372_149477026 : 202655 bp : 48.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 921 916 6 -1.75 1.01 Sngl - 1137 925 213 0 0 56 54 229 0.756 11.48 1.00 Prom - 6194 6155 40 -4.26 2.00 Prom + 12922 12961 40 -3.66 2.01 Init + 14447 14504 58 1 1 69 63 77 0.661 4.77 2.02 Intr + 25138 25266 129 2 0 129 71 232 0.999 26.27 2.03 Intr + 25900 25983 84 2 0 114 111 105 0.542 15.19 2.04 Intr + 26762 26859 98 0 2 72 92 94 0.903 7.93 2.05 Intr + 28047 28155 109 2 1 125 98 128 0.993 17.36 2.06 Intr + 31935 32033 99 1 0 48 95 151 0.998 11.88 2.07 Intr + 33031 33242 212 2 2 48 105 346 0.986 30.93 2.08 Intr + 35585 35764 180 1 0 76 100 238 0.999 23.86 2.09 Intr + 37741 37833 93 0 0 108 71 88 0.968 9.26 2.10 Intr + 39197 39283 87 1 0 81 56 68 0.820 3.07 2.11 Intr + 41450 41543 94 1 1 94 70 71 0.966 5.44 2.12 Intr + 41780 41932 153 0 0 111 52 238 0.882 22.54 2.13 Intr + 43358 43569 212 0 2 83 78 359 0.708 33.03 2.14 Intr + 45211 45356 146 0 2 102 95 236 0.999 24.78 2.15 Intr + 46020 46092 73 0 1 97 92 44 0.996 5.11 2.16 Intr + 48235 48346 112 0 1 116 81 97 0.998 11.75 2.17 Intr + 55295 55459 165 0 0 111 82 225 0.957 24.13 2.18 Intr + 58324 58502 179 2 2 63 84 291 0.991 25.84 2.19 Term + 59153 59269 117 1 0 56 31 96 0.705 -0.76 2.20 PlyA + 59699 59704 6 1.05 3.00 Prom + 60625 60664 40 -3.06 3.01 Init + 71169 71245 77 2 2 108 82 142 0.996 14.17 3.02 Intr + 73901 74054 154 2 1 115 79 150 0.982 16.87 3.03 Intr + 75283 75364 82 0 1 77 77 44 0.995 1.41 3.04 Term + 76547 76911 365 0 2 76 49 307 0.991 20.33 3.05 PlyA + 78268 78273 6 1.05 4.00 Prom + 83063 83102 40 -2.86 4.01 Init + 83698 83785 88 0 1 112 58 277 0.959 25.90 4.02 Intr + 88266 88472 207 1 0 52 92 258 0.504 21.65 4.03 Intr + 89665 89839 175 2 1 104 75 248 0.999 24.10 4.04 Intr + 91571 91782 212 2 2 79 115 315 0.998 31.86 4.05 Intr + 92989 93129 141 2 0 66 109 59 0.970 6.12 4.06 Term + 93622 94283 662 2 2 97 55 986 0.742 90.27 4.07 PlyA + 94726 94731 6 1.05 5.04 PlyA - 95999 95994 6 1.05 5.03 Term - 100229 99998 232 1 1 110 41 352 0.999 28.65 5.02 Intr - 102543 102365 179 0 2 83 96 311 0.628 30.12 5.01 Init - 104854 104834 21 1 0 81 92 24 0.401 0.25 5.00 Prom - 109904 109865 40 -6.76 6.00 Prom + 114993 115032 40 -5.66 6.01 Init + 117322 117396 75 0 0 87 70 53 0.546 4.56 6.02 Intr + 117599 117716 118 1 1 102 90 25 0.783 4.14 6.03 Intr + 130716 130763 48 1 0 81 64 47 0.062 0.25 6.04 Intr + 132670 132707 38 2 2 71 121 38 0.097 3.38 6.05 Intr + 146110 146232 123 0 0 105 37 91 0.714 6.48 6.06 Term + 146310 146459 150 2 0 103 53 123 0.476 8.21 6.07 PlyA + 151263 151268 6 1.05 7.04 PlyA - 151862 151857 6 1.05 7.03 Term - 157133 157084 50 0 2 84 54 62 0.133 -0.13 7.02 Intr - 172985 172851 135 0 0 119 60 13 0.027 2.14 7.01 Intr - 197723 197551 173 1 2 66 78 102 0.040 6.59 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 118899 118942 44 1 2 117 49 27 0.823 -0.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_1|70_aa MKKEERRKKEEEEEEEEEEEEETPEIFLWMHREEATKGHGKKCPYKPGREVPPDTNPDCT LILDFQHLEL >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_1|213_bp atgaagaaagaggaaagaaggaagaaagaagaagaagaagaggaagaggaagaagaagaa gaagaaacaccagagatctttctctggatgcacagagaagaggccacgaagggacatggc aaaaagtgtccctacaagccaggaagagaggtcccacccgacactaaccctgactgcacc ttgatcttagacttccagcatctggaactgtga >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_2|799_aa MPQVTALLVEDKNLGRADAMLEQLLPELTGLLSLLDHEYLSDTTLEKKMAVASILQSLQP LPAKEVSYLYVNTADLHSGPSFVESLFEEFDCDLSDLRDMPEDDGEPSKGASPELAKSPR LRNAADLPPPLPNKPPPEDYYEEALPLGPGKSPEYISSHNGCSPSHSIVDGYYEDADSSY PATRVNGELKSSYNDSDAMSSSYESYDEEEEEGKSPQPRHQWPSEEASMHLVRECRICAF LLRKKRFGQWAKQLTVIREDQLLCYKSSKDRQPHLRLALDTCSIIYVPKDSRHKRHELRF TQGATEVLVLALQSREQAEEWLKVIREVSKPVGGAEGVEVPRSPVLLCKLDLDKARILST CLTHIRGTPGTVLATEMQSKHKQRLSQEKQTSDSDSVGVGDNCSTLGRRETCDHGKGKKS SLAELKGSMSRAAGRKITRIIGFSKKKTLADDLQTSSTEEEVPCCGYLNVLVNQGWKERW CRLKCNTLYFHKDHMDLRTHVNAIALQGCEVAPGFGPRHPFAFRILRNRQEVAILEASCS EDMGRWLGLLLVEMGSRVTPEALHYDYVDVETLTSIVSAGRNSFLYARSCQNQWPEPRVY DDVPYEKMQDEEPERPTGAQVKRHASSCSEKSHRVDPQVKVKRHASSANQYKYGKNRAEE DARRYLVEKEKLEKEKETIRTELIALRQEKRELKEAIRSSPGAKLKALEEAVATLEAQCR AKEERRIDLELKLVAVKERLQQSLAGGPALGLSVSSKPKSGQLSEEDTLTSNGALSERTS LTSSTPALLNPNTTDILDQ >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_2|2400_bp atgccccaagtgacagccctcctggtggaggacaagaacctgggcagagctgatgcgatg ctggagcagctgctcccagagctcaccgggctgctcagcctcctggaccacgagtacctc agcgataccaccctggaaaagaagatggccgtggcctccatcctgcagagcctgcagccc cttccagcaaaggaggtctcctacctgtatgtgaacacagcagacctccactcggggccc agcttcgtggaatccctctttgaagaatttgactgtgacctgagtgaccttcgggacatg ccagaggatgatggggagcccagcaaaggagccagccctgagctagccaagagcccacgc ctgagaaacgcggccgacctgcctccaccgctccccaacaagcctccccctgaggactac tatgaagaggcccttcctctgggacccggcaagtcgcctgagtacatcagctcccacaat ggctgcagcccctcacactcgattgtggatggctactatgaggacgcagacagcagctac cctgcaaccagggtgaacggcgagcttaagagctcctataatgactctgacgcaatgagc agctcctatgagtcctacgatgaagaggaggaggaagggaagagcccgcagccccgacac cagtggccctcagaggaggcctccatgcacctggtgagggaatgcaggatatgtgccttc ctgctgcggaaaaagcgtttcgggcagtgggccaagcagctgacggtcatcagggaggac cagctcctgtgttacaaaagctccaaggatcggcagccacatctgaggttggcactggat acctgcagcatcatctacgtgcccaaggacagccggcacaagaggcacgagctgcgtttc acccagggggctaccgaggtcttggtgctggcactgcagagccgagagcaggccgaggag tggctgaaggtcatccgagaagtgagcaagccagttgggggagctgagggagtggaggtc cccagatccccagtcctcctgtgcaagttggacctggacaaggccaggatcctgagcacc tgcttgacacatatccggggcaccccaggaactgtgctagccactgaaatgcagtccaaa cacaaacagaggctgtcccaagagaagcagacctcagattctgacagcgtgggtgtgggt gacaactgttctacccttggccgccgggagacctgtgatcacggcaaagggaagaagagc agcctggcagaactgaagggctcaatgagcagggctgcgggccgcaagatcacccgtatc attggcttctccaagaagaagacactggccgatgacctgcagacgtcctccaccgaggag gaggttccctgctgtggctacctgaacgtgctggtgaaccagggctggaaggaacgctgg tgccgcctgaagtgcaacactctgtatttccacaaggatcacatggacctgcgaacccat gtgaacgccatcgccctgcaaggctgtgaggtggccccgggctttgggccccgacaccca tttgccttcaggatcctgcgcaaccggcaggaggtggccatcttggaggcaagctgttca gaggacatgggtcgctggctcgggctgctgctggtggagatgggctccagagtcactccg gaggcgctgcactatgactacgtggatgtggagaccttaaccagcatcgtcagtgctggg cgcaactccttcctatatgcaagatcctgccagaatcagtggcctgagccccgagtctat gatgatgttccttatgaaaagatgcaggacgaggagcccgagcgccccacaggggcccag gtgaagcgtcacgcctcctcctgcagtgagaagtcccatcgtgtggacccgcaggtcaaa gtcaaacgccacgcctccagtgccaatcaatacaagtatggcaagaaccgagccgaggag gatgcccggaggtacttggtagaaaaagagaagctggagaaagagaaagagacgattcgg acagagctgatagcactgagacaggagaagagggaactgaaggaagccattcggagcagc ccaggagcaaaattaaaggctctggaagaagccgtggccaccctggaagctcagtgtcgg gcaaaggaggagcgccggattgacctggagctgaagctggtggctgtgaaggagcgcttg cagcagtccctggcaggagggccagccctggggctctccgtgagcagcaagcccaagagt gggcaactctctgaggaagatacgctcacctccaatggtgctctctcagagagaacttct ctgacctcatctacaccagcgcttctcaaccccaacactactgacattttggaccagtaa >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_3|225_aa MAVRSLWAGRLRVQRLLAWSAAWESKGWPLPFSTATQRTAGEDCRSEDPPDELGPPLAER ALRVKAVKLEKEVQDLTVRYQRAIADCENIRRRTQRCVEDAKIFGIQSFCKDLVEVADIL EKTTECISEESEPEDQKLTLEKVFRGLLLLEAKLKSVFAKHGLEKLTPIGDKYDPHEHEL ICHVPAGVGVQPGTVALVRQDGYKLHGRTIRLARVEVAVESQRRL >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_3|678_bp atggccgtacggtcgctgtgggcgggccggctgcgggtgcagcgcctactggcctggagt gccgcgtgggagagcaagggatggccgcttccattcagcactgccacccagagaactgct ggtgaggactgccgttctgaggaccctcctgatgagcttgggccccctcttgctgaacga gccttaagggtaaaagctgttaaactggagaaagaagtccaagatttaacagtgagatac cagagagctatagctgattgtgaaaacataaggaggcgaacccagagatgtgtggaagac gccaagatatttggaatccagagtttctgtaaggacttggtggaggtggctgacattttg gagaagactacagagtgcatttctgaagaatcggagcctgaggaccaaaagctcactctg gagaaggtcttccgagggttgttgcttttagaagcaaagctgaaaagtgtgtttgccaag catggcctggagaaactgacacccattggtgacaaatatgacccccatgagcatgaactc atctgtcatgtgccagctggtgttggggtgcagcctggcaccgtggcattagtaagacaa gatggctacaaacttcatggccgcaccattaggcttgcccgagtggaagtggcagtggag tctcagagaagactgtga >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_4|494_aa MARAAPLLAALTALLAAAAAGGDAPPGKIAVVGAGIGGSAVAHFLQQHFGPRVQIDVYEK GTVGGRLATISVNKQHYESGAASFHSLSLHMQDFVKLLGLRHRREVVGRSAIFGGEHFML EETDWYLLNLFRLWWHYGISFLRLQMWVEEVMEKFMRIYKYQAHGYAFSGVEELLYSLGE STFVNMTQHSVAESLLQVGVTQRFIDDVVSAVLRASYGQSAAMPAFAGAMSLAGAQGSLW SVEGGNKLVCSGLLKLTKANVIHATVTSVTLHSTEGKALYQVAYENEVGNSSDFYDIVVI ATPLHLDNSSSNLTFAGFHPPIDDVQGSFQPTVVSLVHGYLNSSYFGFPDPKLFPFANIL TTDFPSFFCTLDNICPVNISASFRRKQPQEAAVWRVQSPKPLFRTQLKTLFRSYYSVQTA EWQAHPLYGSRPTLPRFALHDQLFYLNALEWAASSVEVMAVAAKNVALLAYNRWYQDLDK IDQKDLMHKVKTEL >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_4|1485_bp atggcccgcgcagccccgctgctcgccgcgttgaccgcgctcctcgccgccgccgctgct ggcggagatgccccgccgggcaaaatcgcggtggttggggctgggattgggggctctgct gtggcccattttctccagcagcactttggacctcgggtgcagatcgacgtgtacgagaag ggaaccgtgggtggccgcttggccaccatctcagtcaacaagcagcactatgagagcggg gctgcctccttccactccctgagcctgcacatgcaggacttcgtcaagctgctggggctg aggcaccggcgcgaggtggtgggcaggagcgccatcttcggcggggagcacttcatgctg gaggagactgactggtacctgctgaacctcttccgcctctggtggcactatggcatcagc ttcctgaggctgcagatgtgggtggaggaggtcatggagaagttcatgaggatctataag taccaggcccacggctatgccttctcgggtgtggaggagctgctctactcactgggggag tccacctttgttaacatgacccagcactctgtggctgagtccctgctgcaggtgggcgtc acgcagcgctttattgatgatgtcgtttctgctgtcctgcgggccagctatggccagtca gcagcgatgcccgcctttgcaggagccatgtcactagccggggcccaaggcagcctgtgg tctgtggaaggaggcaataagctggtttgttccggtttgctgaagctcaccaaggccaat gtgatccatgccacagtgacctctgtgaccctgcacagcacagaggggaaagccctgtac caggtggcgtatgagaatgaggtaggcaacagctctgacttctatgacatcgtggtcatc gccacccccctgcacctggacaacagcagcagcaacttaacctttgcaggcttccacccg cccattgatgacgtgcagggctctttccagcccaccgtcgtctccttggtccacggctac ctcaactcgtcctacttcggtttcccagaccctaagcttttcccctttgccaacatcctt accacagatttccccagcttcttctgcactctggacaacatctgccctgtcaacatctct gccagcttccggcgaaagcagccccaggaggcagctgtttggcgagtccagtcccccaag cccctctttcggacccagctaaagaccctgttccgttcctattactcagtgcagacagct gagtggcaggcccatcccctctatggctcccgccccacgctcccgaggtttgcactccat gaccagctcttctacctcaatgccctggagtgggcggccagctccgtggaggtgatggcc gtggctgccaagaatgtggccttgctggcttacaaccgctggtaccaggacctagacaag attgatcaaaaagatttgatgcacaaggtcaagactgaactgtga >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_5|143_aa MDWPHNLVPLDLVSRMKPYARMEEYERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRS LSPWGYSINHDPSRIPVDLPEARCLCLGCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPP PRTGPCRQRAVMETIAVGCTCIF >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_5|432_bp atggactggcctcacaacctggtgccactggacctggtgtcacggatgaaaccgtatgcc cgcatggaggagtatgagaggaacatcgaggagatggtggcccagctgaggaacagctca gagctggcccagagaaagtgtgaggtcaacttgcagctgtggatgtccaacaagaggagc ctgtctccctggggctacagcatcaaccacgaccccagccgtatccccgtggacctgccg gaggcacggtgcctgtgtctgggctgtgtgaaccccttcaccatgcaggaggaccgcagc atggtgagcgtgccggtgttcagccaggttcctgtgcgccgccgcctctgcccgccaccg ccccgcacagggccttgccgccagcgcgcagtcatggagaccatcgctgtgggctgcacc tgcatcttctga >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_6|183_aa MAGPSLSCQWRRVIELCAMEDGDWPLLIPLLSLPGCLEKLKISVTQASCTLQVGVHNGYN LHEGGLHGPPRPALMAMLKAGPKEPHLDQTPGKTCGGNPTALEPKESNGCNKCTGLSALR QGNRGQRGPSRKSWMWPIPSPSCFLEMRKSRKGHRGFRNDRGDDSFDVWSSSRPLPTPKR KLD >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_6|552_bp atggcaggcccgtcactgagctgtcagtggaggagggtgattgaactgtgtgcaatggag gacggcgactggccgcttctcatccccttactctctctcccaggctgcctggagaaactc aaaatttctgtgacacaggcctcttgtacattgcaagtgggagtacacaatggttacaac ctccatgagggaggactgcacgggccacctcgcccagccctcatggcaatgctgaaggca ggccccaaggagccacacctggaccagaccccaggaaagacatgtggtggaaaccctacg gccctggagcccaaggagagcaacggctgtaacaagtgcacaggtttgagtgcgctgaga caggggaaccgtgggcaaagaggtccctccagaaagagctggatgtggcccattcccagc ccctcgtgtttcttggaaatgagaaagtccaggaaagggcacaggggctttcgcaacgac cgtggcgatgatagctttgacgtgtggtcatccagccggccactgcccacacccaagaga aagctggattga >gi568815593r:149274372_149477026|GENSCAN_predicted_peptide_7|119_aa XLPPVNGHHPGRHPREGAMLQVKTALSCQSVVPGTCRSPSSPSPTVIGSLLGSWVREGGG EHNAQACLIGVMVMNLDLFMVQFGPLRDSPGTFANLSGGAVGRDLRKEGFEDEPACGSK >gi568815593r:149274372_149477026|GENSCAN_predicted_CDS_7|360_bp nnccttccccctgtgaatggacatcacccgggccggcacccccgagaaggtgctatgctg caggtaaagactgctttgtcctgccagagcgtggttcctggaacttgtcgcagtcccagt agtccttcccccacagtgatcggctccctcttgggttcctgggtgcgggaaggagggggt gagcacaatgctcaggcctgtctaattggagtcatggtgatgaatctggatttgttcatg gttcagtttgggccattaagagacagccctgggacttttgctaacttaagtggtggggcg gtggggagggatctacgtaaggagggctttgaggatgaaccagcctgtggaagtaaatga