GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:18:05

Sequence gi568815589f:877002_1091002 : 214001 bp : 43.20% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  16911  17194  284  1  2   77   80   227 0.694  17.94
 1.02 Intr +  17605  17697   93  0  0   47  116    21 0.611   0.96
 1.03 Intr +  39759  39906  148  2  1   69   78   103 0.023   7.21
 1.04 Term +  69800  69960  161  0  2   62   35   152 0.566   5.40
 1.05 PlyA +  72207  72212    6                               1.05

 2.00 Prom +  78399  78438   40                              -6.06
 2.01 Init +  85258  85358  101  0  2   52  100    62 0.817   3.55
 2.02 Intr +  87652  87783  132  1  0   72   71    50 0.392   1.46
 2.03 Intr +  91000  91134  135  1  0   62   46   143 0.394   7.18
 2.04 Intr +  92443  92726  284  1  2   47   13   110 0.095  -3.54
 2.05 Intr +  93821  94055  235  0  1   96   59   107 0.350   5.35
 2.06 Intr +  95182  95369  188  1  2  125   81    15 0.872   3.83
 2.07 Term +  95607  95860  254  1  2   67   48   144 0.409   4.10
 2.08 PlyA +  97236  97241    6                               1.05

 3.00 Prom +  99228  99267   40                              -4.56
 3.01 Init + 100001 100454  454  1  1   69  110   886 0.961  84.93
 3.02 Intr + 100594 100715  122  2  2   89   51    97 0.297   6.31
 3.03 Intr + 105925 106051  127  0  1   36   64   113 0.303   3.95
 3.04 Term + 113040 114004  965  1  2   82   39   925 0.934  79.22
 3.05 PlyA + 114142 114147    6                               1.05

 4.04 PlyA - 114229 114224    6                               1.05
 4.03 Term - 118276 118251   26  2  2  119   47   -19 0.074  -4.51
 4.02 Intr - 121797 121542  256  0  1   46   86   158 0.359   8.42
 4.01 Init - 132278 132234   45  2  0   56   81    73 0.168   2.17
 4.00 Prom - 143022 142983   40                              -3.46

 5.00 Prom + 149496 149535   40                              -3.26
 5.01 Init + 165647 165731   85  1  1   53   35   128 0.913   2.98
 5.02 Intr + 166046 166323  278  0  2   81   81    70 0.919   2.84
 5.03 Term + 168632 168970  339  1  0   77   47   191 0.568   8.44
 5.04 PlyA + 171385 171390    6                               1.05

 6.00 Prom + 172538 172577   40                              -1.36
 6.01 Init + 174613 175137  525  0  0  106   92   875 0.812  84.55
 6.02 Intr + 176721 176823  103  2  1  114   98    76 0.994  10.95
 6.03 Term + 179215 180272 1058  2  2   84   48   515 0.661  39.40
 6.04 PlyA + 180519 180524    6                               1.05

 7.05 PlyA - 181549 181544    6                               1.05
 7.04 Term - 191985 191456  530  1  2   35   36   192 0.324   3.12
 7.03 Intr - 193407 193368   40  0  1   23   83    88 0.019  -0.40
 7.02 Intr - 204880 204708  173  2  2  122  106   -39 0.210   0.96
 7.01 Intr - 205551 205342  210  1  0   48   87    89 0.177   3.68

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_1|228_aa
XGRMVIQDIPAVTSRGHVENTPDLVSDSTYYSSFYQPSLFPYYNNLYNCPQYSMALAADS
ASGEVGNPLGGSPVKNSLRGLPGPYVPGQTGNQWQKMLGKAFWVAVICVMNLHLGKETGH
NLYGRKQMKNMENRHAMSSQYRMHSYYPPPSYLGQSVPQFFTFEDAPSYPEARASGKIGE
YVLSVLMGVYVQVCKFTLAVVEIHTKEQCLTMQGLDSQTVPIKIPKSR

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_1|687_bp
nagggacgtatggtcatccaggatattcctgctgtcaccagcagagggcatgtggagaac
acacctgacctggtttcagactccacctactacagcagcttctaccagccgtctctgttt
ccttattacaacaatctatacaactgcccgcagtactccatggccttggctgctgattct
gcttctggggaggtgggaaatcccctcgggggatcccctgtgaagaacagccttcggggc
ctccccggaccttatgtgcctggtcagacaggaaaccagtggcagaaaatgctgggtaaa
gccttctgggtggcagtaatctgcgtgatgaatttgcaccttgggaaggaaacaggtcat
aatctgtatggaagaaagcagatgaagaacatggagaaccgccatgcaatgagctcccag
tacaggatgcattcttactacccgcctccctcttacctgggccagagcgtgccccagttc
ttcacttttgaggatgctccctcttacccggaagccagggcgagcggaaagattggggaa
tacgtcctttctgtgctaatgggtgtgtacgtccaggtctgcaagtttacactggcagta
gttgagattcacacaaaagagcagtgcctgacaatgcaaggactggattcacagacggtg
cccatcaagataccaaagagccgataa

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_2|442_aa
MSVGGKFPQLTEFGGKGAVWSITIADRQEGFFGSAAGPNSLPQAQEQYTNIFTDGVLSIS
GSIILPFFGVIPQSGITSSQDSGLVSLSSSSPISNKSTKAVLECEPASEPSSFTVTPVIE
EDERLSFIAQCFTSLVKEDHFITLVRLLLLQPCALRRAAVAVAVVTATAEAAAARGMEPD
RGVVTSKVFLVASWKMIPRPPCRRVRGPLPTAPPLSLADLTRGLTTHRDKQAAGFAPPPR
GPWLKFHLESCRPCATELGPRTPLVTLAFLLPQPPSYPQPGLPPLTTLGDPNRGQRAPRP
GLQAKFAPPVPGVSAARAVSLHYSRAPESERFPRSAQIRLLSPRLGFVQGQQRRRREAGG
QGLRRSLPGRCGEAADVPSRTHALLLLGVGSAAGAPDRELESDERLPPGPPGTAAQRGAP
QASAFPVPFESAGLRSRRWALG

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_2|1329_bp
atgtcagtgggaggaaagttcccacaactgactgagtttgggggcaagggagcagtctgg
agcatcaccattgctgacaggcaagaagggttttttggaagtgcagctggccccaactcc
cttccccaagcacaagagcagtacactaacatcttcactgacggtgtgttgagcatctca
ggctcaattattcttcctttcttcggggttattccccagagtggaattaccagcagtcaa
gattctggcttggtttccctctcgagcagctctcctattagtaacaagagcacaaaggca
gtgcttgaatgtgagcctgcgtcggagcccagcagcttcacagtcactcccgtcatcgag
gaggacgaaagattgagttttattgcccagtgctttacgagtttagttaaggaagatcat
tttatcacacttgtcaggctgctgctactgcagccgtgtgcccttcggcgggcggctgta
gctgtagctgttgtgacggctacggcggaggctgcggccgcgcggggaatggagccggac
cgcggagtcgtcacctccaaggtgtttctagtggcctcctggaagatgatcccgcgccca
ccttgccggcgtgttcgcgggcccctgcccactgccccccctctttctttagcggacttg
acccgcgggctgacaacccaccgcgacaagcaggcggctgggttcgcgccgccgccccgg
ggcccttggctcaaatttcacctcgagtcctgcagaccctgcgccactgaattggggccc
aggacgcccttggtgacactcgccttcttgctgccacaaccaccgtcatacccgcagccg
gggctccctccgctaaccacgcttggagaccccaatcggggacagagggcgcctcggcct
ggcctccaggcaaagttcgcgccccctgttcctggggtgtcggccgcgcgggccgtttcc
cttcattactcccgggcccctgaatccgaacgctttcccagaagcgcgcaaatccgcttg
ctttccccgcggctgggctttgttcagggacagcaaaggaggaggcgggaggctggtgga
cagggtctgcgacgctcccttccaggacggtgtggggaagcggccgacgtccccagccgg
actcacgccctcctactactgggcgtcggctccgccgcgggcgctcccgacagggagctg
gagtcggacgagcggctgcccccagggcctccaggaaccgcggcccagcggggagcgccc
caggctagcgcttttccagttcccttcgaaagcgcggggctgaggtcgcggcgctgggcc
ctcggatga

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_3|555_aa
MNGYGSPYLYMGGPVSQPPRAPLQRTPKCARCRNHGVLSWLKGHKRYCRFKDCTCEKCIL
IIERQRVMAAQVALRRQQANESLESLIPDSLRALPGPPPPGDAVAAPQPPPASQPSQPQP
PRPAAELAAAAALRWTAEPQPGALQAQLAKPGRGFLSREKPAESGPSASGTRARREGASG
QPNDGTVRQYRQSCVALMACARVLMAEVEDAVIGQLVLTGPPPCDCVSYVPHRLNLTEER
LGDGKSADNTEVFSDKDTDQRSSPDVAKSKGCFTPESPEIVSVEEGGYAVQKNGGNPESR
PDSPKCHAEQNHLLIEGPSGTVSLPFSLKANRPPLEVLKKIFPNQKPTVLELILKGCGGD
LVSAVEVLLSSRSSVTGAERTSAEPESLALPSNGHIFEHTLSSYPISSSKWSVGSAFRVP
DTLRFSADSSNVVPSPLAGPLQPPFPQPPRYPLMLRNTLARSQSSPFLPNDVTLWNTMTL
QQQYQLRSQYVSPFPSNSTSVFRSSPVLPARATEDPRISIPDDGCPFVSKQSIYTEDDYD
ERSDSSDSRTLNTSS

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_3|1668_bp
atgaacggctacggctccccctacctgtacatgggcggcccggtgtcgcagccgccacgg
gcgcccctgcagcgcacgcccaagtgcgcgcgctgccgcaaccatggcgtcctgtcctgg
ctcaagggccacaagcgttactgccgcttcaaggactgcacctgcgagaagtgcatcctc
atcatcgagcggcagcgggtcatggctgcgcaggtggcgctgcgccggcagcaggccaac
gagagcttggagagcctcatccccgactcgctgcgcgctctgccagggcccccgccgccg
ggggacgccgtcgccgccccgcagccgccgccagcctctcagccgtcgcagccgcagccg
ccgcgccctgctgccgagttggccgcggccgccgcgctgcgttggactgccgagccgcag
cccggggctctgcaggcgcagctcgccaagccaggccggggcttcctctcccgggagaag
ccggctgagtctggacccagcgcgagtggcactcgcgcccgcagggaaggcgcctccggg
cagccaaacgacgggactgtgcgtcagtatcgccagtcctgtgtggctctgatggcttgt
gcacgggtgctcatggcagaggtggaggatgcagtgataggacagctggtcctcacaggt
ccaccgccttgtgattgtgtcagttacgtcccacatcgccttaatttgactgaagaacga
cttggagacggcaagtcggcagacaatacagaggtcttcagtgacaaagacactgaccag
aggagttccccagatgtggcaaagagtaagggctgcttcacccctgagagccctgagata
gtgtccgtggaggaagggggatacgctgtccagaaaaacggaggcaaccccgagagccgc
cctgacagccccaagtgtcacgcggagcagaatcacctcctgattgagggcccctcgggg
actgtttctctgcccttcagcttgaaagccaacagaccgccgcttgaagtgttaaaaaag
atattccccaaccagaagccaacggtgcttgagctcatcctcaagggctgtggcggggac
ctggtgagcgccgtggaagtccttctgtccagccgatcctcagtcacgggagcagagcga
acttccgcagaacctgagagtctagcgttgccctccaatgggcacatctttgaacacacc
ttgagctcctaccccatctcgtcttccaaatggtctgtgggatcagcctttcgagtccca
gacacgttgaggttttctgccgactctagcaacgttgtccccagtcccttggctgggcct
ctgcagccccctttcccccagccaccccggtacccgctgatgctgaggaatactttggcg
agaagccagtcgagcccctttttgcccaatgatgtcaccctgtggaacaccatgacgctg
cagcagcagtatcagctgaggtcccagtatgtcagtcctttccccagtaactctaccagc
gtcttcagaagctcgcccgtccttcctgcccgcgccacggaagaccctcggatttccatc
cctgatgatgggtgtccatttgtgtcaaagcagtccatttacaccgaggacgactatgac
gagaggtctgactcctcagactctagaacactcaacacatcatcttaa

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_4|108_aa
MAPLARGAQSVRPVLPPGGSRAGKNIHAGGSGKRKEGKRRLHAFTSTSLQGAAEPGRTFT
QAAPGNGRRENADFTHLPPLASRGQQSREEHSRRRLRETEGQARDLGS

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_4|327_bp
atggcgccattggctcgaggagcacagagtgtccgcccggtgctgcctccagggggcagc
agagccgggaagaacattcacgcaggcggctccgggaaacggaaggagggaaaacgccga
cttcacgcatttacctccactagcctccagggggcagcagagccgggaagaacattcacg
caggcggctccgggaaacggaaggagggaaaacgccgacttcacgcatttacctccacta
gcctccagggggcagcagagccgggaagaacattcacgcaggcggctccgggaaacggaa
gggcaagctagggaccttgggagttga

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_5|233_aa
MLSADRRVLPTAPPRGLLPLRPARRPAAEHRLLSGVSIIKEKISSRPLKFTWRLVTRSAL
EAGFGISLSPLSPAPPPHPRGETPEQAFCIFSPHLELQKIYGKVIWFHFNLFILQKPKVV
MPVEVVRRCNAERLSAEGGAFEAGSGGATAKPATAKHFSRDAQTFSNAGRHPGRWRQVDL
ATGDLGCRWKARIPAGPQPPLQASEAGELGGIGRKSGFGCPPASGALQASSRS

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_5|702_bp
atgctgagcgcggaccggcgtgtcctccccacagcgcccccgcgcggcctcctcccgctg
cgccccgcacggcgacccgccgcggaacaccgcctgctctccggggtcagcattatcaaa
gagaagatttcctccaggcccttaaaatttacgtggcgccttgttacaaggagcgcgctg
gaagctggctttggtatctccttgtcccctctttctcctgccccgcccccacatccccgt
ggagaaacccccgagcaggccttttgcattttctcaccccacttagaactacaaaaaata
tatgggaaggtgatttggtttcacttcaacctcttcattttgcagaagcctaaagtggtg
atgccggtggaagtcgtgagacggtgcaacgctgagcggctttccgcagaaggaggtgcc
ttcgaggccggctctggaggagccacagccaagccggccacagcaaagcacttctctcgg
gatgctcagacgttttcaaatgccggtcgccaccctgggcgctggcggcaggttgactta
gccacaggcgacttaggctgcaggtggaaggcgaggattcccgccggcccccagcctcct
ctgcaggcctccgaagctggagagttgggtggaatagggaggaaatcgggcttcggatgc
cccccagcctcgggcgctctgcaggctagctcacggagttga

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_6|561_aa
MADPQAGSAAGDWEIDVESLELEEDVCGAPRSTPPGPSPPPADGDCEDDEDDDGVDEDAE
EEGDGEEAGASPGMPGQPEQRGGPQPRPPLAPQASPAGTGPRERCTPAGGGAEPRKLSRT
PKCARCRNHGVVSCLKGHKRFCRWRDCQCANCLLVVERQRVMAAQVALRRQQATEDKKGL
SGKQNNFERKAVYQRQVRAPSLLAKSILEGYRPIPAETYVGGTFPLPPPVSDRMRKRRAF
ADKELENIMLEREYKEREMLETSQAAALFLPNRMVPGPDYNSYKSAYSPSPVEPPSKDFC
NFLPTCLDLTMQYSGSGNMELISSNVSVATTYRQYPLSSRFLVWPKCGPISDTLLYQQCL
LNATTSVQALKPGASWDLKGARVQDGLSAEQDMMPSKLEGSLVLPHTPEIQTTRSDLQGH
QAVPERSAFSPPRRNFSPIVDTDSLAAQGHVLTKISKENTRHPLPLRHNPFHSLFQQTLT
DKSGPELKTPFVKEAFEETPKKHRECLVKDNQKYTFTIDRCAKDLFVAKQVGTKLSVNEP
LSFSVESILKRPSSAITRVSQ

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_6|1686_bp
atggccgacccgcaggctggctccgcggccggggactgggagatcgatgtcgagagcctg
gagctggaagaggacgtctgcggggcgccgcggtccacgccccccgggcccagcccgccg
ccggcggacggggactgcgaggacgacgaagatgacgacggggtggacgaagacgcggaa
gaagagggcgacggcgaggaggcaggcgcgtcccccgggatgcccggccagccggagcag
cgggggggaccgcagccgaggccgccgctcgcgcctcaggcctcacccgccggcaccggt
ccccgagagcgctgcactcccgcgggcggcggcgcggagccgcgcaagctgagccgcacg
cccaagtgcgcgcgctgccgcaaccacggcgtggtgtcctgcctgaagggccacaagcgc
ttctgtcgctggcgcgactgccagtgcgccaactgcctgctggtggtggagcggcagcgc
gtcatggccgcccaggtggcgctccggaggcagcaggccaccgaggacaagaaggggctt
tccgggaaacagaataatttcgagcgcaaagctgtgtaccagaggcaagtcagagccccc
agtttgctggccaaaagcattttagaaggctatcgccccattccagcggagacttatgta
ggagggaccttccctctacctcccccagttagtgacaggatgaggaaaagaagagccttt
gctgataaagagttggagaacattatgctggagagagaatataaagaaagggagatgttg
gaaacttctcaagctgctgctctgtttctgcccaaccgcatggtgcctggacctgactac
aattcctacaaaagtgcctacagccccagcccagtggaaccaccaagcaaggacttctgt
aattttttgcccacctgccttgatttaaccatgcagtattcagggtctgggaatatggaa
ctaatttcttctaatgtcagcgtggccacaacttatagacagtatcccttgtcctcaaga
tttttagtttggcccaagtgtggccccattagcgacaccctcctctaccagcaatgcctg
ctaaatgccaccacctcagttcaagccctgaagcctggggccagctgggacttgaaggga
gcacgagtccaggatggactcagtgcagagcaggacatgatgccatcgaaattggaaggt
tccctggtgctgcctcacactcctgagatccagaccacgagaagtgaccttcagggtcat
caggctgtcccagagaggtccgcgttctccccaccccgacggaatttctctcccattgtt
gacacggactccctggcagctcaagggcatgtcttaacgaagatcagcaaagaaaacacc
aggcaccctctgccacttagacataatccattccactcattattccagcaaacacttact
gacaaatcgggtcctgagttgaaaacaccatttgtcaaagaggcctttgaagagacccct
aagaaacacagagagtgtttagttaaggacaaccagaagtacacatttacaatagataga
tgtgcaaaagacctttttgtagccaaacaagttggaacaaaactctcggtgaatgaacca
ctgtcattttctgttgagtctattcttaagaggccttcatctgccatcactcgtgtctct
cagtga

>gi568815589f:877002_1091002|GENSCAN_predicted_peptide_7|317_aa
XTLSTPCACILLTKVKWSKDDTLPKPDQSEFFPKMFNQTFLTFVQRVRGERGNSSDLLLL
KNDFNLAQWEAPGKCIQFYSPDHNHNIQFHHPQNVFVFLCSQSAPPTPPWPLATTSMFSV
IIALPTGWAMDQYQSVGRTAGEMQSTIREYYKHLYPNKLENLEEIDQFLDRYTFPRLKQE
EVESLNRLITSPSPEIEAVINNLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQTIEKE
GLLLNSFYEASIILIPKSGRDTTKKENFRPISLINIDAKILNKILANQIQRHIEKLIHHN
QLSFIPGMPAGSTYANQ

>gi568815589f:877002_1091002|GENSCAN_predicted_CDS_7|954_bp
ntaaccttatctacaccttgtgcctgcatcttactcaccaaggttaagtggtccaaggat
gacactctacccaagcctgaccaatcagagttttttcccaagatgtttaaccagacattt
cttacttttgtgcagcgggtacggggagaaaggggtaacagcagtgacctgttactcctg
aaaaatgactttaacttggcccaatgggaagcccctggcaaatgtatacagttttatagc
cccgatcacaatcataatatacaatttcatcacccccaaaatgtcttcgtgttcctttgt
agccaatcagctcccccgaccccaccttggcccctagcaaccacttctatgttttctgtc
attatagctttacccacaggctgggctatggaccagtaccagtctgtgggccgcacagca
ggagaaatgcaatctaccatcagagaatattataaacacctctacccaaataaactagaa
aatctagaagaaattgatcaattcctggacagatacaccttcccaagactaaaacaagaa
gaagttgaatccctcaatagactaataacaagtccaagtcctgaaattgaggcagtaatt
aataacctaccaaccaaaaaaagcccaggaccagatggatttacagccgaattctaccag
aggtacaaagaagagctggtaccattccttctgaaactattccaaacaatagaaaaagag
ggactcctccttaactcattttatgaggccagcatcatcctgataccaaaatctggcaga
gacacaacaaaaaaagaaaatttcaggccaatatccctgatcaacattgatgcaaaaatc
ctcaataaaatactggcaaaccaaatccagcggcacatcgaaaagcttatccaccacaat
caactcagcttcatccctgggatgccggctggttcaacatacgcaaaccaataa