GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:47:02

Sequence gi568815578r:44086326_44286705 : 200380 bp : 46.76% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3900   4028  129  2  0   66   72    85 0.536   5.59
 1.02 Term +   4735   4842  108  0  0   52   36    81 0.408  -2.19
 1.03 PlyA +   5929   5934    6                               1.05

 2.00 Prom +  14977  15016   40                               2.04
 2.01 Init +  19724  19802   79  1  1   56   64    63 0.725   1.92
 2.02 Term +  20728  20741   14  2  2  144   38     5 0.277  -0.54
 2.03 PlyA +  21802  21807    6                               1.05

 3.06 PlyA -  25394  25389    6                               1.05
 3.05 Term -  28551  28471   81  0  0   80   47   131 0.766   5.89
 3.04 Intr -  30061  29322  740  2  2   90   75  1025 0.654  92.56
 3.03 Intr -  32298  32180  119  2  2   86   91   144 0.999  14.61
 3.02 Intr -  37704  37655   50  0  2   99  101   -10 0.421  -1.12
 3.01 Init -  39233  39180   54  2  0   54   86    71 0.301   4.79
 3.00 Prom -  42237  42198   40                              -4.66

 4.00 Prom +  44380  44419   40                              -5.26
 4.01 Sngl +  49994  50179  186  1  0   84   37   268 0.829  16.39
 4.02 PlyA +  52509  52514    6                               1.05

 5.00 Prom +  56647  56686   40                              -7.36
 5.01 Init +  56729  56867  139  1  1   40   48    74 0.351  -1.10
 5.02 Intr +  62152  62292  141  2  0  111   65   109 0.872  11.22
 5.03 Intr +  62883  63070  188  1  2   81   31    56 0.235  -1.39
 5.04 Term +  68244  68405  162  2  0   84   48    62 0.397  -0.26
 5.05 PlyA +  69874  69879    6                               1.05

 6.04 PlyA -  70047  70042    6                               1.05
 6.03 Term -  74082  73289  794  1  2  137   48  1746 0.792 168.96
 6.02 Intr -  84090  83987  104  2  2   46   63   116 0.251   4.72
 6.01 Init -  89642  89578   65  2  2   99   96    21 0.528   4.64
 6.00 Prom -  93441  93402   40                              -8.56

 7.03 PlyA -  93997  93992    6                               1.05
 7.02 Term -  95245  95123  123  1  0   85   43    96 0.444   3.18
 7.01 Init - 100380  99976  405  0  0   52   59   512 0.414  41.29
 7.00 Prom - 105492 105453   40                              -4.56

 8.06 PlyA - 107657 107652    6                               1.05
 8.05 Term - 111414 110727  688  2  1  108   44   253 0.795  15.98
 8.04 Intr - 116749 116636  114  0  0  102   94    27 0.568   4.26
 8.03 Intr - 125553 125418  136  1  1   26   24   106 0.001  -2.37
 8.02 Intr - 131306 131242   65  1  2   62  103    75 0.089   4.76
 8.01 Init - 142100 142009   92  2  2   49   38   121 0.344   3.16
 8.00 Prom - 147896 147857   40                              -4.26

 9.00 Prom + 149767 149806   40                              -5.56
 9.01 Init + 158647 158774  128  0  2   28  105    38 0.155  -0.87
 9.02 Intr + 158867 158913   47  2  2   73  110     1 0.382  -1.15
 9.03 Intr + 160778 160877  100  0  1  106  117    55 0.926   9.37
 9.04 Intr + 161048 161189  142  2  1   63   75   126 0.964   9.16
 9.05 Intr + 170828 171020  193  1  1   96   80   506 0.998  49.67
 9.06 Intr + 172109 172282  174  0  0   59   73   388 0.638  34.31
 9.07 Intr + 176905 177002   98  2  2   69  101   156 0.996  14.73
 9.08 Intr + 178120 178234  115  0  1  117   59   244 0.980  24.42
 9.09 Term + 192632 192975  344  0  2  126   49   493 0.900  43.87
 9.10 PlyA + 194029 194034    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 123526 123533    8  0  2   57  102    23 0.820  -0.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_1|78_aa
THHHENKPMLACWMMRDPAEKRKVPQLTSESPVPIQQPTSDVRPCQEMPHSPTFGSEQAG
RVIFQLDTAGDDQDSDPG

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_1|237_bp
acccatcaccatgagaacaagcccatgctagcctgttggatgatgagagaccctgcagag
aaaaggaaggtgccccagctgacatcagaaagccctgtgcccatccagcagccaacttca
gatgtacgaccttgccaggaaatgccccacagtcccacatttggaagtgaacaagctgga
agggtgattttccaactggacacagctggtgacgatcaagatagtgaccctggataa

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_2|30_aa
MAATAPSISTHDPQPASKSKNKGESQGFDP

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_2|93_bp
atggctgccacagctccaagcatcagcacccacgacccccaacctgcatccaaaagcaag
aataagggggaaagccaaggttttgatccctaa

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_3|347_aa
MTDLAQKWYQEGSSLVSWGQSHPPDPHPEAWIQRWTSHAKAKAEAAEQAALAANQESNIA
RTLARELAPDFYQPGPEYQKRRLLQEILENSESLLEPPDRGAGAAGLPQPPRESPQLHER
ETPRPEGGSPSPAGTPPQPKRPRPGVSKDGLLSPGAWNGEPSGEGSRSVTPSEGAGRRSP
ARPATERMAIEALQAPPAPSREPEVALYQGYHSYAVRTTPPEPPPFEDQPEPEVSGSESA
PSSPATAPLQAPTLRGPEPARETPAKLEPKPIIPKAEPRAKARKTEARGLTKAGAKKKAR
KEAALAAEAEVEVEEVRLSGKVPNTILICMVILLNIGLAILFVHLLT

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_3|1044_bp
atgacggacctggctcagaagtggtaccaggaaggcagcagcttggtgtcatggggtcag
agccatcctccagaccctcacccagaggcatggatccagagatggacaagccacgccaag
gccaaagctgaggcagcggaacaggccgccctggctgccaaccaggagtccaacattgct
cgcactttggccagggagctggctccggacttctaccagccaggtccggaatatcagaag
cgccggctgctgcaggagatcctggagaactcggagagcctgctggagccccccgaccgg
ggcgccggcgcagcgggcctcccacagccgccccgcgagagcccgcagctgcacgagcgt
gagacccctcggcccgagggtggctccccgtcaccggccgggacgcccccgcagcccaag
cggcccaggcccggggtgtccaaggacggcctgctgagcccaggcgcctggaacggcgag
cccagcggtgagggcagccggtcagtcactccgtccgagggcgcgggccgccgcagcccc
gcgcgtccagccaccgagcgcatggccatcgaggctctgcaggcaccgcctgcgccgtcg
cgggagccggaggtggcgctttaccagggctaccacagctatgctgtgcgcaccacgccg
cccgagcccccaccctttgaggaccagcccgagcccgaggtctccgggtccgagtccgcg
ccctcgtccccggccaccgccccgctgcaggcccccacgctccgaggccccgagcctgca
cgcgagacccccgccaagctggagcccaagcccatcatccccaaagccgagcccagggcc
aaggcccgcaagactgaggctcgagggctgaccaaggcgggggccaagaagaaggcgcgg
aaggaggccgcactggcggcagaggcggaggtggaggtggaagaggtgaggctgtcgggc
aaggtccccaacaccatcctcatctgcatggtgatcctgctgaacatcggcctggccatc
ctctttgttcacctcctgacctga

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_4|61_aa
MTVTVLLQGDEVCWKMASPFAYSAPWVQPSKRMLNPLGLIAIIIININITIMNRQCLLRT
Y

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_4|186_bp
atgactgtcaccgtgctcctgcagggggacgaagtctgctggaaaatggcctcacccttt
gcctattctgctccctgggtgcaacccagcaagaggatgctgaatcccctgggactcatc
gccatcatcatcatcaacatcaacatcaccattatgaataggcaatgcttgctgcgcacc
tactag

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_5|209_aa
MKRAGVRERVKEKHFGQMEQHVQRTAGRKENGPLKETWLKANDGESDEERECGEETRKGN
KGFFVSFPPADNVLDAERGNVLGEGDPSCRVALVALPCFMSPPSPQIPALASSLTPSLQP
SPCHTLATRAMVQVWSYTFGQTAPSSRGYPLTKRGRKRCPDRSLRADKFHMSPEGGAGSH
NGEKARACLGVKRLWLLGMLSVSPQPNYS

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_5|630_bp
atgaaaagagcaggtgttcgtgagagagtaaaggaaaagcattttggacaaatggaacag
catgtgcaaaggactgcaggcaggaaagagaacggccctttgaaggaaacatggctgaag
gcaaatgatggggagagtgatgaggaaagggagtgtggtgaggaaacaaggaaaggaaac
aaagggttcttcgtttctttcccacctgctgacaacgtcctggatgcagaaaggggaaat
gtgttgggcgaaggcgacccgagctgccgcgtggccttagtggctctgccatgtttcatg
tctccaccatctccccagatccctgcattagcctcctcactgacgcccagcctacagcct
agcccttgccatactctcgctacaagagccatggtccaggtgtggtcatacacatttggg
cagaccgctcccagctctcgtggctacccactcacaaagcggggcaggaaaagatgtcct
gatcgtagcctccgggctgacaaattccacatgagcccggagggaggagctggaagccac
aatggagagaaagcccgggcctgcctgggtgtgaagaggctgtggctgctggggatgctt
tctgtctctccacagccaaattactcatga

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_6|320_aa
MGERCLLEVSMGIGFLGRPVHRSYANIWYYVRGLGEGSAMAGRQNRQDKDKRAAMGRTYQ
GQFTNGMRHGYGVRQSVPYGMAVVVRSPLRTSLSSLRSEHSNGTVAPDSPASPASDGPAL
PSPAIPRGGFALSLLANAEAAARAPKGGGLFQRGALLGKLRRAESRTSVGSQRSRVSFLK
SDLSSGASDAASTASLGEAAEGADEAAPFEADIDATTTETYMGEWKNDKRSGFGVSERSS
GLRYEGEWLDNLRHGYGCTTLPDGHREEGKYRHNVLVKDTKRRMLQLKSNKVRQKVEHSV
EGAQRAAAIARQKAEIAASR

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_6|963_bp
atgggtgagagatgcctgttggaagtgtccatgggcattgggtttctgggcaggcctgtg
cacagatcttatgccaacatctggtactatgtgagaggacttggagaaggatcagcgatg
gctggtagacaaaacaggcaagataaagacaagagagctgccatgggaaggacgtaccaa
ggccagttcaccaacggcatgcgccatggctacggagtacgccagagcgtgccctacggg
atggccgtggtggtgcgctcgccgctgcgcacgtcgctgtcgtccctgcgcagcgagcac
agcaacggcacggtggccccggactctcccgcctcgccggcctccgacggccccgcgctg
ccctcgcccgccatcccgcgtggcggcttcgcgctcagcctcctggccaatgccgaggcg
gccgcgcgggcgcccaagggcggcggcctcttccagcggggcgcgctgctgggcaagctg
cggcgcgcagagtcgcgcacgtccgtgggtagccagcgcagccgtgtcagcttccttaag
agcgacctcagctcgggcgccagcgacgccgcgtccaccgccagcctgggagaggccgcc
gagggcgccgacgaggccgcacccttcgaggccgatatcgacgccaccaccaccgagacc
tacatgggcgagtggaagaacgacaaacgctcgggcttcggcgtgagcgaacgctccagt
ggcctccgctacgagggcgagtggctggacaacctgcgccacggctatggctgcaccacg
ctgcccgacggccaccgcgaggagggcaagtaccgccacaacgtgctggtcaaggacacc
aagcgccgcatgctgcagctcaagagcaacaaggtccgccagaaagtggagcacagtgtg
gagggtgcccagcgcgccgctgctatcgcgcgccagaaggccgagattgccgcctccagg
tag

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_7|175_aa
MSGGRFDFDDGGAYCGGWEGGKAHGHGLCTGPKGQGEYSGSWNFGFEVAGVYTWPSGNTF
EGYWSQGKRHGLGIETKGRWLYKGEWTHGFKGRYGIRQSSSSGAKYEGTWNNGLQDGYGT
ETYADGGEASWGPAESNATKETAVFQTGEDNALVDGEVNSVGHDLHFSKGKNRTE

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_7|528_bp
atgagtgggggccgcttcgactttgatgatggaggggcgtactgcgggggctgggagggg
ggaaaggcccatgggcatggactgtgcacaggccccaagggccagggcgaatactctggc
tcctggaactttggctttgaggtggcaggtgtctacacctggcccagcggaaacaccttt
gagggatactggagccagggcaaacggcatgggctgggcatagagaccaaggggcgctgg
ctctacaagggcgagtggacacatggcttcaagggacgctacggaatccggcagagctca
agcagcggtgccaagtatgagggcacctggaacaatggcctgcaagacggctatggcacc
gagacctatgctgatggaggtgaggccagctgggggcccgcagagtcaaatgctacaaag
gaaacagcagtcttccaaactggagaggacaacgcattagtggatggtgaagtcaattca
gtgggtcacgacctgcatttttcaaaaggaaagaacagaacagaatag

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_8|364_aa
MGDWNLKHGWILKARTEDFLLYLIKEKHTGTQVIHKSRVFHSFEATVKGFTNGVGHNQRH
PTQLRKIICSHEALSSADVELEIGRRGEAQLDQQNQSWSVASLSVGEGTGVRAPVRTATD
DTKPKTTCASKDSWHGSTRKSSRGAVRTQRRRRSKSPVLHPPKFIHCSTIASSSSSQLKH
KSQTDSPDGSSGLGISSPKEFSAGESSTSLDANHTGAVVEPLRTSVPRLPSESKKEDSSD
ATQVPQASLKASDLSDFQSVSKLNQGKPCTCIGKECQCKRWHDMEVYSFSGLQSVPPLAP
ERRSTLEDYSQSLHARTLSGSPRSCSEQARVFVDDVTIEDLSGYMEYYLYIPKKMSHMAE
MMYT

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_8|1095_bp
atgggagactggaacctgaagcacggatggattctcaaggcccgcactgaggacttcctt
ctatacctcatcaaggaaaaacacacaggcacgcaagtgatccacaaatcccgggtcttc
cacagttttgaagccacagtgaagggcttcaccaacggggtgggccacaatcaacgccat
ccaactcaattgcgcaaaatcatctgttctcacgaagctttgagcagtgcagacgtggag
ctggagattggcaggcgaggagaggcacagctagatcagcagaaccagtcctggtctgta
gcatctctgtctgttggagaaggcacaggtgtcagagcaccagtcagaacagcaacagat
gataccaaacctaaaaccacatgtgcatctaaagacagttggcacgggtctacaaggaag
tcttcacgaggagcagtgagaactcagcgtcgtcgacgttctaagtctcctgtccttcat
cctccaaagtttatacattgcagtacaatagcgtcttcttccagcagtcaactcaagcac
aaaagccagactgactcacctgatggcagcagtgggctgggaatttcatcccctaaagag
ttcagtgcaggagaaagctctacttctctcgatgctaatcacacaggggcagtcgttgag
cctttgagaacttctgttccaaggctcccatcagagagtaagaaggaagactcctctgac
gctacccaagtcccccaagcaagtctcaaagccagtgatctctctgactttcaatcagtt
tccaagctaaaccagggcaagccatgcacatgcataggcaaggaatgccagtgtaagaga
tggcatgatatggaagtgtattccttttcaggcctgcagagtgtccctcccttggctcca
gaacgaagatccacacttgaggactactctcagtcgctgcacgccagaactctgtctggc
tctccccgatcctgttctgagcaagctcgagtcttcgtggatgatgtgaccattgaggac
ctgtcaggctacatggagtattacttgtatattcccaagaaaatgtcccacatggcagaa
atgatgtacacctga

>gi568815578r:44086326_44286705|GENSCAN_predicted_peptide_9|446_aa
MGRESEESWGFYGQQHSSLLPTFIGHKSVMLSPHLNAKWSSPCYHYLYFTDDEPNLREGS
LTLRAGGFSGAGQGEEEKGAPGIPSPPPPTARWWPISALESDAAKPAEAPDAPEAASPAH
WPRESLVLYHWTQSFSSQKVRLVIAEKGLVCEERDVSLPQSEHKEPWFMRLNLGEEVPVI
IHRDNIISDYDQIIDYVERTFTGEHVVALMPEVGSLQHARVLQYRELLDALPMDAYTHGC
ILHPELTTDSMIPKYATAEIRRHLANATTDLMKLDHEEEPQLSEPYLSKQKKLMAKILEH
DDVSYLKKILGELAMVLDQIEAELEKRKLENEGQKCELWLCGCAFTLADVLLGATLHRLK
FLGLSKKYWEDGSRPNLQSFFERVQRRFAFRKVLGDIHTTLLSAVIPNAFRLVKRKPPSF
FGASFLMGSLGGMGYFAYWYLKKKYI

>gi568815578r:44086326_44286705|GENSCAN_predicted_CDS_9|1341_bp
atgggaagagagagtgaagaatcatggggattttatgggcagcagcattcctcacttctg
cccacattcattggccacaagtcagtcatgttgtcacctcacctcaatgcaaagtggagc
agtccctgttatcattatctctatttcacagatgatgagccgaatctcagggaggggagc
ctgacactgagggctggcggcttttctggcgcgggccagggggaggaggagaaaggagct
cccgggatcccctcgccccctcccccaactgcccgctggtggcccatctccgcgctggag
agcgatgcggccaagccagcggaggcccccgacgctcccgaggcggccagccccgcccat
tggcccagggagagcctggttctgtaccactggacccagtccttcagctcgcagaaggtg
cggctggtgatcgccgagaagggcctggtgtgcgaggagcgggacgtgagcctgccacag
agcgagcacaaggagccctggttcatgcggctcaacctgggcgaggaggtgcccgtcatc
atccaccgcgacaacatcatcagtgactatgaccagatcattgactatgtggagcgcacc
ttcacaggagagcacgtggtggccctgatgcccgaggtgggcagcctgcagcacgcacgg
gtgctgcagtaccgggagctgctggacgcactgcccatggatgcctacacgcatggctgc
atcctgcatcccgagctcaccaccgactccatgatccccaagtacgccacggccgagatc
cgcagacatttagccaatgccaccacggacctcatgaaactggaccatgaagaggagccc
cagctctccgagccctacctttctaaacaaaagaagctcatggccaagatcttggagcat
gatgatgtgagctacctgaagaagatcctcggggaactggccatggtgctggaccagatt
gaggcggagctggagaagaggaagctggagaacgaggggcagaaatgcgagctgtggctc
tgtggctgtgccttcaccctcgctgatgtcctcctgggagccaccctgcaccgcctcaag
ttcctgggactgtccaagaaatactgggaagatggcagccggcccaacctgcagtccttc
tttgagagggtccagagacgctttgccttccggaaagtcctgggtgacatccacaccacc
ctgctgtcggccgtcatccccaatgctttccggctggtcaagaggaaacccccatccttc
ttcggggcgtccttcctcatgggctccctgggtgggatgggctactttgcctactggtac
ctcaagaaaaaatacatctag