GENSCAN 1.0 Date run: 3-Nov-116 Time: 10:43:15 Sequence gi568815597r:77597320_77841697 : 244378 bp : 39.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 16873 16969 97 0 1 32 41 156 0.362 5.82 1.02 Term + 27933 28021 89 1 2 54 41 96 0.233 -1.66 1.03 PlyA + 28203 28208 6 1.05 2.02 PlyA - 29005 29000 6 1.05 2.01 Sngl - 36035 34521 1515 2 0 35 36 1387 0.993 123.33 2.00 Prom - 44432 44393 40 -5.95 3.23 PlyA - 46211 46206 6 1.05 3.22 Term - 48173 48107 67 1 1 96 38 85 0.032 0.73 3.21 Intr - 86370 85943 428 0 2 63 105 359 0.080 26.76 3.20 Intr - 86679 86542 138 0 0 19 51 201 0.072 9.24 3.19 Intr - 100155 100019 137 1 2 38 -1 104 0.015 -4.13 3.18 Intr - 100612 100544 69 2 0 71 115 39 0.858 3.14 3.17 Intr - 104152 104050 103 1 1 96 107 45 0.930 6.03 3.16 Intr - 114536 114428 109 1 1 73 65 30 0.098 -1.43 3.15 Intr - 118549 118423 127 2 1 103 63 68 0.726 4.62 3.14 Intr - 120728 120548 181 2 1 85 86 50 0.575 3.02 3.13 Intr - 121322 121277 46 1 1 134 64 9 0.548 0.59 3.12 Intr - 123886 123853 34 2 1 99 106 -8 0.559 -1.54 3.11 Intr - 124582 124512 71 0 2 77 116 61 0.901 5.71 3.10 Intr - 124877 124705 173 2 2 2 87 109 0.728 0.02 3.09 Intr - 126124 126012 113 2 2 67 96 51 0.965 2.98 3.08 Intr - 128443 128303 141 2 0 52 93 130 0.979 9.30 3.07 Intr - 131393 130976 418 2 1 15 61 441 0.871 26.77 3.06 Intr - 132619 132541 79 0 1 64 92 54 0.829 2.03 3.05 Intr - 133412 133299 114 1 0 58 91 72 0.735 3.14 3.04 Intr - 137097 137028 70 1 1 111 27 37 0.497 -2.78 3.03 Intr - 138839 138737 103 2 1 50 84 127 0.860 7.33 3.02 Intr - 144429 144298 132 0 0 102 52 91 0.761 6.82 3.01 Init - 145365 145342 24 0 0 51 82 3 0.334 -4.42 3.00 Prom - 145946 145907 40 -3.75 4.00 Prom + 146509 146548 40 -6.75 4.01 Init + 156641 156814 174 1 0 57 31 105 0.020 1.09 4.02 Intr + 162048 162567 520 2 1 103 59 252 0.009 15.40 4.03 Term + 176631 177445 815 1 2 2 45 702 0.139 49.63 4.04 PlyA + 178786 178791 6 1.05 5.03 PlyA - 180761 180756 6 1.05 5.02 Term - 182795 182352 444 2 0 73 53 178 0.521 7.05 5.01 Init - 188629 188318 312 1 0 106 28 140 0.586 7.17 5.00 Prom - 205674 205635 40 -4.25 6.02 PlyA - 205707 205702 6 1.05 6.01 Sngl - 214462 213542 921 1 0 84 42 839 0.999 75.01 6.00 Prom - 214854 214815 40 -12.43 7.00 Prom + 215138 215177 40 -11.44 7.01 Init + 215850 215898 49 2 1 86 58 39 0.991 -0.14 7.02 Intr + 216415 216548 134 2 2 117 67 152 0.887 15.44 7.03 Intr + 217789 217912 124 0 1 120 91 128 0.999 15.54 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 86721 86542 180 0 0 65 51 177 0.847 10.93 S.002 Term - 100155 99998 158 1 2 38 28 141 0.866 0.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_1|61_aa MFLTVAYQDTSNAVKPDASPKPTLVEQQLQMPQFRSGNLHETQPLNRKAYKQENTIKHPT H >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_1|186_bp atgtttctgacagtagcttatcaagatacatccaatgctgtaaaaccggatgcctctcct aagccgactctagtggaacagcaactacagatgccacaatttagatctggtaacttacat gaaacacagcccctaaaccggaaagcatacaaacaagaaaacacaatcaaacatcccaca cactaa >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_2|504_aa MAASRSTRVTRSTVGLNGLDESFCGRTLRNRSIAHPEEISSNSQVRSRSPKKRPEPVPIQ KGNNNGRTTDLKQQSTRESWVSPRKRGLSSSEKDNIERQAIENCERRQTEPVSPVLKRIK RCLRSEAPNSSEEDSPIKSDKESVEQRSTVVDNDADFQGTKRACRCLILDDCEKREIKKV NVSEEGPLNSAVVEEITGYLAVNGVDDSDSAVINCDDCQPDGNTKQNSIGSYVLQEKSVA ENGDTDTQTSMFLDSRKEDSYIDHKVPCTDSQVQVKLEDHKIVTACLPVEHVNQLTTEPA TGPFSETQSSLRDSEEEVDVVGDSSASKEQCKENTNNELDTSLESMPASGEPEPSPVLDC VSAQMMSLSEPQEHRYTLRTSPRRAAPTRGSPTKNSSPYRENGQFEENNLSPNETNATVS DNVSQSPTNPGEISQNEKGICCDSQNNGSEGVSKPPSEARLNIGHLPSAKESASQHITEE EDDDPDVYYFESDHVALKHNKEYV >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_2|1515_bp atggctgcttcccgatctactcgtgttacaagatcaacagtggggttaaacggcttggat gaatctttttgtggtagaactttaaggaatcgtagcattgcgcatcctgaagaaatctct tctaattctcaagtacgatcaagatcaccaaagaagagaccagagcctgtgccaattcag aaaggaaataataatgggagaaccactgatttaaaacagcagagtacccgagaatcatgg gtaagccctaggaaaagaggactttcttcttcagaaaaggataacatagaaaggcaggct atagaaaattgtgagagaaggcaaacagaacctgtttcaccagttttaaaaagaattaag cgttgtcttagatctgaagcaccaaacagttcagaagaagattctcctataaaatcagac aaggagtcagtagaacagaggagtacagtagtggacaatgatgcagattttcaagggact aaacgagcttgtcgatgtcttatactggatgattgtgagaaaagggaaattaaaaaggtg aatgtcagtgaggaagggccacttaattctgcagtagttgaagaaatcacaggctatttg gctgtcaatggtgttgatgacagtgattcagctgttataaactgtgatgactgtcagcct gatgggaacactaaacaaaatagcattggttcctatgtgttacaggaaaaatcagtagct gaaaatggggatacggatacccaaacttcaatgttccttgatagtaggaaggaggacagt tatatagaccataaggtgccttgcacagattcacaagtgcaggtcaagttggaggaccac aaaatagtaactgcctgcttgcctgtggaacatgttaatcagctgactactgagccagct acagggcccttttctgaaactcagtcatctttaagggattctgaggaggaagtagatgtg gtgggagatagcagtgcctcaaaagagcagtgtaaagaaaacaccaataacgaactggac acaagtcttgagagtatgccagcctccggagaacctgaaccatctcctgttctagactgt gtttcagctcaaatgatgtctttatcagaacctcaagaacatcgttatactctgagaacc tcaccacgaagggcagcccctaccagaggtagtcccactaaaaacagttctccttacaga gaaaatggacaatttgaggagaataatcttagtcctaatgaaacaaatgcaactgttagt gataatgtaagtcaatctcctacaaatcctggtgaaatttctcaaaatgaaaaagggata tgttgtgactctcaaaataatggaagtgaaggagtaagtaaaccaccctcagaggcaaga ctcaatattggacatttgccatctgccaaagagagtgccagtcagcacattacagaagag gaagatgatgatcctgatgtttattactttgaatcagatcatgtggcactgaaacacaac aaagagtatgtgtaa >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_3|958_aa MKSPVIKKVLPHFESLGKQEKIPNKMSAFRNHCPHLDSVGEITKEDLIQKSLDFKIPSNT TLKTPLVAVFDDLDIEADEEDELRARGLTGLKNIGNTCYMNAALQALSNCPPLTQFFLDC GGLARTDKKPAICKSYLKLMTELWHKSRPGSVVPTTLFQGIKTVNPTFRGYSQQDAQEFL RCLMDLLHEELKEQVMEVEEDPQTITTEETMEEDKSQSDVDFQSCESCSNSDRAENENGS RCFSEDNNETTMLIQDDENNSEMSKDWQKEKMCNKINKVNSEGEFDKDRDSISETVDLNN QETVKVQIHSRASEYITDVHSNDLSTPQILPSNEGVNPRLSASPPKSGNLWPGLAPPHKK AQSASPKRKKQHKKYRSVISDIFDGTIISSVQCLTCDRVSVTLETFQDLSLPIPGKEDLA KLHSSSHPTSIVKAGSCGEAYAPQGWIAFFMEYVKSWFWGPVVTLQDCLAAFFARDELKG DNMYSCEKCKKLRNGVKFCKVQNFPEILCIHLKRFRHELMFSTKISTHVSFPLEGLDLQP FLAKDSPAQIVTYDLLSVICHHGTASSGHYIAYCRNNLNNLWYEFDDQSVTEVSESTVQN AEAYVLFYRYGGGPAVNHLYICHTCQIEAEKIEKRRKTELEIFIRLNRAFQKEDSPATFY CISMQWFREWESFVKGKDGDPPGPIDNTKIAVTKCGNVMLRQGADSGQISEETWNFLQSI YGGGPEVILRPPVVHVDPDILQAEEKIEPSEKGPEICIEHRGVKKNLEEMFCQRNRQAQR PEVVRACVLWYKPEGQVGRLQGMDCNPPSEDQTSPRGRDWPGRASAAPDPRALHGPRLPA GGVVSAPAAAATAAAAVAAAAAARQSQRGERRGPEGAAGAAGSGRSGRRGPTWKRRGAAI ELPAAVATRASAVAGAESGHGGGASPSGPLRPTPRERRPGHTLLPGDHHIDAELNADT >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_3|2877_bp atgaaaagccctgttattaaaaaggtgttacctcattttgaaagtcttgggaaacaggaa aaaattcctaacaaaatgtcagcttttcgaaatcattgtccacatttggattcagttggt gaaataacaaaagaagatttgatacaaaaatcccttgattttaaaatacccagtaataca acattaaaaactcctctggttgccgtatttgatgatctggatatagaagcggatgaagaa gatgaacttagggccagaggtcttacaggtttgaaaaatattggaaatacttgttacatg aatgcagctttgcaggctctttctaattgcccacctttgacacagttttttcttgattgt ggaggactagctcgaacagataagaaacctgccatttgtaaaagttatctcaaactaatg acagagctgtggcataaaagcaggccaggatctgttgtgcctactactctgtttcaagga attaaaactgtaaatccaacatttcgggggtattctcagcaggatgctcaagaattcctt cgatgtttaatggatttgcttcatgaagaattgaaagagcaagtcatggaagtagaagaa gatccgcaaaccataaccactgaggagacaatggaagaagacaagagccagtcggatgta gattttcagtcttgtgaatcttgtagcaacagtgatagagcagaaaatgaaaatggctct agatgcttttctgaagataataatgaaacaacaatgttaattcaggatgatgaaaacaat tcagaaatgtcaaaggattggcaaaaagagaagatgtgcaataagattaataaagtaaat tctgaaggcgaatttgataaagatagagactctatatctgaaacagtcgacttaaacaac caggaaactgtcaaagtgcaaatacacagcagagcttcagaatatatcactgatgtccat tcgaatgacctgtctacaccacagatccttccatcaaatgaaggtgttaatccacgttta tcggcaagccctcctaaatcaggcaatttgtggccaggattggcaccaccacacaaaaaa gctcagtctgcatctccaaagagaaaaaaacagcacaagaaatacagaagtgttatttca gacatatttgatggaacaatcattagttcagtgcagtgtctgacttgtgacagggtgtct gtaaccctcgagacctttcaagatctgtccttgccaattcctggcaaggaagaccttgct aagctgcattcatcaagtcatccaacttctatagtcaaagcaggatcatgtggcgaagca tatgctccacaagggtggatagcttttttcatggaatatgtgaagagctggttttggggt ccagtagtaaccttgcaagattgtcttgctgccttctttgccagagatgaactaaaaggt gacaatatgtacagttgtgaaaaatgcaaaaagttgagaaatggagtgaagttttgtaaa gtacaaaactttcctgagattttgtgcatccaccttaaaagattcagacatgaactaatg ttttccaccaaaatcagtacccatgtttcatttccgctagaaggcttggatcttcagcca tttcttgctaaggatagtccagctcaaattgtgacatatgatcttctgtcagtcatttgc catcatggaactgcaagtagtggacactatatagcctactgccgaaacaatctaaataat ctctggtatgaatttgatgatcagagtgtcactgaagtttcagaatctactgtacaaaat gcagaagcttacgttcttttctataggtatggtggaggaccagctgtcaaccatctgtac atttgtcatacttgccaaattgaggcggagaaaattgaaaaaagaagaaaaactgaattg gaaatttttattcggcttaacagagcgttccaaaaagaggactctccagctactttttat tgcatcagtatgcagtggtttagagaatgggaaagttttgtgaagggtaaagatggagat cctccaggtcctattgacaatactaagattgcagtcactaaatgtggtaatgtgatgctt aggcaaggagcagattctggccagatttctgaagaaacatggaattttctgcagtctatt tatggtggagggcctgaagttatcctgcgacctccggttgttcatgttgatccagatata cttcaagcagaagaaaaaattgaaccctctgagaagggacctgaaatatgcatcgaacat cgaggagttaaaaaaaatctggaggaaatgttctgccagagaaatcggcaagcacaaagg cccgaagtggttcgagcctgcgtgctgtggtacaagccagagggccaggttggacggttg caagggatggactgcaacccgccctcggaagaccaaacttcgcccaggggcagggactgg ccggggagggcctcggcggcgcccgacccgcgggcgctgcacgggccgcggttaccagca ggcggtgtagtgagcgcgcctgcagcagcagcaacagcagcagcagcggtcgccgccgcc gccgccgcccgccagtcccaacgaggagaaaggaggggaccggaaggagccgctggtgct gcgggaagtggcaggagcgggaggcggggacccacctggaagcgccgcggcgccgctatc gagcttcctgcagcggtggccacccgagcaagtgccgtggcgggggcggagagcggccac ggcggcggcgcctccccaagtggcccgttgcgtccgaccccgcgtgaaaggcgacctggt cataccctgctccctggagatcaccatattgatgccgaacttaatgcagacacctga >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_4|502_aa MPGLDTHVPIEHDDQRDPFFSVQLEKVRTAYSINGEDFGSNTDELSWKEHVPHNCYTEGR TPEDPSSNQRPVPILHRPLTGRPLDYLCNAEHRGAKAFFPPANPPPPRTKKKRLLQRPQP QRAEARTETGPRTRAKRPWAPRPSQQRRAHLHGLARGTRQLDQQRPAAAPPQAPLFLLSS RRGAVSRQEGRKTAPQRCPRGVRLLNWPLPAAAAAPSGVSGAAAGVSGDRGGGQQGQDGA TDPHGVSTLGRSNDLDLHRAGPQGGDRLLHPVSDARVHGSAARQHCVGVEVFAYVHITLH DGVEGSFLDATGFHAQEGRLEEHLGAVEPLLADSDDLAVGQLVALLQGGAGGHRGHLLIQ VQGDVAQLLLDAMHDFPLGGGGEAVAALHEDLYEVISQVLVSQVQKQGGLEEGLPLVDGH SVGDPVAGVHHDVARSVQGQHGLDGHVHGWGVEGLKHDMGHLLVIGLGVQGGLSRHYGCS LEPHAARCRSCGARSSPCHPSW >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_4|1509_bp atgccaggacttgatacacatgttccaattgaacatgatgaccaaagggatcctttcttt tctgtgcaacttgagaaagtcagaacagcatattcaatcaacggtgaagacttcggttca aatacagatgaactttcatggaaggagcatgttccccataattgttacacagagggaaga acccctgaggacccttcctcaaatcaacgaccggttcccatcctccaccgaccgttgaca gggcgaccactagattatctctgcaacgcagagcaccgaggggccaaggccttcttcccg cccgcgaacccgccgccacccaggaccaagaaaaagaggctgctgcagcggccgcaaccc cagcgcgcggaagcgagaacagagaccggaccccggacgcgagcgaaacgcccttgggcc ccgcgcccctcccagcagcgccgcgctcacctccacggcctggcccgcgggacccgccag ctcgaccaacaacggcctgcagccgcacctccgcaagctcctcttttccttctcagctct cggagaggggcagtgtcgcgtcaggagggccggaaaacggccccgcagcgctgccctcgg ggggtccgcctcctgaactggccacttcccgcagcagccgcggctccttccggtgtctcc ggggccgccgcaggcgtctccggcgataggggaggtggacagcaaggccaggatggagcc actgatccacatggagtatctactctcgggaggagcaatgatcttgatctccatcgtgct gggccccagggtggtgatcgccttctgcatcctgtcagcgatgccagggtacatggtagt gccgccagacagcactgtgttggcgtagaggtctttgcatatgtgcacatcacacttcat gatggagttgaaggtagtttcttggatgccacaggattccatgcccaggaaggaagactg gaagagcaccttggggcagtggaaccgctccttgccgatagtgatgacctggctgttggg cagctagtagctcttctccagggaggagctggaggccaccgtggccatctcctgatccaa gtccagggtgacgtagcacagcttctccttgatgccatgcacgatttcccactgggcggt ggtggtgaagctgtagcggcgctccacgaggatctttatgaggtaatcagtcaggtcctg gtcagccaggtccagaaacagggtggcctggaggagggcctacccctcgtagatgggcac agtgtgggtgaccctgtcgctggagtccatcacgatgtggccagaagcgtacagggacag catggcctggatggccatgtacatggctggggtgttgaaggtctcaaacatgatatgggt catcttctcgtgattggccttggagttcaggggggcctcagtcggcactacgggtgctcc ttggagccacacgcggctcgttgtagaagttgtggtgccagatcttctccatgtcatccc agttggtga >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_5|251_aa MESEAPHRVPTGAPPSAAVRRGPPSSRPQNDRSTNSLHHTPGKATDIQCQPIKAAGREAV PCKATRAELPTTMGTHLLHQHDLDVRHGVKRDHFGTLIFDCPGRNTRKGGPSSSAVQTLP LAQSSGEPIPIPRFLCRPPSGVGTLLLSYASTNQARPCLASEMRRGRGCSGWHGRRPSGF PPLHLALRELTPAMESAPPLPRGLCPNCVAAQLSASSRPPLHPTPPLALYLELQARYSWP AHASFPADAWR >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_5|756_bp atggagtcggaggccccacacagagtccctactggggcaccgcctagtgcagctgtgaga agagggccaccgtcctccagaccccagaatgatagatccaccaacagcttgcaccataca cctgggaaagccacagacattcaatgccagcccataaaagcagccgggagggaggctgta ccctgcaaagccacaagggcagagctacctacgaccatgggaacccacctcttgcatcag catgacctggatgtgagacatggagtcaaaagagatcattttggaactttaatatttgac tgccctggccggaacaccagaaaagggggtccctccagctcggccgtccagacccttcct ctggctcagagcagtggcgagcccatcccaattccgcgcttcctttgcaggcctccctct ggcgtggggaccctcctgctttcctatgcaagtactaaccaggcccgaccctgcttagct tctgagatgagacgaggtcgggggtgttcagggtggcacggccgtagaccctctggcttt ccgccgctccacctggcactccgagagctaacgccagcgatggagtccgcgccacccctg ccccgaggcctctgccccaactgcgtggccgctcaactctccgcttcctcccgcccgcct ctccaccccaccccgcccctggccctgtacctggagctccaggccaggtacagctggcct gcccacgccagcttcccagctgatgcctggcgctga >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_6|306_aa MSEVTRSLLQRWGASFRRGADFDSWGQLVEAIDEYQILARHLQKEAQAQHNNSEFTEEQK KTIGKIATCLELRSAALQSTQSQEEFKLEDLKKLEPILKNILTYNKEFPFDVQPVPLRRI LAPGEEENLEFEEDEEEGGAGAGSPDSFPARVPGTLLPRLPSEPGMTLLTIRIAKIGLKD AGQCIDPYITVSVKDLNGIDLTPVQDTPVASRKEDTYVHFNVDIELQKHVEKLTKGAAIF FEFKPYKPKKRFTSTKCFAFMEMDEIKPGPIVIELYKKPTDFKRKKLQLLTKKPLYLHLH QTLHKE >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_6|921_bp atgtcggaggtgacccggagtctgctgcagcgctggggcgccagttttaggagaggcgcc gacttcgactcttggggccagctggtggaggcgatagacgagtatcagatattagcaaga catctacaaaaggaggcccaagctcaacacaataattctgaattcacagaagaacaaaag aaaaccataggcaaaattgcaacatgcttggaattgcgaagtgcagctttacagtccaca cagtctcaagaagaatttaaactggaggacctgaagaagctagaaccaatcctaaagaat attcttacatataataaagaattcccatttgatgttcagcctgtcccattaagaagaatt ttggcacctggtgaagaagagaatttggaatttgaagaagatgaagaagagggtggtgct ggagcagggtctcctgattcttttcctgctagagttcccggtactttattaccaaggttg ccatcggaaccaggaatgacattactcactatcagaattgcgaaaattggtttgaaagat gctgggcagtgcatcgatccctatattacagttagtgtaaaggatctgaatggcatagac ttaactcctgtgcaagatactcctgtggcttcaagaaaagaagatacatatgttcatttt aatgtggacattgagctccagaagcatgttgaaaaattaaccaaaggtgcagctatcttc tttgaattcaaaccctacaagcctaaaaaaaggtttaccagcaccaagtgttttgctttc atggagatggatgaaattaaacctgggccaattgtaatagaactatacaagaaacccact gactttaaaagaaagaaattgcaattattgaccaagaaaccactttatcttcatctacat caaactttgcacaaggaatga >gi568815597r:77597320_77841697|GENSCAN_predicted_peptide_7|103_aa MGFYHVGQAGLELLTSGMELFEEALRRWEQALTFRNRQAEDEACGSIKLGAGDAIAEENV DDIISTEFIHKLEALLQRAYRLQEEFEATLGASDPNSLADDIX >gi568815597r:77597320_77841697|GENSCAN_predicted_CDS_7|309_bp atggggttttaccatgttggccaggctggtctcgaactcctgacctcaggtatggaattg tttgaagaggcattgcgtcgatgggaacaagctctgacctttcgcaatagacaggctgaa gatgaagcctgtggttccattaaactgggtgcaggagatgccattgctgaagaaaatgta gatgatattattagtactgaatttatccataaactcgaagctctgctgcaaagagcctat cgtctccaagaggagtttgaagctacccttggggcatctgatcctaattcccttgctgat gatattgnn