GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:37:33

Sequence gi568815596f:222952831_223153340 : 200510 bp : 40.78% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1195   1318  124  0  1   80   78   111 0.641   7.75
 1.02 Intr +   6519   6615   97  1  1   64   59    57 0.375  -1.45
 1.03 Intr +   7100   7207  108  2  0   67   70    78 0.183   2.38
 1.04 Intr +  11744  11954  211  2  1   86   61    95 0.165   4.59
 1.05 Term +  12892  13344  453  0  0   41   49   198 0.892   5.37
 1.06 PlyA +  16415  16420    6                               1.05

 2.07 PlyA -  17054  17049    6                               1.05
 2.06 Term -  19757  19609  149  0  2   82   42   106 0.309   2.48
 2.05 Intr -  23771  23722   50  1  2   94   91    19 0.292   0.21
 2.04 Intr -  29944  29743  202  2  1   42  113   149 0.902  10.22
 2.03 Intr -  30347  30114  234  0  0  104   32   125 0.815   5.24
 2.02 Intr -  31365  31220  146  2  2   85   90    12 0.214   0.11
 2.01 Init -  36868  36033  836  1  2   40   12   357 0.054  17.51
 2.00 Prom -  45584  45545   40                              -2.35

 3.00 Prom +  57616  57655   40                              -3.65
 3.01 Sngl +  60450  61085  636  2  0   45   43   303 0.652  17.43
 3.02 PlyA +  61313  61318    6                               1.05

 4.00 Prom +  61482  61521   40                              -7.55
 4.01 Sngl +  61575  62231  657  2  0   49   41   238 0.824  11.12
 4.02 PlyA +  62289  62294    6                               1.05

 5.03 PlyA -  64735  64730    6                               1.05
 5.02 Term -  68693  68480  214  1  1   57   38   139 0.106   1.52
 5.01 Init -  79504  79182  323  1  2   53  110   157 0.559  10.20
 5.00 Prom -  79549  79510   40                              -5.45

 6.03 PlyA -  79714  79709    6                               1.05
 6.02 Term -  80876  80422  455  0  2   56   45   227 0.880   9.53
 6.01 Init -  81581  81545   37  2  1  106   62    25 0.380   1.93
 6.00 Prom -  87203  87164   40                              -5.55

 7.00 Prom +  95318  95357   40                              -5.45
 7.01 Init +  97961  98022   62  1  2   60  109    65 0.437   6.57
 7.02 Intr +  98835  98875   41  0  2   55   27    80 0.266  -4.45
 7.03 Intr +  99304  99444  141  2  0   81  111    88 0.597   9.80
 7.04 Term +  99978 100513  536  1  2   61   42   711 0.996  57.22
 7.05 PlyA + 102768 102773    6                               1.05

 8.04 PlyA - 103397 103392    6                               1.05
 8.03 Term - 113507 113461   47  0  2  128   42    25 0.115  -1.71
 8.02 Intr - 124016 123876  141  0  0  115   75    80 0.782   8.80
 8.01 Init - 134168 134102   67  2  1   84   38    54 0.018   1.29
 8.00 Prom - 134428 134389   40                              -7.75

 9.03 PlyA - 136743 136738    6                               1.05
 9.02 Term - 139195 139029  167  2  2   92   42   143 0.853   7.20
 9.01 Init - 144668 144638   31  2  1  101  107     6 0.685   3.81
 9.00 Prom - 145316 145277   40                              -4.25

10.08 PlyA - 146493 146488    6                               1.05
10.07 Term - 149022 148902  121  2  1  109   42    49 0.210  -0.63
10.06 Intr - 159622 159451  172  2  1   44   81   152 0.170   8.18
10.05 Intr - 165211 165005  207  2  0   91   30   141 0.581   6.73
10.04 Intr - 172228 172114  115  1  1   85   44   100 0.253   4.40
10.03 Intr - 185695 185583  113  2  2  -14   63   101 0.100  -3.42
10.02 Intr - 187296 187132  165  1  0  121   88    36 0.340   5.91
10.01 Intr - 190492 190377  116  0  2   58  103    88 0.649   6.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_1|330_aa
MVVAVVLLPGILLGREAALSSFQCLWSGLLVGPAVIKLLVRSLWLHEDCLWAMRTTQLPP
PPSTDPEGLGIYSRSMVLEAQVFPDDVPGFPLHFVAFLQLVLFYPAIETDSLNSRTGNFC
KIGCGAIMLTPKGPPGSSSRSLADQRNLSRLWEQLESCLGAASQGQIHILWGRCARAPWS
KEAPTESCRGLWKGRDALAKAQEWAITSLSPSIVGQGPEGVGSSELLHFHSLHLGQTCPR
SPGLSRRATGAGISTGNKSTGKKVGNTKTVKRDPRGGARTMSPYPARVCITSKGQGQRLH
VHSPHTAEACLNSRSFIPLPAPMGRSSSFV

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_1|993_bp
atggtggtcgcagtggtgctgcttccagggattctcctgggtcgagaggctgcactgtca
tctttccagtgtctctggagtggcttgctggttggtccagctgttatcaaattgctagtg
aggtcactatggcttcatgaggactgcctttgggctatgagaaccactcagctcccacct
ccaccctccacagatccagaaggccttggaatatattctagatccatggttcttgaagcc
caggtgttccctgatgacgtccctggctttcccctacatttcgtggcctttcttcagctt
gtcctcttctatccagccattgaaacagattctctaaactccaggacaggaaatttctgc
aagataggctgtggagcaataatgctaactcctaaaggaccccctgggtcatcctcaaga
tctcttgcagaccagaggaacctttccagactctgggagcagcttgagtcttgtctcgga
gctgcttcacaaggccagatacacatcctgtggggccgctgtgcccgagcaccatggtca
aaggaagctcccactgaatcctgcagggggctgtggaagggaagagatgctctagccaag
gcccaggagtgggctattacatccctgtccccatccattgtgggtcaagggcctgagggg
gttggcagctcagagcttctccatttccactccctgcacctgggccaaacctgccccagg
agccctggcctctcaagaagagccacaggtgctggcataagcacaggaaacaaaagtaca
ggaaagaaggtgggaaacacaaaaacagtaaaaagggacccaaggggaggagcacggaca
atgtccccttaccctgctagggtgtgcattacaagtaaaggtcaaggccaaagacttcat
gtgcattccccgcacactgctgaagcatgtttgaattctcgttctttcattccactgcca
gctcccatgggcagatccagctcctttgtgtag

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_2|538_aa
MSETFEADSKVPTQTWRSRRKKWFCGLGQGSLCCVQPRDLVPCIPAAPAVAERRQCRAQA
TASEGASLKSLHLPCGVEPPSTQKSRIGVWEPSPKFQRMYGNAWMLRQKFAAGAGPSWRT
SARTVQKGNVGSEPPHRVPTGASHSGAVRRGPPSSRPQNGRSTDSLHCPPGKATGTQRQP
MKAAKREAIPCKATGVKLPNTMGTHHLYKCDLEVKHGVKGDHFGALRFDCPAGFQTCMGP
VVPLFWPISPIWNGYIYPMPVSHCIWEVTNLLLILQAHRSDHRIDLCYSKCGAQTRSNSS
TCDLERNAESQPGMVAHACNTSTLGGRGYKGGSHGHEQNLYAYLHSDDQAVEWLAEKHNA
ESSGLQKVRGTHQLQREGCQWRREVGKEWVAFSLPHRNGSKAMMQEEGQGGAVKGAGRGR
GRWGTVVEALEKGIWEDSQMGVGEGTGFQRKWFWLRLAESNNEARNTEDTVGSNRHVKVS
SLEVQTVVLGTSRPQDHCLCCAHCLEWSSPKTPVAHTINPSGLIEMSLAPRGLRCHAI

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_2|1617_bp
atgtcagagacctttgaggcagactcaaaggtcccaacacagacttggaggtctaggagg
aaaaagtggttttgtggtctgggccaaggctccctgtgctgtgtgcagcctagggacttg
gtgccctgcatcccagctgctccagcagttgctgaaaggaggcaatgtagagctcaggcc
acagcttcagagggtgcaagcctcaagtcattgcaccttccatgtggtgttgagcctccc
agtacacagaagtcaagaattggggtttgggaaccatcacctaaatttcagaggatgtat
ggaaatgcctggatgctcaggcagaagtttgctgcaggggcagggccctcatggagaacc
tctgctaggacagtgcagaagggaaatgtggggtcagagcccccacacagagtccctact
ggggcatcacatagtggagctgtgagaagagggccaccatcctccagaccccagaatggt
agatccactgacagcttgcactgtccacctggaaaagccacaggcactcaacgccagccc
atgaaagcagccaagagggaggctataccctgcaaagccacaggggtgaagctgcccaat
accatgggaactcaccacttgtataagtgtgacctcgaggtgaaacatggagtcaaagga
gatcattttggagctttaagatttgactgccccgctggatttcagacttgcatggggcct
gtagtacctttgttttggccaatttctcccatttggaatggctatatttacccaatgcct
gtatcccattgtatctgggaagtaactaacttgcttttgattttacaggctcataggtca
gatcatcgtatagatctttgctactcaaaatgtggtgcccagaccagaagcaacagcagc
acttgtgaccttgaaagaaatgcagaatctcagccaggcatggtggctcatgcctgtaat
accagcactttgggaggccgaggatacaaaggtgggtcacatggtcacgagcagaacctc
tatgcatatctgcattcagatgatcaagctgtagaatggctagccgaaaaacacaatgct
gaaagttcagggttgcagaaagtcagaggcacacatcagcttcaaagagaaggctgtcaa
tggagacgggaagtggggaaggagtgggtagccttctctctgccacacagaaatggctcc
aaggctatgatgcaagaggaggggcaaggtggagctgttaagggagcaggtagagggaga
ggcaggtgggggaccgtggttgaagcactagaaaagggcatctgggaagactcgcaaatg
ggagtgggggaagggacaggcttccagaggaagtggttttggttgaggctggctgagagc
aacaatgaggctaggaacaccgaggacaccgtgggaagcaacagacatgttaaggtcagc
agtcttgaagtacagacagttgttcttggcacctcccgaccccaggaccattgcctttgc
tgtgcccactgcctagaatggtcttcccctaaaactcccgtggctcacaccatcaatcct
tcaggtcttattgaaatgtcactggctccaagaggtctccgatgccacgctatttaa

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_3|211_aa
MKPEEKFREKRVKRNEQTLQEIWDYVKRPNLCLIGVPESDRENGTKSENTLQDIIQENFP
NLARQANIQIQEIQRTPQRYSLRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTHKGKPI
RLTADLLAESLQARREWGPIFNILKEKNFQPRISYPAKLHFISEGKIKSFTDKQMLRDFV
TTRPALKELLKEALNLERNNQYQPLQKHAKL

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_3|636_bp
atgaagccagaagagaagtttagagaaaaaagagtaaaaagaaatgaacaaaccctccaa
gaaatatgggactatgtgaaaagaccaaatctatgtctgattggtgtacctgaaagtgac
cgggagaatggaactaagtcggaaaacactctgcaggatattatccaggagaacttcccc
aatctagcaaggcaggccaacattcaaattcaggaaatacagagaacaccgcaaaggtac
tccttgagaagagcaactccaagacacataattgtcagattcaccaaagtcgaaatgaag
gaaaaaatgttaagggcagccagagagaaaggtcgggttacccacaaagggaagcccatc
agactaacagcagatctcttggcagaaagtctacaagccagaagagagtgggggcccata
ttcaacattcttaaagaaaagaattttcaacctagaatttcatatccagccaaactacac
ttcataagtgaaggaaaaataaaatcctttacagacaagcaaatgctgagagattttgtc
accaccaggcctgccctaaaagagctcctgaaggaagcactaaacttggaaaggaacaac
cagtaccagccactgcaaaaacatgccaaactgtaa

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_4|218_aa
MGDFNTPLSTLDRSMRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHRTYS
KINHIVGSKALLSKCKRKEIITNYLSDHSAIKLELRIKKLTQNCSTIWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKANRRQEITKIRAELKEIETQKTLQKNQ

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_4|657_bp
atgggagactttaataccccactgtcaacattagacagatcaatgagacagaaagttaac
aaggatatccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacattcttctcagcaccacatcgcacttattcc
aaaattaaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaaaagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc
actcaaaactgctcaactatatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaatgagaacaaagacaca
acataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatctaaaattgacaccctaacgtcacaattaaaa
gaactagagaagcaagagcaaacacattcaaaagctaacagaaggcaagaaataactaag
atcagagcagaactgaaggagatagagacacaaaaaacccttcaaaaaaatcaatga

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_5|178_aa
MRTARERPTLMMQLLLTGSLLRHVEMMLTTIRDLSGDTEPNRISTQGRTFCNWTEQSDDA
VECHFPQTGATLVVLTMKIQRCNIKKHVLLNRQKQTQRLFNLALSMYSSLGHRVPDITQR
PHLNVNNSCECRIFKGNEQSPLLRTESSYFMYEGFKWDRPVCHLPHAADISSSVKQGK

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_5|537_bp
atgagaacagcacgcgaaagacccaccctcatgatgcaattacttctcacagggtccctc
ctacgacatgtggaaatgatgttaactacaattcgagatttgagtggcgacacagagcca
aaccgtatcagcacccaaggaaggacattctgtaactggacagagcagtcagatgatgca
gttgagtgtcattttccacagactggtgctaccttagtggtgcttacaatgaagatacag
agatgcaacataaagaagcatgttttactcaacagacaaaagcaaactcagagattattt
aatctagccctttccatgtacagtagcctagggcaccgtgtccctgacatcacacagagg
ccacatttaaatgtaaataacagctgtgaatgcaggattttcaaaggcaatgagcagagt
ccgctcctcagaacagagtcctcttatttcatgtacgagggcttcaaatgggaccgtcca
gtgtgtcacttgcctcatgctgctgacatttcctcctctgtaaaacaagggaaataa

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_6|163_aa
MAMEKRKHLCTAVYPGAGWFGLGFKENVHELTLRPHWGTLKVAEKERKNVKGNTEVHSLI
EQRSWKDAEISQWCVSLGTRAQQHAVVTHPTWPRKLLNDDVPLSCNFPGPQTVHPGTLSL
PPAPATPPPFFYLQDLIQGPGGPCVTPHNHGSSFLGLSSSSAL

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_6|492_bp
atggctatggagaaaaggaaacacttatgcacagctgtgtatcctggagctgggtggttt
ggcttagggttcaaagaaaatgtccatgaactcaccttgaggccgcactggggcaccctg
aaagtagcagaaaaagaaaggaagaatgtcaaaggaaacactgaggttcacagcctgatt
gaacagagaagctggaaggatgctgagatatcccagtggtgtgtttccttgggtacaagg
gcccagcagcatgctgtggtaactcaccccacatggcccagaaagctcttaaatgatgat
gttcctctgagctgcaattttcctggtccccaaactgttcacccaggaacactgtctctg
cccccagcccctgccacacctccaccttttttctacctgcaagacctaattcaagggcca
ggaggaccttgtgtcacccctcacaatcatggctcctctttccttggcctgtcgtcctcc
tctgctctttga

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_7|259_aa
MLEWSMVGTSSASWGFMDEHRHNDFTDEECEVRREHVTMHFLTIYPNCSSGVVRAQSRTE
QKNPLGLDDLGIQNLGQTVSLAPAVEAASMLKMEPLNSTHPGTAASSSPLESRAAGGGSG
NGNEYFYILVVMSFYGIFLIGIMLGYMKSKRREKKSSLLLLYKDEERLWGEAMKPLPVVS
GLRSVQVPLMLNMLQESVAPALSCTLCSMEGDSVSSESSSPDVHLTIQEEGADDELEETS
ETPLNESSEGSSENIHQNS

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_7|780_bp
atgctggagtggtcgatggttggtacctccagtgccagctggggatttatggatgaacac
aggcacaacgattttacagacgaggaatgtgaggtgcggagagagcacgtgacgatgcac
ttcttgactatatatcccaactgcagcagcggagttgtcagagcgcagagccggacagag
cagaagaaccctcttggactggacgatttgggaattcaaaacttgggacaaactgtcagc
cttgcccctgctgtggaggcagcctcaatgctgaaaatggagcctctgaacagcacgcac
cccggcaccgccgcctccagcagccccctggagtcccgtgcggccggcggcggcagcggc
aatggcaacgagtacttctacattctggttgtcatgtccttctacggcattttcttgatc
ggaatcatgctgggctacatgaaatccaagaggcgggagaagaagtccagcctcctgctg
ctgtacaaagacgaggagcggctctggggggaggccatgaagccgctgcctgtggtgtcg
ggcctgaggtcggtgcaggtgcccctgatgctgaacatgctgcaggagagcgtggcgccc
gcgctgtcctgcaccctctgttccatggaaggggacagcgtgagctccgagtcctcctcc
ccggacgtgcacctcaccattcaggaggagggggcagacgatgagctggaggagacctcg
gagacgcccctcaacgagagcagcgaagggtcctcggagaacatccatcagaattcctag

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_8|84_aa
MRNQNHIVVCLEHSSRSEDICTGRNLIPSFGRVTCIDRNVSRHARHCVKNDSFGVTCGLA
GILSLHHRKGPTQSFTPQSPGYFS

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_8|255_bp
atgaggaatcagaaccatatcgtggtgtgcctggaacatagcagccgctcagaagacatt
tgtacaggaaggaatctaattcccagctttggtcgtgtgacttgcattgaccggaatgtg
agcagacatgccagacactgtgttaaaaatgactctttcggtgtgacttgtggtttggca
ggcatcttgagcctccaccataggaagggacccacccagtccttcacccctcagagcccc
ggttacttctcttga

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_9|65_aa
MNMSQATRTGELATQVGGNPDGDLELMRRTGSHQVVGDKKSTRTPWKTGSHQIAEDEESM
RTPFA

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_9|198_bp
atgaacatgtctcaagccacccgcactggagagctggctacccaggtgggagggaaccca
gatggggatctagagcttatgaggaggacagggagtcaccaggtagttggagacaagaag
tctacaagaacaccttggaagacagggagtcaccagatagctgaagacgaggagtctatg
agaacaccttttgcataa

>gi568815596f:222952831_223153340|GENSCAN_predicted_peptide_10|336_aa
XLLNSWNYRRVQEVLPHNLKVLTNHRKHIHRPHCQPWATGLSPRVSMPRALLAETSVWPV
GLTSYNTINSPLLTSLTFSRSHSITGIHCLRMKTVCSSIKGREREKETQRQRKDRKNINA
GSVRGTLWEGGSVLPLPQEQPTSNGIGREKRISPLTTKWNNEGPSQLLRSGSTPPSKHTH
GNEVLPASQIRSQCGGISCLDIFFHNGPEASPDWELDLLLLPVSPSGAQCLVQSRFSATG
LIALQEEETPESSLFVCTGKRPWENTERRHHLKPRREALPETDPAGTLVLDFQPPKWALP
LPHQAYGLGRALPKPKPISTPASNRPQGLFLGEILV

>gi568815596f:222952831_223153340|GENSCAN_predicted_CDS_10|1011_bp
nngctcctgaatagctggaattacagacgcgtgcaagaagttttgccacataatttaaag
gtgttgacaaatcaccggaaacatattcatagaccacattgccaaccatgggctacaggc
ttgtcacccagggtatccatgccaagggcgctactggcagaaacatcagtgtggccggtg
ggcttaacctcttacaataccattaactctccattgctcaccagccttacgttttcacgc
agccactctatcactggcattcattgcctccgaatgaaaacagtatgttcctcaataaag
ggacgagaaagagagaaagaaacacagagacagagaaaggacagaaagaacattaatgca
ggctctgttagggggactttgtgggaaggaggctcagtcctgcccttgccccaggagcag
cccacatctaatggtattggaagagagaagaggataagccctcttaccactaagtggaat
aatgaagggccatctcagctccttcgctctggctccacacctccttcaaagcacacccat
ggtaacgaagtcctccctgcatcacaaatcagatcacagtgtggaggaatttcctgctta
gacatcttcttccataatggaccggaagcctctccagactgggaactggatctcctcctt
cttcctgtgtctccatctggagcacagtgcctggtacaaagcaggttctctgcaactgga
ctgatagccttacaagaagaagagacaccagagagctctctctttgtgtgcacagggaaa
agaccatgggagaacacagagagaaggcatcatctgaagccgaggagagaggccttgcca
gaaacggaccctgctggcaccttggtcttggactttcagcctccaaaatgggctcttccc
ctcccccaccaagcatatggtctgggcagggctctccccaagcctaagccaatcagcaca
cctgcatcgaaccggccacagggactgtttctgggagagatcctggtataa