GENSCAN 1.0 Date run: 3-Nov-116 Time: 17:51:30 Sequence gi568815584f:61620382_61847082 : 226701 bp : 39.66% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 15771 15937 167 1 2 22 37 365 0.541 21.90 1.02 PlyA + 16185 16190 6 1.05 2.03 PlyA - 17719 17714 6 1.05 2.02 Term - 24610 24563 48 1 0 106 53 59 0.487 0.53 2.01 Init - 31995 31915 81 0 0 64 119 90 0.970 10.72 2.00 Prom - 32054 32015 40 -6.15 3.00 Prom + 37824 37863 40 -3.65 3.01 Init + 48831 48891 61 2 1 56 105 75 0.879 7.46 3.02 Intr + 50321 50448 128 0 2 5 89 107 0.710 1.98 3.03 Intr + 58808 58875 68 1 2 44 92 80 0.388 0.78 3.04 Term + 65368 65488 121 1 1 106 45 81 0.608 2.57 3.05 PlyA + 66591 66596 6 1.05 4.03 PlyA - 66984 66979 6 1.05 4.02 Term - 72011 71947 65 0 2 86 48 63 0.812 -0.83 4.01 Init - 75425 75116 310 2 1 72 -8 289 0.847 13.34 4.00 Prom - 87410 87371 40 -3.65 5.00 Prom + 88823 88862 40 -6.35 5.01 Init + 91947 91988 42 2 0 53 80 42 0.492 0.37 5.02 Intr + 93489 93626 138 2 0 91 30 85 0.530 2.74 5.03 Intr + 98715 98773 59 2 2 68 67 36 0.440 -3.74 5.04 Intr + 100001 100191 191 2 2 49 105 193 0.829 15.41 5.05 Intr + 101128 101273 146 2 2 121 98 141 0.999 17.48 5.06 Intr + 107072 107274 203 1 2 108 94 93 0.998 9.06 5.07 Intr + 112037 112143 107 2 2 78 92 33 0.996 1.64 5.08 Intr + 113757 113904 148 1 1 122 107 81 0.998 11.77 5.09 Intr + 116508 116728 221 0 2 69 113 145 0.999 12.12 5.10 Intr + 117706 117992 287 2 2 96 92 225 0.995 19.84 5.11 Intr + 120124 120246 123 0 0 42 76 91 0.897 3.16 5.12 Intr + 120374 120807 434 1 2 77 63 295 0.998 17.42 5.13 Intr + 124324 124432 109 1 1 76 87 118 0.998 9.87 5.14 Intr + 125310 125436 127 2 1 82 53 91 0.987 4.33 5.15 Term + 126553 126704 152 2 2 86 49 116 0.984 4.59 5.16 PlyA + 127854 127859 6 1.05 6.05 PlyA - 130498 130493 6 1.05 6.04 Term - 130708 130500 209 2 2 37 48 172 0.354 4.62 6.03 Intr - 130955 130810 146 1 2 101 -5 115 0.513 2.51 6.02 Intr - 133649 133476 174 1 0 46 72 147 0.966 7.03 6.01 Init - 135305 135169 137 2 2 73 86 53 0.609 3.26 6.00 Prom - 136380 136341 40 -8.85 7.00 Prom + 139462 139501 40 -6.05 7.01 Init + 142080 142207 128 2 2 92 68 156 0.661 13.84 7.02 Intr + 146495 146654 160 2 1 71 85 20 0.324 -0.93 7.03 Intr + 146831 146971 141 1 0 82 76 89 0.976 6.73 7.04 Intr + 148255 148359 105 0 0 38 84 130 0.980 7.09 7.05 Intr + 155714 155872 159 1 0 61 91 62 0.857 3.06 7.06 Intr + 157691 157759 69 1 0 84 55 69 0.751 1.66 7.07 Intr + 158467 158529 63 0 0 70 111 28 0.705 1.30 7.08 Intr + 161866 162016 151 0 1 5 88 121 0.920 2.61 7.09 Intr + 172426 172521 96 2 0 70 92 91 0.864 6.76 7.10 Intr + 188964 189085 122 1 2 40 93 93 0.926 4.29 7.11 Intr + 189419 189550 132 1 0 48 67 149 0.980 8.72 7.12 Intr + 191554 191943 390 0 0 66 85 238 0.853 15.29 7.13 Intr + 192723 192935 213 2 0 72 61 187 0.457 12.39 7.14 Term + 194854 194865 12 0 0 123 48 3 0.407 -2.97 7.15 PlyA + 197691 197696 6 1.05 8.04 PlyA - 197752 197747 6 1.05 8.03 Term - 202305 202090 216 0 0 98 37 148 0.394 6.86 8.02 Intr - 206871 206652 220 2 1 67 53 93 0.413 1.08 8.01 Init - 211988 211543 446 2 2 59 51 607 0.248 49.63 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 188349 188430 82 2 1 86 64 70 0.891 5.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_1|55_aa XWMTVEEEEEEEEEEEEERRKKKKEKEKKRKKKKKEKEKKRKKKKKKKPKMNIIN >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_1|168_bp ncttggatgacagtggaagaagaagaagaggaagaggaagaagaagaagaagaaagaaga aagaagaagaaggaaaaggagaagaagaggaagaagaagaagaaggaaaaggagaagaag aggaagaagaagaagaagaagaaacccaaaatgaacatcataaattaa >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_2|42_aa MSSPSDGYCVQKPKEYAESDILYGMLLMPEKHPAPLDAGALF >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_2|129_bp atgagttcacctagtgatggatactgtgtacagaaaccaaaagaatatgcagagagtgat attctgtatggaatgctgctgatgcctgaaaaacaccctgctcctctagatgctggggcg cttttctga >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_3|125_aa MKQGIEMITFAPTNNTLAAADRETRAESRDKLLAAGETSLPRAKEIRMGADAGTFVKRNA GSQLNAGWSSAYTQLLASAMFMFYTRNVISYGTVAFKNLPIDFDVGEIPNIIFLSSQNIV NGMFY >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_3|378_bp atgaaacaaggaattgaaatgatcacatttgcacctacaaacaacactctggcagctgct gatagggagacaagagcagagagcagagataagctgctggcagcaggagagacatctctt cctagagcgaaagagataaggatgggagcagatgcaggtacatttgtcaagaggaatgca ggaagtcaactaaatgccggctggagctctgcttacacgcagctgctggcctctgctatg ttcatgttttacaccagaaatgtaatatcttatggaactgtggcttttaagaatcttcca attgactttgatgttggagaaataccaaatatcattttcctttctagtcagaatatagtg aatgggatgttttattaa >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_4|124_aa MVNRSPRCLHGGRPPGSLRPKRWLPPHAEKRRKGKSRGGGARRETPRETREKRAAIAGVR ARLRASPQVACQGPARHTGRSDEEGDPPVPSDEAALCTEELRQPIHNENNGSKRVLFGRK VRTR >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_4|375_bp atggtgaatcggtccccgcgatgtcttcacggcgggcggcccccaggctcgctccggcct aagcgctggctccctccacacgcggagaagagaaggaaaggcaagtccagaggtgggggt gcgaggcgggaaacccctcgtgagactagagagaagcgggcggcaatcgccggggtccgg gcccggctccgagcctctcctcaggtggcttgtcagggcccagcccgacacactggccga agcgacgaagagggtgatcctcctgtcccctcagacgaggcagcactgtgcactgaggag ctgaggcagcctattcataacgagaacaatggcagcaaaagggtattgtttggaaggaaa gtgcgtacaagataa >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_5|828_aa MSKGGKVVEGEIKEVDSVRIELKDTQLVSAAELIAHLVVERTPPLPIGLQKLSSVLLIAV IIWKEERFGVETIRYFIHRMISSERRKEKSRDAARSRRSKESEVFYELAHQLPLPHNVSS HLDKASVMRLTISYLRVRKLLDAGDLDIEDDMKAQMNCFYLKALDGFVMVLTDDGDMIYI SDNVNKYMGLTQVLHCTGHIHVYDTNSNQPQCGYKKPPMTCLVLICEPIPHPSNIEIPLD SKTFLSRHSLDMKFSYCDERITELMGYEPEELLGRSIYEYYHALDSDHLTKTHHDMFTKG QVTTGQYRMLAKRGGYVWVETQATVIYNTKNSQPQCIVCVNYVVSGIIQHDLIFSLQQTE CVLKPVESSDMKMTQLFTKVESEDTSSLFDKLKKEPDALTLLAPAAGDTIISLDFGSNDT ETDDQQLEEVPLYNDVMLPSPNEKLQNINLAMSPLPTAETPKPLRSSADPALNQEVALKL EPNPESLELSFTMPQIQDQTPSPSDGSTRQSSPEPNSPSEYCFYVDSDMVNEFKLELVEK LFAEDTEAKNPFSTQDTDLDLEMLAPYIPMDDDFQLRSFDQLSPLESSSASPESASPQST VTVFQQTQIQEPTANATTTTATTDELKTVTKDRMEDIKILIASPSPTHIHKETTSATSSP YRDTQSRTASPNRAGKGVIEQTEKSHPRSPNVLSVALSQRTTVPEEELNPKILALQNAQR KRKMEHDGSLFQAVGIGTLLQQPDDHAATTSLSWKRVKGCKSSEQNGMEQKTIILIPSDL ACRLLGQSMDESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRALDQVN >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_5|2487_bp atgagcaaagggggcaaagtggtagaaggtgagatcaaagaggtagatagtgttagaatt gaattgaaggacacccagttggtgtccgctgcagaactgattgctcacctggtggtggag agaacccctcctctcccgatagggttgcagaagttgtcttctgtgttgttgattgctgtg atcatttggaaggaggagagatttggggtggagacaattcggtacttcattcacaggatg ataagttctgaacgtcgaaaagaaaagtctcgagatgcagccagatctcggcgaagtaaa gaatctgaagttttttatgagcttgctcatcagttgccacttccacataatgtgagttcg catcttgataaggcctctgtgatgaggcttaccatcagctatttgcgtgtgaggaaactt ctggatgctggtgatttggatattgaagatgacatgaaagcacagatgaattgcttttat ttgaaagccttggatggttttgttatggttctcacagatgatggtgacatgatttacatt tctgataatgtgaacaaatacatgggattaactcaggtattgcactgcacaggccacatt cacgtatatgataccaacagtaaccaacctcagtgtgggtataagaaaccacctatgacc tgcttggtgctgatttgtgaacccattcctcacccatcaaatattgaaattcctttagat agcaagactttcctcagtcgacacagcctggatatgaaattttcttattgtgatgaaaga attaccgaattgatgggatatgagccagaagaacttttaggccgctcaatttatgaatat tatcatgctttggactctgatcatctgaccaaaactcatcatgatatgtttactaaagga caagtcaccacaggacagtacaggatgcttgccaaaagaggtggatatgtctgggttgaa actcaagcaactgtcatatataacaccaagaattctcaaccacagtgcattgtatgtgtg aattacgttgtgagtggtattattcagcacgacttgattttctcccttcaacaaacagaa tgtgtccttaaaccggttgaatcttcagatatgaaaatgactcagctattcaccaaagtt gaatcagaagatacaagtagcctctttgacaaacttaagaaggaacctgatgctttaact ttgctggccccagccgctggagacacaatcatatctttagattttggcagcaacgacaca gaaactgatgaccagcaacttgaggaagtaccattatataatgatgtaatgctcccctca cccaacgaaaaattacagaatataaatttggcaatgtctccattacccaccgctgaaacg ccaaagccacttcgaagtagtgctgaccctgcactcaatcaagaagttgcattaaaatta gaaccaaatccagagtcactggaactttcttttaccatgccccagattcaggatcagaca cctagtccttccgatggaagcactagacaaagttcacctgagcctaatagtcccagtgaa tattgtttttatgtggatagtgatatggtcaatgaattcaagttggaattggtagaaaaa ctttttgctgaagacacagaagcaaagaacccattttctactcaggacacagatttagac ttggagatgttagctccctatatcccaatggatgatgacttccagttacgttccttcgat cagttgtcaccattagaaagcagttccgcaagccctgaaagcgcaagtcctcaaagcaca gttacagtattccagcagactcaaatacaagaacctactgctaatgccaccactaccact gccaccactgatgaattaaaaacagtgacaaaagaccgtatggaagacattaaaatattg attgcatctccatctcctacccacatacataaagaaactactagtgccacatcatcacca tatagagatactcaaagtcggacagcctcaccaaacagagcaggaaaaggagtcatagaa cagacagaaaaatctcatccaagaagccctaacgtgttatctgtcgctttgagtcaaaga actacagttcctgaggaagaactaaatccaaagatactagctttgcagaatgctcagaga aagcgaaaaatggaacatgatggttcactttttcaagcagtaggaattggaacattatta cagcagccagacgatcatgcagctactacatcactttcttggaaacgtgtaaaaggatgc aaatctagtgaacagaatggaatggagcaaaagacaattattttaataccctctgattta gcatgtagactgctggggcaatcaatggatgaaagtggattaccacagctgaccagttat gattgtgaagttaatgctcctatacaaggcagcagaaacctactgcagggtgaagaatta ctcagagctttggatcaagttaactga >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_6|221_aa MRESLEFPKDWLNGCDQNAHSPLLGFGRRRSPIFLAIPTCVELSSRGTAEKKIIHELTPV KACTVGDIVMTHRCLRNEPLTLQAAGSAAEDGLQLSALSGDCLRPGAARASRMRSHTYAQ RAAATPQRTRKSPCAAGALEPTAGAAGGARDCAPATPGTEGCRVPSACLVFSPRGSDRCR SQAPVFPLRPRMEPRQGNKRNSIVHGDRVSGQRLGKTIHYR >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_6|666_bp atgagggaaagtttggaatttcctaaagactggttaaatggttgtgaccaaaatgctcat agtcctctactgggatttgggagaagaaggagccccatcttcttggccatacccacctgt gtggaactttcctcaagaggtacagctgaaaaaaagatcatccatgagttgacccctgtt aaagcttgcactgtgggtgatattgtgatgacacacagatgtctcaggaatgaaccactt actctccaagctgctggaagtgcagcagaagatggtcttcagctgtcagccctttctggg gattgtctcaggccaggagcggccagggcctcccgcatgcgctcccacacgtacgcgcag cgggcagcagccaccccacagcgtacacgaaagtcgccttgcgcggctggagccctggag cccacggccggggccgcgggcggggcccgggactgcgcgccagccacaccaggaacagaa gggtgccgggtaccttccgcatgcttggtattctccccgcggggctctgaccgctgccgc tctcaggcacctgtctttcctctccgtcccagaatggagccaagacaagggaataaacga aattcaatagtacacggagatcgggtgtctgggcagcgtcttggaaaaactatccactac aggtaa >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_7|646_aa MGTPPGLQTDCEALLSRFQETDSVRFEDFTELWRNMKFGTIFCGRMRNLEKNMFTKEALA LAWRYFLPPYTFQIRVGALYLLYGLYNTQLCQPKQKIRVALKDWDEVLKFQQDLVNAQHF DAAYIFRKLRLDRAFHFTAMPKLLSYRMKKKIHRAEVTEEFKDPSDRVMKLITSDVLEEM LNVHDHYQNMKHVISVDKSKPDKALSLIKDDFFDNIKNIVLEHQQWHKDRKNPSLKSKTN DGEEKMEGNSQETERCERAESLAKIKSKAFSVVIQASKSRRHRQVKLDSSDSDSASGQGQ VKATRKKEKKERLKPAGRKMSLRNKGNVQNIHKEDKPLSLSMPVITEEEENESLSGTDLG DFVLQQPYCRGGKWGVQVLMGVVEMLNQPRVDPLIYNKSNRKVKEKKSDLKGLWCGGEDR VVAATHTHGCGGLTSSEVQTEERPSLVSGRAVVGYDHVSVAHPVDVQSVSWPFIHSFMQA SSYKDIHSPTQQKSAESLLRAGNYFHIGDKTEGDMKSTAHLEPTEMALSGFAGELLPRRI NAQHLAHPPHTPSSSPALLLPAGAEHGVPTPLAQSQPSASRANEGTARFWRCEFKVRLEF WVGFVRLPFGDQGGLHGSAARWPTGCIRSGQHKAGWRPHLVTMVRK >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_7|1941_bp atggggactcctcccggcctgcagaccgactgcgaggcgctgctcagccgcttccaggag acggacagtgtacgcttcgaggacttcacggagctctggagaaacatgaagttcgggact atcttctgtggcagaatgagaaatttagaaaagaacatgtttacaaaagaagctttagct ttggcttggcgatattttttacctccatacaccttccagatcagagttggtgctttgtat ctgctatatggattatataatacccaactgtgtcaaccaaaacaaaagatcagagttgcc ctgaaggattgggatgaagttttaaaatttcagcaagatttagtaaatgcacagcatttt gatgcagcttatatttttaggaagctacgactagacagagcatttcactttacagcaatg cccaaattgctgtcatataggatgaagaaaaaaattcaccgagctgaagttacagaagaa tttaaggacccaagtgatcgtgtgatgaaacttatcacttctgatgtattagaggaaatg ctgaatgttcatgatcattatcagaacatgaaacatgtaatttcagttgataagtccaag ccagataaagccctcagcttgataaaggatgatttttttgacaatattaagaacatagtt ttggagcatcagcagtggcacaaagacagaaagaatccatccttaaagtcaaaaactaat gatggagaagaaaaaatggaaggaaattcacaagaaacggagagatgtgaaagggcagaa tcattagcgaaaataaaatcaaaggccttttcagttgtcatacaggcatccaaatcaaga aggcatcgtcaagtcaaactcgactcttctgactctgattctgcatctggtcaagggcaa gtcaaagcaactaggaaaaaagagaagaaagaaagattgaaaccagcaggaaggaagatg tctctcagaaacaaaggcaatgtgcagaatatacacaaggaagataaacctttaagtctg agtatgcctgtaattacagaagaagaagagaatgaaagtttgagtggaacagatttggga gattttgtgctgcagcagccttactgccgaggaggaaagtggggagtccaagttcttatg ggcgttgtggagatgctaaatcaaccacgggtggatcccttgatatataacaagagcaac aggaaagtcaaggagaaaaagtcagacctgaaggggctttggtgtggaggagaagacaga gtggtggctgccacccacactcatggctgtggaggtctgacgagctctgaggtccagaca gaagagcgtccatccctagtcagtggcagggctgtagttggctatgatcatgtatcagtt gctcatccagtggacgtccaatcagttagttggccattcattcattcattcatgcaggca tccagttataaagacatccattcacccactcaacagaaatctgctgagagcctactacgt gctgggaactattttcacatcggagacaaaacagagggtgacatgaagagcacggcccac ttggagcctacggagatggcgttgagcggtttcgcaggagagctcctgccccggaggatt aatgcccagcacctggctcatcctccccacacccctagctcttctcccgctttgctcctt ccagctggggctgagcatggggtcccaaccccgctggcccagagccagccttcagcgagc cgggcaaacgaaggcactgcgcgtttttggcgctgtgagttcaaggttcgcctcgagttc tgggtcgggtttgtgcgtctgccttttggggatcagggtggcttgcatggctcggctgcc cggtggcccacgggctgtattcgcagcgggcagcacaaggccggctggagaccgcatctg gttaccatggtgaggaagtga >gi568815584f:61620382_61847082|GENSCAN_predicted_peptide_8|293_aa MLATRVVSLVGKRAISTLVSVRAHGNVVKSDDYALPAYVDRRDYPVPDVAHVKHLSARQK AVKKKEKASWSNRSTDGKVELYHIQFKESFAEMNRGVNEWKTVVGAAMFFLGFTAFIIIW EKRCVYGPIPHTFDKEWVPMQTKRMLDMRRWGTEIGLKTGELAERMYREPSLHQIQQPTM KPEDYSLPKCHGLNYVPQNSYVEALTPLPPNTTVFRDGAFKESSPQQPLIQMDFPNSSSN TKNYPSSAIVSTVQQLSGWKEENNFFSDGGSYFRLYHLQWLTVGVNLTGLKDA >gi568815584f:61620382_61847082|GENSCAN_predicted_CDS_8|882_bp atgttggctaccagggtagttagcctagttggcaagcgagcaatttccaccttggtgtct gtacgagcacacggaaatgttgtgaagagcgatgactatgcgctcccagcttatgtggat cgacgtgactatcccgtacccgatgtggcccatgtcaagcacctgtctgccagacagaaa gccgtgaagaagaaggagaaggcctcctggagcaaccgctccacggatgggaaagtcgag ttgtatcacattcagttcaaggagagctttgctgagatgaacaggggcgtgaacgagtgg aagacggttgtgggcgctgccatgttcttccttggcttcacggcgttcattatcatctgg gagaagcgctgtgtgtacggccccatcccgcacacctttgacaaagagtgggtgcccatg cagaccaagaggatgctggacatgagaaggtggggaacagagatagggctgaaaacagga gaactggcagaaaggatgtacagagagccaagcctccaccagatccagcagccaacaatg aagccagaggactattcccttcctaagtgtcatggattgaattatgttccccaaaattca tatgttgaggccctgactccactacctccgaatacaactgtatttagagatggggccttt aaagagagctcaccacagcagcctcttattcagatggacttccccaatagcagctcaaac accaaaaattacccttccagcgccattgtttctacagtgcaacagctctcaggttggaaa gaagagaacaatttcttctctgacggaggcagttatttcaggctttaccacctgcaatgg ttaacagtaggtgtcaacttgactggattgaaggatgcctag