GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:53:37 Sequence gi568815584f:96137108_96341501 : 204394 bp : 44.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6575 6614 40 -0.96 1.01 Init + 7929 8064 136 2 1 87 92 27 0.704 3.31 1.02 Term + 11411 11523 113 0 2 63 54 74 0.640 0.22 1.03 PlyA + 13595 13600 6 1.05 2.03 PlyA - 13611 13606 6 1.05 2.02 Term - 14413 14259 155 2 2 71 53 104 0.061 3.28 2.01 Init - 24285 24192 94 0 1 63 89 31 0.052 1.24 2.00 Prom - 26774 26735 40 -4.76 3.00 Prom + 27250 27289 40 -5.26 3.01 Init + 36783 36841 59 2 2 76 116 43 0.979 6.93 3.02 Intr + 37283 37382 100 2 1 47 81 74 0.960 2.71 3.03 Intr + 38643 38957 315 2 0 64 97 124 0.865 7.16 3.04 Intr + 41860 41973 114 0 0 78 80 28 0.613 1.64 3.05 Intr + 46661 46809 149 1 2 72 51 96 0.017 3.33 3.06 Intr + 52453 52598 146 1 2 56 53 81 0.005 1.33 3.07 Intr + 70457 70613 157 0 1 46 115 95 0.049 7.07 3.08 Intr + 77328 77446 119 0 2 56 85 19 0.024 -1.49 3.09 Term + 79488 79759 272 1 2 35 47 258 0.802 12.05 3.10 PlyA + 80377 80382 6 1.05 4.00 Prom + 83887 83926 40 -4.66 4.01 Sngl + 86001 86240 240 2 0 70 48 249 0.937 14.18 4.02 PlyA + 87166 87171 6 1.05 5.00 Prom + 97245 97284 40 -4.56 5.01 Init + 97706 97752 47 1 2 85 40 73 0.031 2.36 5.02 Intr + 103296 104393 1098 0 0 102 53 1931 0.003 180.05 5.03 Intr + 108772 108824 53 1 2 -9 73 168 0.026 4.15 5.04 Intr + 109325 109352 28 0 1 83 80 43 0.012 0.17 5.05 Intr + 126566 127541 976 2 1 125 47 998 0.235 90.36 5.06 Term + 132366 132527 162 2 0 69 38 106 0.158 1.64 5.07 PlyA + 132787 132792 6 1.05 6.20 PlyA - 136890 136885 6 1.05 6.19 Term - 148878 148648 231 0 0 124 45 174 0.994 13.17 6.18 Intr - 152741 152549 193 1 1 43 78 146 0.337 8.59 6.17 Intr - 154991 154922 70 0 1 123 47 66 0.301 4.24 6.16 Intr - 158453 158375 79 2 1 71 62 45 0.246 -0.68 6.15 Intr - 165001 164900 102 1 0 79 72 56 0.868 3.47 6.14 Intr - 166148 165954 195 2 0 65 95 48 0.738 2.81 6.13 Intr - 168708 168559 150 0 0 55 5 183 0.623 7.06 6.12 Intr - 169809 169607 203 1 2 68 90 114 0.998 8.60 6.11 Intr - 172487 172346 142 2 1 56 95 134 0.996 10.83 6.10 Intr - 174180 174010 171 0 0 80 87 43 0.803 3.54 6.09 Intr - 180210 180038 173 1 2 53 100 45 0.806 1.86 6.08 Intr - 180748 180591 158 0 2 97 45 65 0.860 2.75 6.07 Intr - 185147 185005 143 2 2 63 95 54 0.973 2.75 6.06 Intr - 185628 185433 196 2 1 46 98 110 0.941 7.22 6.05 Intr - 186891 186789 103 1 1 51 113 75 0.936 5.53 6.04 Intr - 188815 188542 274 1 1 74 91 148 0.993 10.81 6.03 Intr - 191428 191240 189 1 0 90 119 19 0.866 4.98 6.02 Intr - 191637 191567 71 1 2 45 110 -31 0.460 -6.30 6.01 Intr - 196766 196581 186 0 0 75 91 133 0.795 11.96 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 77302 77446 145 0 1 71 85 49 0.811 3.18 S.002 Sngl + 103303 104397 1095 0 0 95 53 1903 0.913 184.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_1|82_aa MALVFFKSISCYQHGEELRITTLEEGSKEEETTSLWPNMRPPAPTATQWGYQALSLCWKM SAESYDVTCLQVFQPWIPVPTL >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_1|249_bp atggcattggtattttttaaatctatcagctgctaccagcatggagaggagctgagaatc actaccctagaagaaggtagcaaagaagaggaaactactagcctctggccaaacatgaga ccccctgctcccacagccacccagtggggataccaggctctgagcttgtgctggaagatg tctgcagagtcctatgatgtgacctgtcttcaggtcttccagccgtggataccagtgccc actctgtga >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_2|82_aa MVEATTLMVSESPQNQTPPKQCRVTHLSGYPGQGQTPQRGFPGPPGSTPACFPSFTFTKQ HDLSVHEGSLAKRHVVDKGGGQ >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_2|249_bp atggttgaggcaactacactgatggtctcagagtctccccagaatcagacacccccaaaa cagtgccgtgtaacccatctcagtggatatccagggcaaggccaaactccacagcgcggc tttccgggccctccgggatccactcctgcctgcttccccagcttcaccttcaccaagcag catgatctaagtgtccacgaaggctcccttgccaagcgacacgtggtggacaagggggga ggccagtga >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_3|476_aa MGQRPLQAQGSWWRLGSGERGLLGEKEDPAEETDRQRLVRQEAWRGKSFRKDKTLANASL MAATKFAPEIEALTHWCARAPHGMRQKLGSNQTTCLAFSTCCILLPLPPFSWEYSPVNHL SKNPTSGSALRSMNFHVPRTMMGVVVEKEPVNPVLTISMLSNTNDKSKQVVCLQSSRPAP KLQLGETHRALDPPVQLEPGHGSDPTLITEMTKVTWRIMECHHPDKLNGSLQDCYVRARD CKPVLTLGSQAGFSGRSDTSAKVARGPSWGRPIVSALANVEVLEGQGGADGHRTKCPSPR VNPDENYGLWVMMMCQRRFINYNKLTALVGDVSNGGGYVCVGTGDICPPAGDNAPYPTSG DLGSDRRPHTYSPSLSDRQSWDLNTPAWATEQDPVSTKEKEERRGRKKEEKRKKKEEEEE GGGRRRKRKEEEEGGGGGRRRRRRKEEEEEEEEVGEEEGGGGGGDPTALFGRLWEN >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_3|1431_bp atgggtcagcggccgctgcaggcccaggggagctggtggcgcctgggctcaggggaaagg ggtcttcttggcgagaaggaagatccagcagaggagacagacaggcagcggctcgtgagg caggaagcctggagagggaagagcttccggaaagacaagaccttggccaatgccagcctc atggctgccacaaagtttgccccggagatagaggcactaactcattggtgcgccagagcc ccccatgggatgaggcagaagctgggctccaaccagaccacctgcttagctttttccacc tgctgtatcctgcttcctttgccccccttttcatgggagtactccccagtaaatcatctg agcaagaatccaacctcaggctccgctttaagatctatgaattttcatgtgccaagaacc atgatgggggtggtggtggagaaagagccagtgaatccagtactaacaatatcgatgctg agcaacacaaatgataaatccaagcaagttgtttgcctgcagagttcccggcctgctcca aagttgcagctgggggaaacccacagggctctagacccacccgtgcagctggaacctgga catggcagtgacccaactctaatcacagaaatgaccaaggtgacctggagaataatggag tgccaccaccctgacaagctgaatggctcgctccaggactgttacgtgagagcgagagat tgcaagccagtcctcacattgggcagtcaggcaggcttctctgggagaagtgacacttca gccaaagtggccaggggaccgagctgggggaggcccatcgtcagtgccctagcgaatgtg gaagtgctcgagggacagggaggggccgacggccatcgcaccaaatgtccatcaccaaga gtgaaccctgatgaaaactatggactttgggtgatgatgatgtgtcaacgcaggttcatc aattacaacaaattgactgctcttgtgggggatgtcagtaacgggggaggctacgtatgt gtaggcacaggggatatatgccctcctgccggggacaatgctccttatcccacaagcggg gacctgggctctgacaggaggccacacacttacagtccctcccttagtgaccggcagagc tgggatttgaacacccctgcctgggcaacagagcaagatcctgtttcaacaaaagaaaaa gaagagagaagaggaagaaagaaggaggagaagaggaagaagaaggaggaggaggaggaa ggaggaggaaggaggaggaagaggaaggaggaggaggaaggaggaggaggaggaaggagg aggaggaggaggaaggaggaggaggaggaggaggaggaagtaggagaagaagaaggagga ggaggaggaggagatcctacagcactgtttggaagattatgggaaaattag >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_4|79_aa MSHKQIHYSDRYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHNMIH ESEPHILLFRRPPPKKPKK >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_4|240_bp atgtcacacaaacaaattcactattcggacagatacgacgacgaggagtttgagtatcga catgtcatgctgcccaaggacatagccaagctggtccctaaaacccatttgatgtctgaa tctgaatggaggaatcttggcgttcagcagagtcagggatgggtccataatatgatccat gaatcagaacctcacatcttgctgttccggcgcccaccacccaagaagccaaagaaatga >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_5|787_aa MEASEETLDTSPMTELADMLNVTLQGPTLNGTFAQSKCPQVEWLGWLNTIQPPFLWVLFV LATLENIFVLSVFCLHKSSCTVAEIYLGNLAAADLILACGLPFWAITISNNFDWLFGETL CRVVNAIISMNLYSSICFLMLVSIDRYLALVKTMSMGRMRGVRWAKLYSLVIWGCTLLLS SPMLVFRTMKEYSDEGHNVTACVISYPSLIWEVFTNMLLNVVGFLLPLSVITFCTMQIMQ VLRNNEMQKFKEIQTERRATVLVLVVLLLFIICWLPFQISTFLDTLHRLGILSSCQDERI IDVITQIASFMAYSNSCLNPLVYVIVGKRFRKKSWEVYQGVCQKGGCRSEPIQMENSMGT LRTSISVERQIHKLQDWAGSRHFDDDDDDDDDDDGGGVGTSWRNEKKSRSLCMASSWPPL ELQSSNQSQLFPQNATACDNAPEAWDLLHRVLPTFIISICFFGLLGNLFVLLVFLLPRRQ LNVAEIYLANLAASDLVFVLGLPFWAENIWNQFNWPFGALLCRVINGVIKANLFISIFLV VAISQDRYRVLVHPMASRRQQRRRQARVTCVLIWVVGGLLSIPTFLLRSIQAVPDLNITA CILLLPHEAWHFARIVELNILGFLLPLAAIVFFNYHILASLRTREEVSRTRCGGRKDSKT TALILTLVVAFLVCWAPYHFFAFLEFLFQVQAVRGCFWEDFIDLGLQLANFFAFTNSSLN PVIYVFVGRLFRTKLRPLPKKGLTAGLQSAQCSRSEVQVSESHMPLFQSSLHHRMGLADH RNSASHL >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_5|2364_bp atggaggccagcgaggaaacgctggacacgtcccccatgactgagctcgccgacatgctc aatgtcaccttgcaagggcccactcttaacgggacctttgcccagagcaaatgcccccaa gtggagtggctgggctggctcaacaccatccagccccccttcctctgggtgctgttcgtg ctggccaccctagagaacatctttgtcctcagcgtcttctgcctgcacaagagcagctgc acggtggcagagatctacctggggaacctggccgcagcagacctgatcctggcctgcggg ctgcccttctgggccatcaccatctccaacaacttcgactggctctttggggagacgctc tgccgcgtggtgaatgccattatctccatgaacctgtacagcagcatctgtttcctgatg ctggtgagcatcgaccgctacctggccctggtgaaaaccatgtccatgggccggatgcgc ggcgtgcgctgggccaagctctacagcttggtgatctgggggtgtacgctgctcctgagc tcacccatgctggtgttccggaccatgaaggagtacagcgatgagggccacaacgtcacc gcttgtgtcatcagctacccatccctcatctgggaagtgttcaccaacatgctcctgaat gtcgtgggcttcctgctgcccctgagtgtcatcaccttctgcacgatgcagatcatgcag gtgctgcggaacaacgagatgcagaagttcaaggagatccagacagagaggagggccacg gtgctagtcctggttgtgctgctgctattcatcatctgctggctgcccttccagatcagc accttcctggatacgctgcatcgcctcggcatcctctccagctgccaggacgagcgcatc atcgatgtaatcacacagatcgcctccttcatggcctacagcaacagctgcctcaaccca ctggtgtacgtgatcgtgggcaagcgcttccgaaagaagtcttgggaggtgtaccaggga gtgtgccagaaagggggctgcaggtcagaacccattcagatggagaactccatgggcaca ctgcggacctccatctccgtggaacgccagattcacaaactgcaggactgggcagggagc agacactttgatgatgatgatgatgatgatgatgatgatgatggtggtggtgttggcaca agctggcgcaatgagaagaagtccaggtcactgtgcatggcatcatcctggccccctcta gagctccaatcctccaaccagagccagctcttccctcaaaatgctacggcctgtgacaat gctccagaagcctgggacctgctgcacagagtgctgccaacatttatcatctccatctgt ttcttcggcctcctagggaacctttttgtcctgttggtcttcctcctgccccggcggcaa ctgaacgtggcagaaatctacctggccaacctggcagcctctgatctggtgtttgtcttg ggcttgcccttctgggcagagaatatctggaaccagtttaactggcctttcggagccctc ctctgccgtgtcatcaacggggtcatcaaggccaatttgttcatcagcatcttcctggtg gtggccatcagccaggaccgctaccgcgtgctggtgcaccctatggccagccggaggcag cagcggcggaggcaggcccgggtcacctgcgtgctcatctgggttgtggggggcctcttg agcatccccacattcctgctgcgatccatccaagccgtcccagatctgaacatcaccgcc tgcatcctgctcctcccccatgaggcctggcactttgcaaggattgtggagttaaatatt ctgggtttcctcctaccactggctgcgatcgtcttcttcaactaccacatcctggcctcc ctgcgaacgcgggaggaggtcagcaggacaaggtgcgggggccgcaaggatagcaagacc acagcgctgatcctcacgctcgtggttgccttcctggtctgctgggccccttaccacttc tttgccttcctggaattcttattccaggtgcaagcagtccgaggctgcttttgggaggac ttcattgacctgggcctgcaattggccaacttctttgccttcactaacagctccctgaat ccagtaatttatgtctttgtgggccggctcttcaggaccaagctcaggcccctgcccaag aagggcctcaccgcggggctgcagagtgcacagtgcagccggtcagaggtccaggtctct gagtcacacatgccactgttccagtcctcactccaccaccgcatgggcctggccgaccac cgaaactcggcatcccatctgtaa >gi568815584f:96137108_96341501|GENSCAN_predicted_peptide_6|1009_aa XNSSKIGLANKDRKNRPMQQEDEYRIQMELNRYYLRKDSLSVGVSSEQSFYETETARTPS SREETGSHSPVCLQLHYKHSENRGPQGNQARLSSVPHKAELQIKLNPVCCELDISIVDRL NSLLQPQKLATVEMMASHMYTSYNKHISLHKAFTEVFLDDSHSPANCRISVQVATPALNL SVRFPIPDLRSDQERGPWFKKSLQKEILYLAFTDLEFKTEFIGGSTPEQIKLELTFRELI GSFQEEKGDPSIKFFHVSSGVDGDTTSSDDFDWPRIVLKINPPAMHSILERIAAEEEEEN DGHYQEEEEGGAHSLKDVCDLRRPAPSPFSSRRVMFENEQMVMPGDPVEMTEFQDKAISN SHYVLELTLPNIYVTLPNKSFYEKLYNRIFNDLLLWEPTAPSPVETFENISYGIGLSVAS QLINTFNKDSFSAFKSAVHYDEESGSEEETLQYFSTVDPNYRSRRKKKLDSQNKNSQSFL SVLLNINHGLIAVFTDVKTEPRFELHCSSDVVHIRTCSDSCAALMNLIQYIASYGDLQTP NKADMKPGAFQRRSKVDSSGRSSSRGPVLPEADQQMLRDLMSDAMEEIDMQQGTSSVKPQ ANGVLDEKSQIQEPCCSDLFLFPDESGNVSQESGPTYASFSHHFISDAMTGVPTENDDFC ILFAPKAAMQEKEEEPVIKIMVDDAIVIRDNYFSLPVNKTDTSKAPLHFPIPVIRYVVKE VKFQHEVYPPCKPDCDSSLSEHPVSRQVFIVQDLEIRDRLATSQMNKFLYLYCSKEMPRK AHSNMLTVKALHVCPESGRSPQECCLRVSLMPLRLNIDQDALFFLKDFFTSLSAEVELQM TPDPEENLDSRQKFPFDLIIMANMYQWIRLSSNSFLPFDKMIQAAAETAYDMVSPGTLSI EPKKTKRFPHHRLAHQPVDLREGVAKAYSVVKEGITDTAQTIYETAAREHESRGVTGAVG EVLRQIPPAVVKPLIVATEATSNVLGGMRNQIRPDVRQDESQKWRHGDD >gi568815584f:96137108_96341501|GENSCAN_predicted_CDS_6|3030_bp naaaattctagcaaaatagggttagctaataaagataggaaaaatcgacccatgcagcag gaagacgagtatcgaattcagatggaattaaaccggtattatttgagaaaagattccctc tctgtgggtgtatcttcagagcaaagcttttatgagacagaaacagctcgtacaccttct agccgtgaagaaactggttcccattcccctgtgtgtcttcagcttcattataagcattct gagaatagagggccccagggtaatcaagcaagacttagttcagttcctcacaaggcagaa ttgcaaattaaattaaatccagtgtgttgtgagctggatatcagtattgtggacaggtta aattccttgcttcaaccacagaaacttgccacagtagagatgatggcatcccacatgtat acttcatataataaacatattagtctgcacaaggctttcactgaagtgtttctagatgat tcacatagtcctgcaaattgtcggatatcagtacaagttgccacaccagcattaaacctt tctgttcgcttcccaatacctgatcttcgatctgatcaagaaagaggaccatggtttaag aagtcacttcagaaggagatcctttatttagccttcacagatctagaatttaagactgaa tttataggaggatcaaccccagaacaaattaaattggaacttacctttagagaactaatt ggatcgttccaggaagagaaaggagatccatctattaagtttttccatgtgtctagtgga gtagatggagatacaacatcgtcagatgactttgactggccacgaattgtactgaaaata aatccaccagccatgcattccattttggagagaattgcagctgaagaagaagaggagaat gatggtcactaccaggaggaagaggaaggaggtgctcattccttgaaagatgtttgtgat ctaagaagaccagccccatctcctttttcttctcgtagagtaatgtttgaaaatgaacag atggtgatgccaggagaccctgtagaaatgacagaatttcaggataaagcaatcagcaat tctcactatgtgctggaacttacgttaccaaatatttatgtaacactacctaataagagc ttttatgagaagctttataataggatctttaatgacttgctactgtgggaaccaacagct ccttcaccagtggagacattcgagaatatttcctatggcattgggctttcagtagccagt cagctcattaatactttcaacaaagatagttttagtgcatttaaatctgcagttcactat gatgaggaaagtggatctgaggaggagactttgcagtatttttccactgttgatcccaac tatcgttctcgcaggaaaaaaaaattagactctcagaacaagaactctcagagttttctc tcagttcttctgaatattaatcatggattaatagcagtgttcacagatgtgaagactgag ccccgctttgagttacactgttccagcgatgttgtccatatcagaacgtgctcagactct tgtgctgcgttaatgaatctcattcagtacattgcaagctatggtgacttgcagacacct aacaaggcagatatgaagcctggagcctttcaaagaaggtctaaggtagattccagtggt cgatcatcctcacgtggtccagtacttcctgaagcagatcaacaaatgttacgagatctg atgagtgatgctatggaggagatcgacatgcaacaaggcacctcgtcagtaaaaccacag gctaatggtgttttggatgaaaaatctcaaattcaggagccatgttgttcagacctcttc ctgtttcctgacgagagtgggaatgtatcccaggagtccggccccacctatgcctcattc tctcaccatttcatcagtgatgcaatgacaggtgtgcccactgagaatgatgacttttgc attctttttgcaccaaaagcagccatgcaggagaaggaagaagaaccagttataaaaatc atggttgatgatgcaattgtgataagagacaattatttcagtctgcccgttaataagacc gatacgagcaaagcccccttacactttcccattcctgtgattcgctatgtggtgaaggag gtgaagtttcagcatgaagtctacccgccatgcaaacctgattgtgattccagcctctca gaacacccagtctcccggcaggtgttcattgttcaggatcttgagattcgagatcgtttg gcaacatcacaaatgaataaatttttatacctgtattgcagtaaagaaatgcctcgaaaa gctcactccaacatgttgacagtgaaagccttacacgtgtgtccagaatctggcaggtcc ccacaggagtgctgcttgagagtgtcgctgatgccgctccgcctcaatattgaccaggat gctttgttcttcctgaaggatttcttcacaagtctttctgcagaagtagagcttcaaatg actccagatccagaagagaatttagattcacgtcagaagttcccattcgacttgattatc atggcaaacatgtatcaatggatcagactttcttctaattcatttcttccttttgacaaa atgatccaggcagctgcagagactgcttatgatatggtgtctcctggtaccctttctatc gagcccaagaagaccaaaaggtttcctcatcaccggttagcccaccagccagtagacctg agggaaggtgtggccaaggcctacagtgttgtgaaagagggaatcacagacacggctcag accatttatgaaactgcggctcgagaacacgagagcagaggggtgactggtgccgtgggc gaggttctgcgccagattcctccggcagtggtgaaacctctgattgttgccacagaagca acgtcaaacgtgctgggtggcatgagaaaccaaattaggccagatgtccggcaagacgag tcacagaaatggcgccacggggatgactga