GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:58:05 Sequence gi568815578r:35179566_35384487 : 204922 bp : 45.79% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3885 3957 73 2 1 80 65 108 0.951 8.93 1.02 Term + 10078 10106 29 2 2 51 41 68 0.393 -3.36 1.03 PlyA + 10299 10304 6 1.05 2.00 Prom + 16825 16864 40 -5.66 2.01 Init + 19439 19909 471 1 0 114 61 124 0.807 7.83 2.02 Term + 28844 28846 3 1 0 131 48 0 0.468 -2.80 2.03 PlyA + 29412 29417 6 1.05 3.05 PlyA - 32623 32618 6 1.05 3.04 Term - 32911 32898 14 2 2 127 28 16 0.195 -2.14 3.03 Intr - 38589 38433 157 0 1 56 91 51 0.070 1.78 3.02 Intr - 45387 45240 148 2 1 78 46 33 0.119 -1.66 3.01 Init - 47460 47078 383 0 2 61 56 426 0.927 31.04 3.00 Prom - 54508 54469 40 0.64 4.00 Prom + 54892 54931 40 -4.96 4.01 Init + 56428 56628 201 0 0 54 87 63 0.388 1.13 4.02 Intr + 67275 67423 149 2 2 108 54 180 0.357 15.63 4.03 Intr + 72340 72456 117 1 0 92 49 59 0.272 1.98 4.04 Intr + 74885 75189 305 2 2 91 103 256 0.965 23.63 4.05 Intr + 84226 84387 162 2 0 105 66 303 0.977 29.75 4.06 Intr + 87640 87854 215 2 2 82 97 226 0.935 21.33 4.07 Intr + 90195 90333 139 2 1 125 90 238 0.983 27.74 4.08 Intr + 92004 92274 271 1 1 97 74 405 0.703 36.60 4.09 Term + 94624 95044 421 1 1 85 53 807 0.732 71.66 4.10 PlyA + 97415 97420 6 1.05 5.08 PlyA - 97614 97609 6 -3.64 5.07 Term - 98367 98149 219 0 0 104 49 162 0.445 10.94 5.06 Intr - 98535 98481 55 2 1 -40 90 22 0.103 -11.42 5.05 Intr - 100182 100001 182 0 2 80 82 214 0.879 18.67 5.04 Intr - 100553 100377 177 2 0 59 91 141 0.994 11.62 5.03 Intr - 101264 101089 176 0 2 83 72 207 0.999 18.26 5.02 Intr - 104696 104611 86 1 2 94 82 213 0.960 20.76 5.01 Init - 104922 104816 107 0 2 83 80 166 0.937 14.99 5.00 Prom - 105513 105474 40 -15.42 6.05 PlyA - 106189 106184 6 1.05 6.04 Term - 108407 106970 1438 1 1 129 42 929 0.904 82.72 6.03 Intr - 109020 108896 125 0 2 73 99 168 0.999 15.78 6.02 Intr - 109393 109226 168 1 0 104 77 201 0.999 20.74 6.01 Init - 112739 112227 513 2 0 65 103 797 0.980 71.80 6.00 Prom - 120943 120904 40 -6.06 7.08 PlyA - 121672 121667 6 1.05 7.07 Term - 124504 124370 135 1 0 107 45 153 0.738 10.92 7.06 Intr - 127214 127101 114 2 0 79 109 116 0.975 13.44 7.05 Intr - 135200 135123 78 2 0 90 94 62 0.866 6.75 7.04 Intr - 135638 135612 27 2 0 71 98 29 0.551 0.51 7.03 Intr - 151483 151412 72 1 0 71 113 23 0.504 2.70 7.02 Intr - 153684 153607 78 0 0 34 92 85 0.638 3.25 7.01 Init - 155420 155385 36 2 0 75 102 87 0.683 6.82 7.00 Prom - 161233 161194 40 -5.66 8.07 PlyA - 161814 161809 6 1.05 8.06 Term - 166253 166230 24 2 0 105 54 20 0.803 -1.48 8.05 Intr - 166858 166676 183 1 0 54 -1 180 0.680 5.68 8.04 Intr - 167707 167599 109 0 1 91 98 59 0.783 7.49 8.03 Intr - 180392 180290 103 0 1 109 56 52 0.380 3.33 8.02 Intr - 194691 194619 73 0 1 72 97 69 0.948 5.18 8.01 Init - 201366 201307 60 0 0 78 119 3 0.559 3.66 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_1|33_aa MIESFLRPHQKKMLVPCFLYILQKWGITEPADM >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_1|102_bp atgattgaaagcttcctgaggcctcaccagaagaagatgctggtgccatgcttcctgtac atcctgcagaagtggggcatcacggaacctgccgacatgtga >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_2|157_aa MPSQPPRTFIAKDNSLPGFKASKHRLTLLLGDNAAGDFKLKPMSMYHYENPRAIKNDVNS LPMLHKWNNKAWMTAYLFIAWFTGYFKPTVETYCSGKKIPLKILLLIGNARGHPRALMEM YKEIQVVFMLANTSFILQSMDQGVTSIFKSYLRNTGQ >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_2|474_bp atgcccagccagccacctaggacttttatagctaaagataactcattgcctggattcaaa gcttcaaagcacaggctaactctcttactaggggataatgcagctggtgattttaagttg aagccaatgtccatgtaccattatgaaaatcctagggccattaagaatgatgtgaattct ctgcctatgctccataaatggaacaacaaagcctggatgacagcatatctgttcatagca tggtttactggatattttaagcctactgttgagacctactgctcaggaaaaaagattcct ctcaaaatattactactcattggcaatgcacgtggtcacccaagagccctgatggagatg tacaaggagattcaggttgttttcatgcttgctaacacatcattcattctgcagtccatg gatcaaggagtaacttcgattttcaagtcttatttaagaaatacaggccagtga >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_3|233_aa MPRARAPAPRASPTCPAKGASASSARATATATAARFPAPAAAAAARAAPGRQQSAGSSSS SSRPGTRQRLQRGAWPGGGGGGGGPGAARPPRLLGIPAAGPAPGPPAFAAAAAASAASAP PPPRAPPRMGVRKETDLRDNGHTRRGKGVWSDGKSRKMDGSIFTETESSCGGTGLKETQP WAHLAQPMGQQEQQLFLQEVHLHSLHLQEPAHVQEPPVVQEQLGSISSRGPAT >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_3|702_bp atgccccgagcccgcgccccggccccgcgcgccagccccacctgcccggcgaagggcgcc tccgcctcgtccgcccgcgccaccgccaccgccaccgctgcccggttccctgcccccgcc gccgccgccgccgcccgcgcggcgcccgggaggcagcagagcgcgggcagcagcagcagc agcagccgcccagggacccgccagcggctccagcgcggggcctggcccggcggcggcggc ggcggcggcggccccggcgcggcgcggccgccccggctcctcggcatcccggcggcgggg cccgcgcccggcccgcctgccttcgccgccgccgccgccgcctcggccgccagcgcgccc ccgcctccgcgcgccccgccccggatgggggtgagaaaagagacagatttaagagacaat gggcatacaagaagagggaaaggagtttggagtgatggcaaatcgagaaagatggatggt agcatcttcacagaaacggagtcctcatgtggaggaactggcttgaaggagacacagccc tgggcacacttggcacagcccatggggcagcaggagcagcagctcttcttgcaggaggtg catttgcactctttgcacttgcaggagccggcgcacgtgcaggagccaccagtggtgcag gagcagttggggtccatttcaagccgcggacctgctacgtaa >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_4|659_aa MVIIWLVVASWEWAKDILQSAREFPLVHLPPTQEPMAANRQCCAIKVFVKKLSWALWAIL RALFRLANWLKSYGYLLPYDSRASALHSAKALQSAVSTMQQFYGIPVTGVLDQTTIEWMK KPRCGVPDHPHLSRRRRNKRYALTGQKWRQKHITYSIHNYTPKVGELDTRKAIRQAFDVW QKVTPLTFEEVPYHEIKSDRKEADIMIFFASGFHGDSSPFDGEGGFLAHAYFPGPGIGGD THFDSDEPWTLGNANHDGNDLFLVAVHELGHALGLEHSSDPSAIMAPFYQYMETHNFKLP QDDLQGIQKIYGPPAEPLEPTRPLPTLPVRRIHSPSERKHERQPRPPRPPLGDRPSTPGT KPNICDGNFNTVALFRGEMFVFKDRWFWRLRNNRVQEGYPMQIEQFWKGLPARIDAAYER ADGRFVFFKGDKYWVFKEVTVEPGYPHSLGELGSCLPREGIDTALRWEPVGKTYFFKGER YWRYSEERRATDPGYPKPITVWKGIPQAPQGAFISKEGCTACALLFTLPQSRCRKCLGVV MLGCISADYTYFYKGRDYWKFDNQKLSVEPGYPRNILRDWMGCNQKEVERRKERRLPQDD VDIMVTINDVPGSVNAVAVVIPCILSLCILVLVYTIFQFKNKTGPQPVTYYKRPVQEWV >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_4|1980_bp atggtgatcatctggcttgtggtggcctcctgggagtgggccaaggacattctgcagagc gccagggaattcccgctagtgcacttaccacccactcaagagccgatggcagccaacagg caatgctgcgccattaaggtgtttgtcaagaagctaagctgggcattgtgggcgatcctg agggccctttttagattggctaactggttaaagtcctatggctatctgcttccctatgac tcacgggcatctgcgctgcactcagcgaaggccttgcagtcggcagtctccactatgcag cagttttacgggatcccggtcaccggtgtgttggatcagacaacgatcgagtggatgaag aaaccccgatgtggtgtccctgatcacccccacttaagccgtaggcggagaaacaagcgc tatgccctgactggacagaagtggaggcaaaaacacatcacctacagcattcacaactat accccaaaagtgggtgagctagacacgcggaaagctattcgccaggctttcgatgtgtgg cagaaggtgaccccactgacctttgaagaggtgccataccatgagatcaaaagtgaccgg aaggaggcagacatcatgatcttttttgcttctggtttccatggcgacagctccccattt gatggagaagggggattcctggcccatgcctacttccctggcccagggattggaggagac acccactttgactccgatgagccatggacgctaggaaatgccaaccatgacgggaacgac ctcttcctggtggctgtgcatgagctgggccacgcgctgggactggagcactccagcgac cccagcgccatcatggcgcccttctaccagtacatggagacgcacaacttcaagctgccc caggacgatctccagggcatccagaagatctatggacccccagccgagcctctggagccc acaaggccactccctacactccccgtccgcaggatccactcaccatcggagaggaaacac gagcgccagcccaggccccctcggccgcccctcggggaccggccatccacaccaggcacc aaacccaacatctgtgacggcaacttcaacacagtggccctcttccggggcgagatgttt gtctttaaggatcgctggttctggcgtctgcgcaataaccgagtgcaggagggctacccc atgcagatcgagcagttctggaagggcctgcctgcccgcatcgacgcagcctatgaaagg gccgatgggagatttgtcttcttcaaaggtgacaagtattgggtgtttaaggaggtgacg gtggagcctgggtacccccacagcctgggggagctgggcagctgtttgccccgtgaaggc attgacacagctctgcgctgggaacctgtgggcaagacctactttttcaaaggcgagcgg tactggcgctacagcgaggagcggcgggccacggaccctggctaccctaagcccatcacc gtgtggaagggcatcccacaggctccccaaggagccttcatcagcaaggaaggatgtact gcctgtgccctccttttcacactgccccagagcaggtgccggaagtgtctgggagtggtg atgctgggctgtatttctgcagattacacctatttctacaagggccgggactactggaag tttgacaaccagaaactgagcgtggagccaggctacccgcgcaacatcctgcgtgactgg atgggctgcaaccagaaggaggtggagcggcggaaggagcggcggctgccccaggacgac gtggacatcatggtgaccatcaacgatgtgccgggctccgtgaacgccgtggccgtggtc atcccctgcatcctgtccctctgcatcctggtgctggtctacaccatcttccagttcaag aacaagacaggccctcagcctgtcacctactataagcggccagtccaggaatgggtgtga >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_5|333_aa MAVRASFENNCEIGCFAKLTNTYCLVAIGGSENFYSVFEGELSDTIPVVHASIAGCRIIG RMCVGNRHGLLVPNNTTDQELQHIRNSLPDTVQIRRVEERLSALGNVTTCNDYVALVHPD LDRETEEILADVLKVEVFRQTVADQVLVGSYCVFSNQGGLVHPKTSIEDQDELSSLLQVP LVAGTVNRGSEVIAAGMVVNDWCAFCGLDTTSTELSVVESVFKLNEAQPSTIATSMRDSL IDRVRSVRGGGVTAPAPPDAETMGAQLSGGRGAPEPAQTQPQPQPQPAAPEGPEQPRHPP QPQPQPQPQPQPEPSPWGPLDDVRFLIACTSWY >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_5|1002_bp atggcggtccgagcttcgttcgagaacaactgtgagatcggctgctttgccaagctcacc aacacctactgtctggtagcgatcggaggctcagagaacttctacagtgtgttcgagggc gagctctccgataccatccccgtggtgcacgcgtctatcgccggctgccgcatcatcggg cgcatgtgtgtggggaacaggcacggtctcctggtacccaacaataccaccgaccaggag ctgcaacacattcgcaacagcctcccagacacagtgcagattaggcgggtggaggagcgg ctctcagccttgggcaatgtcaccacctgcaatgactacgtggccttggtccacccagac ttggacagggagacagaagaaattctggcagatgtgctcaaggtggaagtcttcagacag acagtggccgaccaggtgctagtaggaagctactgtgtcttcagcaatcagggagggctg gtgcatcccaagacttcaattgaagaccaggatgagctgtcctctcttcttcaagtcccc cttgtggcggggactgtgaaccgaggcagtgaggtgattgctgctgggatggtggtgaat gactggtgtgccttctgtggcctggacacaaccagcacagagctgtcagtggtggagagt gtcttcaagctgaatgaagcccagcctagcaccattgccaccagcatgcgggattccctc attgacagggtgaggtcggtacgcggtggtggcgtcacggcgccagctcctcccgacgcc gagaccatgggggctcagctaagcggcggccgcggcgccccggagcctgcgcaaacccag ccccagccccagccccagcctgcggcgccggagggcccggaacagccccggcatccgccc cagccccagccccagccccagccccagccccagcccgagcccagcccgtgggggccgctg gacgacgtgcgcttcctcatcgcctgcacttcctggtactga >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_6|747_aa MFGGPGPGVLGAQGMAGPLRGRVEELKLPWWRESSPLVLRHSEAARLAADALLERGEAAY LRVISEERELPFLSALDVDYMTSHVRGGPELSEAQGQEASGPDRLSLLSEVTSGTYFPMA SDIDPPDLDLGWPEVPQATGFSPTQAVVHFQRDKAKNIKDLLRFLFSQAHTVVAVVMDIF TDMELLCDLMEASSRRGVPVYLLLAQEHLRHFLEMCYKMDLNGEHLPNMRVRSTCGDTYC SKAGRRFTGQALEKFVLIDCEQVVAGSYSFTWLCSQAHTSMVLQLRGRIVEDFDREFRCL YAESQPVEGFCGGEDPLSPRALRPPPVALAFRPDVPSPTSSLPSSTSLSSIKQSPLMGRS SYLALPGGGDCSDTGVVSSSLGPARREASGQPSLHRQLSDPNHGSPPGLYRANLGKLGAY PWSQSSPALNHNSTSPLTLAVGSPLLPRSRPLLQFHRGAPALSRFPENGLPGSQEPSPLR GRWVPGTTLETVEEKEKKASPSQSRGQLDLLVPFPRAREVGDPDSGVTPNSGPLRPGEQA PEDRRLSPSQADSQLDLLSRALGTGGAPELGSLRPGDRALEDRRLSLNQSRGQSDLLMQY PKAQGSRVPLETNSSARPARRAPDERRQTLGHSQLDLITKFGPFRGEGPGPNGLPISSPA RTAGAGSGDEKRLTLGHSKLDLITKYHQLHGARQGTEPGGPKGGHLNGGNSDLVRDEKRL TLGHSKLDLITKYNKSKFKQLRSRFES >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_6|2244_bp atgttcggaggcccggggcctggggtcctgggagcccagggcatggcgggacccctgcgg ggccgggtggaagagctgaagctgccgtggtggcgggagagctcaccgctggtgctgcgg cacagcgaggcggctcggctggcggccgacgccctcctggagcggggtgaggctgcctac ctgcgggtcatctccgaggagcgggagctgcccttcctgagcgccctggatgtggactac atgaccagccatgtgcgcgggggccctgagctcagcgaggctcaggggcaggaggcctcc gggccagaccgcctcagcctgctctctgaagtcacctcagggacttacttccccatggcc tctgacatagaccccccagacctggacctgggctggcccgaggtgccacaggccacaggc ttcagccccacccaggctgtggtccacttccagagggacaaggccaagaacatcaaggac ctgctgcgcttccttttcagccaggcccacacggtggtggctgtggtgatggacatattc actgacatggagcttctgtgtgacctcatggaggcctcaagccggcgtggtgtccctgtg tacctgctccttgcccaggagcacctgaggcacttcctggagatgtgctacaagatggac ctcaatggggagcacctgccgaacatgcgtgtgcggagcacgtgtggggacacatactgc agcaaggctggccgccgcttcacggggcaggccctggagaagttcgtcctcattgactgt gagcaagtggtggcgggcagttacagcttcacctggctttgcagccaggcccacactagc atggtgctgcagctgaggggccgcatcgtggaagactttgaccgggagttccgctgtctg tacgctgagtcgcagcctgtggagggcttctgtggcggtgaggacccgctgtctccccgg gcactgcgtcctccccctgtggccctagccttcaggcctgatgtcccaagccccacgtcg tccctgccctccagcaccagcctcagcagcatcaagcagtcaccgcttatgggtcgctcc tcctacctcgctctaccaggaggtggtgattgcagtgatacgggtgtggtgtcctcgtcc ctgggtcctgcccgccgtgaggccagtggccagccctccctacatcgccaactgtcagac cctaaccacggctcccctcctgggctctatagggccaatctcggcaagctaggggcatac ccatggtcccagtcctcccctgccctcaaccataatagtaccagccccttaaccttggca gtggggtcacctctgcttcctcgctcccggcccctcctccagttccatcggggtgcccca gctctgtcccggttcccagagaatgggctcccaggaagccaagagcccagccccctgcgg ggtcgatgggtacctggcacaaccctggagacagtggaggagaaggagaagaaggcatct ccaagtcagagccgtggccagctggatctccttgtccccttccccagagcccgagaagtg ggagaccctgactctggggttacccccaactcaggcccccttcggcctggcgagcaggcc ccagaggacaggaggttgtccccaagccaggccgacagccagctggatctcctgtcccga gccctgggtactgggggtgcccctgagttgggttccctcagacctggtgatcgggccctg gaggacaggaggctgtccctaaaccaaagccgtggccaatcagacctcctgatgcagtac cccaaggcccagggttccagagtgccccttgaaaccaactcctcagccagacctgccaga cgggcaccagatgagcggcggcagaccctggggcacagccagctggacctcatcacaaag ttcggcccattccgtggtgaggggcctgggcccaatggtctcccgatatcaagccctgct cgcacggctggagctgggtctggggatgagaaacggctaaccctgggccacagcaagctg gacctcatcaccaagtatcatcagttgcacggggccaggcagggaactgagcctgggggt cccaagggtggccatctcaatggtggtaacagtgacctggtcagggatgagaaacggctg accctgggtcacagcaaactggacctcatcactaagtacaacaagtccaagttcaagcag ctccgaagccgctttgagtcctag >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_7|179_aa MERVLPGLATLRSWECPHITTEENQGAENLKDLAVAAIVFVAILLLLQRQKVFIVSCALG PKRALTSSLKQVNPYILKKNMILMTNHFYAAILGYDEGILSDDHGLAAALWRTFFNRKCE DPRHLELLVEYVRKQIQYLDSMNGEDLLLTGEVSWRPLVEKNPQSILKPHSPTYNDEGL >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_7|540_bp atggaacgagttctgcctggcctggccactcttcggagttgggagtgtccccatattaca actgaggagaaccaaggagcagaaaacttaaaggacttggctgtggccgccatagtgttt gtggctatactccttctccttcaaagacagaaggtgtttatcgtctcctgtgctttaggt cccaagagagcccttaccagctccctcaagcaggttaatccctatatcctgaagaagaac atgatcctcatgacaaatcatttctatgcagcgatcttgggatatgatgaggggatcctt tcagatgatcatgggctggccgctgccctctggagaaccttcttcaaccggaaatgtgaa gaccctcgacatcttgaattgctggtagagtatgtgaggaaacagatacagtacctggac tccatgaacggggaggatctgcttctgacaggggaggtgagctggcgccctctagtggag aagaatcctcagagcatcctgaagccccattctccgacttacaacgacgagggactttga >gi568815578r:35179566_35384487|GENSCAN_predicted_peptide_8|183_aa MPQGNAHWSILDFWIWDAQLKIKIAALRMYTSCVEKTDFEEFFLSCPVALMLPPEGLGFI VREFGKPGGDSHVHSPGKGMCLVRMKQEGRSGKYMCRIIVHFMWEDVQQRGRVMGWVRGL ADFKNEAADLHSKCYQLLKVVRTQRVSSSKIYGEDRKNKASTVWKGDLSGLPLLARGKVL PVD >gi568815578r:35179566_35384487|GENSCAN_predicted_CDS_8|552_bp atgccccagggaaatgctcactggagcattttggatttttggatttgggatgctcagctg aagattaagattgcggccctgcgcatgtatactagctgtgtggagaaaactgacttcgag gaattctttctaagctgtcctgtggccctcatgctccccccagaagggttaggcttcata gtgagggagtttgggaaaccaggtggagatagccatgtacacagccctggaaaagggatg tgtctagtccgaatgaagcaggaaggccggagtgggaagtacatgtgtcgtatcatagtt cattttatgtgggaggatgttcagcagcgcggcagagtcatggggtgggttcgtggtctc gctgacttcaagaatgaagccgcagaccttcacagcaagtgttaccagctcttaaaggtg gtgcggacccaaagagtgagcagcagcaagatttatggtgaagaccgaaagaacaaagct tccacagtgtggaagggggacctgagcgggttgccactgctggctaggggcaaagttctc cctgtggactga