GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:57:11

Sequence gi568815587r:8124619_8330505 : 205887 bp : 46.90% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   2186   2041  146  1  2   12   92   158 0.467   8.53
 1.02 Intr -   3706   3601  106  2  1   45  115    64 0.735   4.07
 1.01 Init -   5211   5187   25  0  1   77  100    30 0.798   2.79
 1.00 Prom -   9862   9823   40                              -2.46

 2.02 PlyA -  10146  10141    6                              -0.45
 2.01 Sngl -  10966  10379  588  1  0   86   32   262 0.539  16.59
 2.00 Prom -  15359  15320   40                              -8.56

 3.00 Prom +  15895  15934   40                              -3.16
 3.01 Init +  17343  19915 2573  2  2   47   53   727 0.169  52.65
 3.02 Intr +  21241  21317   77  1  2   15  106    42 0.073  -2.14
 3.03 Intr +  33836  33959  124  0  1   29   86   109 0.038   4.44
 3.04 Intr +  37700  37840  141  2  0   30   37   128 0.200   1.37
 3.05 Intr +  40362  40489  128  0  2   61   32   103 0.208   2.32
 3.06 Intr +  44131  44382  252  2  0   75  -13   223 0.150   8.31
 3.07 Term +  44393  44568  176  0  2   66   36   221 0.326  12.62
 3.08 PlyA +  45244  45249    6                               1.05

 4.04 PlyA -  45859  45854    6                               1.05
 4.03 Term -  52994  52870  125  0  2  129   43    24 0.110   0.55
 4.02 Intr -  73394  73286  109  2  1   91   61   112 0.858   8.66
 4.01 Init -  75200  75123   78  2  0   72   75    44 0.743   2.56
 4.00 Prom -  78064  78025   40                              -0.86

 5.09 PlyA -  82075  82070    6                               1.05
 5.08 Term - 100103  99998  106  1  1   79   42   123 0.921   4.68
 5.07 Intr - 102482 102357  126  1  0  110   78   172 0.983  18.19
 5.06 Intr - 105886 105673  214  2  1  119   64   407 0.991  39.17
 5.05 Intr - 116766 116663  104  2  2   24   75   120 0.198   4.12
 5.04 Intr - 118422 118359   64  1  1   95   10    53 0.077  -4.02
 5.03 Intr - 119809 119723   87  2  0   81  101    20 0.130   2.54
 5.02 Intr - 124646 124603   44  1  2  109   94    22 0.323   2.78
 5.01 Init - 128096 128032   65  2  2   75   16   126 0.380   2.93
 5.00 Prom - 128348 128309   40                              -2.16

 6.08 PlyA - 129714 129709    6                               1.05
 6.07 Term - 137556 137396  161  1  2   79   53   106 0.603   4.30
 6.06 Intr - 137698 137629   70  1  1   69   73    73 0.941   2.65
 6.05 Intr - 138335 138140  196  1  1   70   89   128 0.921  10.52
 6.04 Intr - 138979 138948   32  1  2  112   56    -5 0.809  -4.37
 6.03 Intr - 139440 139345   96  0  0   46   40   147 0.071   5.91
 6.02 Intr - 141933 141764  170  1  2   73   72    30 0.027  -0.43
 6.01 Init - 143830 143809   22  1  1  103   90    22 0.011   3.78
 6.00 Prom - 146079 146040   40                              -1.36

 7.00 Prom + 149089 149128   40                              -5.16
 7.01 Init + 153326 153418   93  1  0   68   91    38 0.586   2.49
 7.02 Intr + 155034 155150  117  2  0  107   66    24 0.649   2.76
 7.03 Intr + 171202 171265   64  0  1   64   81    68 0.149   1.99
 7.04 Term + 173737 173912  176  2  2  105   47    85 0.480   4.02
 7.05 PlyA + 175497 175502    6                              -1.95

 8.00 Prom + 177048 177087   40                              -6.26
 8.01 Init + 177223 177267   45  0  0   90  121     2 0.201   4.38
 8.02 Term + 188257 188586  330  0  0   49   43   209 0.369   7.16
 8.03 PlyA + 188668 188673    6                               1.05

 9.03 PlyA - 190434 190429    6                               1.05
 9.02 Term - 191866 191765  102  1  0   21   37   143 0.108   0.78
 9.01 Intr - 204501 204452   50  1  2   74  117    29 0.134   2.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 141394 141487   94  0  1   63   91    40 0.808   0.78
S.002 Term + 143388 144214  827  1  2   70   35   262 0.832  12.33

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_1|93_aa
MEKISMEKGNRIKTVSFIEDQNALRTVWKSNMCRLKRQEFVISRAQTVTSDQEKRLLHQL
REITRVMKEGKFIDRFSPEKEAEEAPYMEDWEX
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_1|279_bp
atggagaaaatctccatggagaaaggaaacagaattaagactgtttcattcattgaagat
cagaatgccctgcgtacagtctggaaaagtaacatgtgccgcctgaagagacaagagttt
gtgataagcagagcacagactgtgacttctgaccaagagaaacggttgctacatcagctc
cgagaaatcaccagggtcatgaaagaaggaaaattcattgacagattttctccagagaaa
gaagctgaggaggccccttacatggaggactgggaagnn
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_2|195_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKGAHIAKSVLSQKNKAGGIMLPD
FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEVISHIYNYLIFDKPDKNKKWGKDSLFNK
RLWENWLAICRKLKLDPFLTPYTKINSRWFKDLNVRPKTIKTLEENLGNTIGVDKDFMSK
TPKAMATKPKLTNGI
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_2|588_bp
atggccatactgcccaaggtaatttatagattcaacgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaggagcc
cacattgccaagtcagtcctaagccaaaagaacaaagctggaggcatcatgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaatggaacagaacagagccctcagaagtaatatcacacatctacaactat
ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa
aggctttgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggtttaaagacttaaatgttagacctaaaaccata
aaaaccctagaagaaaacctaggcaataccataggcgtggacaaggacttcatgtctaag
acaccaaaagcaatggcaacaaaaccaaaattgacaaatgggatctaa
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_3|1156_aa
MFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHRRKQERSKIDTLTSQLKELEKQEQTH
SKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLATLIKKKREKNQID
TIKNDKGDITTYPTEIQTTIREYYKHLYAKKLENLEEMDKFLNTYTLPRLNQEEVESLNR
PVTGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLLQSIEKEGILPNSFYE
ASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQ
GWFNIRKSINVIQHINRAKDKNHRIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIR
SIYDKPIANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLG
KEEVKLSLFADDMTVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTES
QIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKRKNIPCSWVGRIS
IVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKAILSQKNKAGGIM
LPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNDQTFDKPEKNKQWGKDSL
FNKWCWDNWLAICRKLKLDPFLTPYTKINSRWIKALNLRPKTIKTLEENLGITIQDTGMG
KDFMSRTPKAMATKDKIVKWDLMKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLIS
RIYDELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMR
YHLTPVRMAIIKKSGNNRNKKIVGFDLNKIDKDVERHESSCTVEWHTVILEYFNYPATYS
NSHSADNGTTDKFLTKCADIFMFHSTLTGEPNDNDLVMLMMSQLDDYNNLLPEEILTTQC
LHCCQGVLSKTRMAIIKKTDNKQVLTRNQKPSYTAGGHVKWYSCFGKWSGSSSKVSLVDI
SVKAADSQSHPSLKQRSPYSSAPGSSEGAGLLLPSGVGGGSCRFPRDRKALGSSSDRART
SPEARATLCTVEYAMTAHGADASRNRNRNRSTGCRSRPNRGRRGRRRARDKVREAHASSP
PLSGVDLLLPRGYPSL
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_3|3471_bp
atgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgggacgcattc
aaagcagtgtgtagagggaaatttatagcactaaatgcccacaggagaaagcaggaaaga
tccaaaattgacaccctaacatcacaattaaaagaactagaaaagcaagagcaaacacat
tcaaaagctagcagaaggcaagaaataactaaaatcagagcagaactgaaggaaatagag
acacaaaaaacccttcaaaaaattaatgaatccaggagctggttttttgaaaggatcaac
aaaattgatagaccgctagcaacactaataaagaaaaaaagagagaagaatcaaatagac
acaataaaaaatgataaaggggatatcaccacctatcccacagaaatacaaactaccatc
agagaatactacaaacacctctacgcaaaaaaactagaaaatctagaagaaatggataaa
ttcctcaacacatacactctcccaagactaaaccaggaagaagttgaatctctgaataga
ccagtaacaggagctgaaattgtggcaataatcaatagtttaccaaccaaaaagagtcca
ggaccagatggattcacagccgaattctatcagaggtacaaggaggaactggtaccattc
cttctgaaactactccaatcaatagaaaaagagggaatcctccctaactcattttatgag
gccagcatcattctgataccaaagccgggcagagacacaaccaaaaaagagaattttaga
ccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggcaaaacgaatc
cagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaa
ggctggttcaatatacgcaaatcaataaatgtaatccagcatataaacagagccaaagac
aaaaaccacaggattatctcaatagatgcagaaaaagcctttgacaaaattcaacaaccc
ttcatgctaaaaactctcaataaattaggtattgatgggacgtatttcaaaataataaga
tctatctatgacaaacccatagccaatatcatactgaatgggcaaaaactggaagcattc
cctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaacatagtg
ttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaattagga
aaagaggaagtcaaattgtctctgtttgcagacgacatgactgtatatctagaaaacccc
attgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatac
aaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagt
caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc
caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaa
ataaaagaggatacaaacaaacggaagaacattccatgctcatgggtaggaagaatcagt
atcgtgaaaatggccatactgcccaaggtaatttacagattcaatgccatccccataaag
ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa
aaaagagcccgcatcgccaaggcaatcctaagccaaaagaacaaagctggaggcatcatg
ctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtac
caaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatc
tacaacgatcagacctttgacaaacctgagaaaaacaagcaatggggaaaggattcccta
tttaataaatggtgctgggataactggctagccatatgtagaaagctgaaactggatccc
ttccttacaccttatacaaaaatcaattcaagatggattaaagccttaaaccttagacct
aaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacacaggcatgggc
aaggacttcatgtccagaacaccaaaagcgatggcaacaaaagacaaaattgtcaaatgg
gatctaatgaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagg
caacctacaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatcc
agaatctacgatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaag
tgggtgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacac
atgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgaga
taccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagaaataaa
aaaattgttggatttgacttgaacaaaattgacaaagatgtggagaggcatgaatcctca
tgcactgtggagtggcacactgtcatcttggagtacttcaactaccctgcaacctattca
aactctcattcagctgacaatggaacaactgacaagttcttgaccaagtgtgctgacatt
tttatgttccacagcaccctaactggtgaacccaatgacaatgacttggtgatgctcatg
atgtctcaactggatgactacaacaatcttctccctgaagaaatcctcacaacgcagtgt
ctacactgctgccagggtgtcctttctaaaaccaggatggctatcatcaagaagacagat
aacaaacaagtattgacaagaaaccagaagccctcttacactgctggtggtcatgtaaaa
tggtacagctgctttggaaaatggtctggcagttcctcaaaagtcagtttggtggatatt
tcggtgaaggctgcagactctcagagtcacccctccctcaagcagcgttctccgtactct
tcggcgccgggaagctcagagggagctggcctgctcttaccttcaggtgtcggcggcggc
tcctgccgcttcccgcgggacaggaaggccttgggcagcagcagcgacagagccaggaca
agcccagaagccagagcgactctctgcactgtggagtacgccatgactgctcacggtgca
gacgccagccggaaccggaaccggaaccggagcacgggctgccgctcccgaccgaaccgc
gggcggcgcggacggcgtcgcgcgagggacaaggtgcgcgaggcgcacgcatcttcacct
ccactgtcgggagttgacctgctcctcccgcgtggttacccgtcgctgtaa
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_4|103_aa
MGGHLEEQNKGEDGEQNRHAMSHCCQERAWNLQSPTSLSLLYTPITMDCYYRYSVQCGYP
KRGAEASEESWQESSKDILTLTTHHQETSILVPLTFAHLSPFG
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_4|312_bp
atgggaggtcatttggaggagcagaacaagggagaggatggagagcagaaccgtcatgcc
atgagccactgttgtcaggaaagagcctggaatctgcagtctccaacttcactaagtctc
ctgtatactcctatcaccatggactgctattaccgttacagtgttcagtgtggctaccca
aagagaggggcagaggccagtgaggagagttggcaggaaagcagtaaggacattctcacc
ttgaccactcaccaccaagaaacttctatcctcgtacccctaacctttgcccacctgtcc
ccatttgggtag
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_5|269_aa
MRFLTGCWTEGLHALLAMVAMPFVQSSQSGSVQGEAGLYREQIDLEVNLLTTREEIISAF
SKHSPAPWLADGLWRLQEGSRTRQLPRSLQLPCEELAVGTRHHAMLAVHRETVMAVEMQD
AGVPMLSVQPKGKQKGCAGCNRKIKDRYLLKALDKYWHEDCLKCACCDCRLGEVGSTLYT
KANLILCRRDYLRLFGTTGNCAACSKLIPAFEMVMRARDNVYHLDCFACQLCNQRFCVGD
KFFLKNNMILCQMDYEEGQLNGTFESQVQ
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_5|810_bp
atgcggttcctcacgggctgctggactgagggcctccacgccttgctggccatggtggcc
atgcctttcgtccagagctcgcagtctgggtctgtgcagggtgaggcagggctctatcgt
gaacaaattgatctggaagtgaatttattaaccactagggaagaaatcattagtgctttc
tctaaacattccccagccccgtggctagcagatgggctctggcggcttcaagaaggaagc
aggacccggcagctgcctagatctctccagctgccgtgtgaagaattggctgtgggtacc
agacatcatgccatgctggcagtgcatcgagagacggtgatggctgtggaaatgcaagat
gcaggcgtgccgatgctctccgtccagcccaaagggaagcagaagggctgtgcgggctgt
aaccgcaagatcaaggaccgctatctgctgaaggcattggacaagtactggcacgaagac
tgcctcaagtgtgcctgctgtgactgccgcctgggcgaggtgggctccaccctctacacc
aaggccaacctcatcctgtgccgacgcgactacctgaggctctttggcaccacagggaac
tgtgctgcttgcagcaagctgatcccagccttcgagatggtgatgcgggcccgggacaac
gtgtatcacctcgactgcttcgcctgccagctctgcaaccagagattttgtgtgggagac
aaattcttcctgaagaacaacatgatcttgtgtcagatggactatgaggaagggcagctc
aatggcacctttgaatcccaagttcagtaa
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_6|248_aa
MVLDQEDEAFRGHFGALLACRSESTCSSPVECVGGKLPPPAPGDLRIPLWCCGRVPGPRS
SLGGTIIEDHDPGIGAKTLTELGPRSLQVALRSRTAFVVPSNSEAQNHHPRASRRVTREA
LSAEQISGFGENRAPELGASGSAGGRRERQGARQARGRRVGPSGGHLHERDSGHLYVTVG
AHAAAGVLPLVRYRKGVSAGPEPSVPGAALLLMWPVGENAKLSDLLRVTQDVRIAAARCR
CPQAATAF
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_6|747_bp
atggtgttggaccaggaggacgaggcatttcgagggcattttggagcactcttggcttgt
cgaagtgagagtacatgctcatcacctgtggaatgtgttgggggaaaactacctccccca
gccccaggggatctgaggattccactctggtgctgtgggcgtgtccctggacctagaagc
agcttgggtgggacaatcattgaagaccacgaccctgggatcggagcaaagacgctcacc
gagctaggacctagaagcctccaggtggcgctgcggagccggacagccttcgtagttcct
tccaattctgaggcacaaaaccatcaccctcgcgccagccgccgcgtaacccgagaggcg
ctgagcgccgagcaaatcagcggcttcggggagaatcgggctccagagctcggggcaagc
gggagcgcgggaggacggcgcgagcgccagggtgcccggcaggcgaggggaaggcgggtt
ggaccctccgggggacatctgcatgagcgtgactcggggcatttgtatgtgactgttggt
gcccacgctgccgccggcgttctgccattggtgaggtaccgaaagggagtctctgctggg
cctgagccctctgtccctggggcagcgttgttattgatgtggccagttggtgaaaatgca
aagctaagtgacttgctcagggtcacacaggatgttcgcattgccgctgccaggtgccga
tgcccccaggctgctaccgccttctga
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_7|149_aa
MQVVPLELQPHVPTAHASQKQSFSMLQAGAQTRLSETKDVVQAYGHKKRSGGCFFQALPH
RTTDGSLAQQPLDAEQLVEYTTKLPNMQVLQALMSLPRYDGGCAARIRTDQGTGAKATTE
AVGVTAPGPRWPQQLVEGALVELLAEFPS
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_7|450_bp
atgcaggtggtccccttggagctacaaccgcatgtcccaacagcacatgcatctcagaaa
cagtccttctccatgctgcaggcaggggcccagaccagactctctgaaactaaagatgtg
gtccaggcctatggacacaaaaaaaggtctgggggctgctttttccaagctctacctcat
aggaccacagatgggtccctggctcaacagcctctggatgctgaacagctggttgaatac
acaactaaactacccaacatgcaagtgctgcaagctttaatgtccctccccaggtatgat
ggtggctgtgcagccaggatcagaacagatcagggcactggagccaaggccaccacggaa
gcagtgggtgttacagcgccgggaccaaggtggccccagcagctggttgaaggcgcgctt
gtggagctgctggctgagtttccttcatga
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_8|124_aa
MRLSAQVPWHLLHNKRGLQRQLEEPPCLPSIDNGDLGSSTPYPCPANCPKRPSQAYGPQT
PSTTAKTLTCSTPPRPICIHGPVGDHSVRSSLSYGLGRDPSLASLSLSQASQFSVKAAGA
SKET
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_8|375_bp
atgaggttatcagcccaggtgccctggcacctcctccacaacaagagaggacttcagagg
cagctagaggagcctccatgtctgccatccatagacaatggagacttaggaagcagcacc
ccatacccctgtccagccaactgccccaagaggccttcccaagcctatgggccccagacc
cctagcaccacagccaagaccctcacctgctccaccccaccccgccccatctgcatccac
gggccagtgggggaccattcagtgagatcctccctgtcctacggccttggccgtgacccc
agcttggcctctttgtctctcagccaggccagccagttcagcgttaaagctgctggagca
tccaaggagacataa
>gi568815587r:8124619_8330505|GENSCAN_predicted_peptide_9|50_aa
XSLQRLSCSSASDRERPWLLTPAILYTLAYGPTRTYFEDERNTAVLSPVV
>gi568815587r:8124619_8330505|GENSCAN_predicted_CDS_9|153_bp
ngttctctgcagcggctcagctgcagcagtgcctctgacagggagaggccgtggctcctc
accccagccattctgtacactctggcctatgggcccacacgaacctattttgaagacgag
aggaacacagccgtgttgtctccagtggtttag