GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:29:00 Sequence gi568815592r:166486124_166686477 : 200354 bp : 47.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.18 Intr - 1326 1214 113 1 2 42 101 53 0.117 2.10 1.17 Intr - 2798 2710 89 1 2 47 86 88 0.267 4.11 1.16 Intr - 7606 7428 179 1 2 75 90 86 0.319 6.22 1.15 Intr - 12527 12385 143 0 2 59 89 300 0.720 27.27 1.14 Intr - 14801 14764 38 1 2 93 72 38 0.729 0.61 1.13 Intr - 18489 18383 107 0 2 104 100 139 0.977 15.71 1.12 Intr - 22159 22080 80 2 2 90 83 98 0.410 8.77 1.11 Intr - 24234 24154 81 1 0 96 115 57 0.960 8.81 1.10 Intr - 27554 27482 73 2 1 123 41 12 0.216 -1.02 1.09 Intr - 28455 28321 135 0 0 99 67 55 0.393 5.26 1.08 Intr - 29039 29014 26 0 2 136 12 37 0.262 -1.36 1.07 Intr - 29922 29845 78 1 0 62 81 44 0.179 0.62 1.06 Intr - 34978 34893 86 0 2 96 47 48 0.211 0.96 1.05 Intr - 36384 36291 94 1 1 35 85 80 0.303 1.42 1.04 Intr - 45190 45109 82 1 1 74 116 104 0.560 11.01 1.03 Intr - 52661 52545 117 2 0 105 115 176 0.999 22.66 1.02 Intr - 54735 54708 28 2 1 96 67 2 0.532 -2.98 1.01 Init - 57395 57346 50 2 2 82 86 50 0.654 4.62 1.00 Prom - 71893 71854 40 -3.46 2.00 Prom + 72506 72545 40 -5.46 2.01 Sngl + 79299 79865 567 2 0 64 44 286 0.674 15.96 2.02 PlyA + 79909 79914 6 1.05 3.04 PlyA - 85991 85986 6 1.05 3.03 Term - 86528 86495 34 1 1 93 46 86 0.624 1.96 3.02 Intr - 87631 87525 107 1 2 25 93 107 0.101 3.91 3.01 Init - 91389 91339 51 0 0 68 90 43 0.128 3.72 3.00 Prom - 98412 98373 40 -4.16 4.02 PlyA - 98561 98556 6 1.05 4.01 Sngl - 100354 99998 357 1 0 56 44 263 0.614 14.76 4.00 Prom - 102994 102955 40 -6.96 5.00 Prom + 107218 107257 40 -1.96 5.01 Init + 110909 110994 86 1 2 72 52 81 0.435 3.19 5.02 Intr + 111633 111706 74 0 2 6 85 140 0.510 4.55 5.03 Intr + 116229 116513 285 1 0 61 49 109 0.386 1.61 5.04 Intr + 125693 125787 95 0 2 88 45 49 0.044 0.28 5.05 Intr + 126401 126567 167 1 2 93 57 33 0.055 -0.54 5.06 Intr + 130786 130871 86 1 2 57 92 51 0.095 1.86 5.07 Intr + 132739 132951 213 2 0 5 38 191 0.147 4.49 5.08 Intr + 139785 139904 120 1 0 109 50 23 0.055 1.07 5.09 Intr + 140515 140681 167 2 2 94 -11 53 0.072 -4.32 5.10 Intr + 140721 140890 170 2 2 65 38 184 0.082 9.84 5.11 Intr + 144411 144554 144 0 0 27 64 190 0.108 9.90 5.12 Intr + 160079 160168 90 2 0 18 107 88 0.172 2.81 5.13 Intr + 160849 160966 118 1 1 111 36 51 0.365 2.67 5.14 Intr + 171360 171435 76 2 1 67 85 31 0.006 -0.21 5.15 Intr + 189238 189334 97 2 1 119 86 75 0.615 9.47 5.16 Intr + 190595 190682 88 2 1 94 25 49 0.387 -0.83 5.17 Term + 192124 192306 183 0 0 84 36 104 0.368 2.34 5.18 PlyA + 192328 192333 6 1.05 6.04 PlyA - 192403 192398 6 1.05 6.03 Term - 193890 193475 416 1 2 48 42 158 0.475 2.62 6.02 Intr - 194447 194313 135 0 0 60 96 101 0.775 8.64 6.01 Intr - 199818 199680 139 0 1 128 46 68 0.439 6.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_1|533_aa MPSAMEDGNLATRFSNGKPPPAAAWKEEGVVKEIDISHHVKEGFEKADPSQFELLKVLGQ GSYGKVFLVRKVKGSDAGQLYAMKVLKKATLKVALHQPQSIIEPLTARPSGHQGRLRMLS PPQRRQLLGLTGHQGLPKAELHPGQPGTAVLEGRHKTSINICMMNIGSVRKGWTTRSKGL PTVDHIMASSVWHCSSVRQPTGIPEGTPAATLQGHPLACILLPKHLEAPGSPRLSSVPQS MLPLCPVRTVLTCWLGLRDRVRSKMERDILAEVNHPFIVKLHYAFQTEGKLYLILDFLRG GDLFTRLSKEVMFTEEDVKFYLAELALALDHLHSLGIIYRDLKPENILLDEEGHIKITDF GLSKEAIDHDKRAYSFCGTIEYMAPEVVNRRGHTQSADWWSFGVLMATVTPVEKHCVMSV QLPSQKHRACTRWSFGYISLSPSVPVCSLVSVPIGKESFGVPSLPRAKLGMPQFLSGEAQ SLLRALFKRNPCNRLVKQFLLVTEHLCMPSVATRRAGDTVSSRGTGDVRAGDT >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_1|1599_bp atgcccagtgctatggaggatggaaatttagccactcgatttagcaatgggaagccacct ccagcagccgcgtggaaggaagaaggcgtcgtgaaggagatagacatcagccatcatgtg aaggagggctttgagaaggcagatccttcccagtttgagctgctgaaggttttaggacaa ggatcctatggaaaggtgttcctggtgaggaaggtgaaggggtccgacgctgggcagctc tacgccatgaaggtccttaagaaagccaccctaaaagttgcgctgcatcaaccacaaagc atcatcgagcctctgactgccaggccttcaggtcaccaaggccgtctgcgtatgctcagc cccccacagagaaggcaactgctggggctcacagggcaccagggtctgcccaaggcagag ctgcacccgggacagccagggacagcagtgctggaggggagacataagacatcaatcaac atatgtatgatgaacattggttcagtccggaaaggttggacaactcgaagcaaaggtctt cccactgtggaccacatcatggcctcatcagtgtggcactgcagctctgtgcgccagccc actggcatccctgagggtaccccagctgccaccctacagggccatcccttggcctgtatc ctcctgcccaagcacctggaggctcccgggtcccctcgtttgtcttctgtgccacagtcc atgctgcccctgtgtccagtgaggacagtgctcacatgctggctgggacttcgggaccga gtgagatcgaagatggagagagacatcttggcagaagtgaatcaccccttcattgtgaag cttcattatgcctttcagacggaaggaaagctctacctgatcctggacttcctgcgggga ggggacctcttcacccggctctccaaagaggtcatgttcacggaggaggatgtcaagttc tacctggctgagctggccttggctttagaccatctccacagcctggggatcatctacaga gatctgaagcctgagaacatcctcctggatgaagaggggcacattaagatcacagatttc ggcctgagtaaggaggccattgaccacgacaagagagcgtactccttctgcgggacgatc gagtacatggcgcccgaggtggtgaaccggcgaggacacacgcagagtgccgactggtgg tccttcggcgtgctcatggcaacggtgaccccagttgagaagcactgcgttatgtctgtc caattaccttcccagaagcatcgggcctgcacccggtggagctttggctacatctccttg agcccctctgtgcctgtctgcagcctggtctcggtgccgattgggaaagaatcctttgga gtgccctccctcccccgagccaagctggggatgccgcagttcctcagtggggaggcacag agtttgctgcgagctctcttcaaacggaacccctgcaaccggctggtaaagcaattcctc cttgtcaccgagcacctctgcatgccgagcgtggctaccaggagagcaggtgacacagtg agctcccgtggcactggtgacgtcagagcaggtgacacg >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_2|188_aa MQGSALPGTWLQALLVWLAWAPAASAYSAIHAREQLSPQEPCGPCLGSDRLTCSQPHTLQ WCRGPVADSANPSVDLTRSWTPGASELAVGPVGLSLILDRAWTWGLWLGRGAPVWANRTA DGLEDSVTELSASTRDSTWMQRSPLAISGDTTPRSYCSQHCQAATSQEDISNKTKSIFPS SWTTAMWF >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_2|567_bp atgcaggggtccgccctccctggcacctggctgcaggctttgctggtgtggctggcctgg gccccggctgcctctgcctacagtgccatccacgctagggaacagctgtccccacaagaa ccatgtggtccatgtctgggcagtgaccgcctcacttgcagccaaccccacactcttcag tggtgccgaggtcctgtggcagactcagccaatcccagtgtggacttgactcggtcttgg acaccgggggccagtgagctagctgttgggccagtggggctcagcctgatactggacaga gcctggacctggggcctgtggctggggaggggtgctcctgtgtgggccaacaggacggcc gatggcctggaagacagtgtgaccgagctatcagctagcactcgggactccacatggatg cagaggtcaccgctggccatttcaggagacactacgcctaggagctactgcagccagcac tgccaggcagcgacatcacaggaggacatttctaacaaaacaaaaagcatcttcccatct tcgtggacaactgccatgtggttctga >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_3|63_aa MISIDSTSHFRVMLMQEPMDSRLQAYATRLWKRTSVFTVTTLAVNGSKCSKGRPYDQWSV EEL >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_3|192_bp atgatctccattgactccacatctcacttccgggtcatgctgatgcaagagccaatggac tcgaggctgcaagcttatgccaccaggctctggaagagaacctcggtgtttacggtcacg accctggcagtcaatgggagtaagtgcagcaaaggccggccatatgaccagtggagcgtg gaggagctgtaa >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_4|118_aa MTDTAEAVPNFEEMFASRFTENDKEYQEYLKRPPESPPIVEEWNSRAGGNQRNRGNRLQD NRQFRGRDNRWGWPSDNRSNQWHGRSWGNNYPQHRQEPYYPQQYGHYGYNQRPPYGYY >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_4|357_bp atgactgacactgccgaagctgttccaaattttgaagagatgtttgctagtagattcaca gaaaatgacaaggagtatcaggaatacctgaaacgccctcctgagtctcctccaattgtt gaggaatggaatagcagagctggtgggaaccaaagaaacagaggcaatcggttgcaagac aacagacagttcagaggcagggacaacagatgggggtggccaagtgacaatcgatccaat cagtggcatggacgatcctggggtaacaactacccgcaacacagacaagaaccttactat ccccagcaatatggacattatggttacaaccagcggcctccttacggttactactga >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_5|752_aa MGTDGERTRGGATDAENQPMVFTHENETKDKKYPMGDGIIEKRIETDFRSPRAGIYPREP PPTPAPTVLKGWMHKIIHGSTALRNTRRKQQASSPRATDTGLLAQSDEIPRRSKHRNQAL PISMDTAHECNLEQDAFMHTRDDSIYMTPTGQGSQVTSYLTILGHILNPELIPVTRKVPW TRARKGEMMSSELGPCCPTASTNPAHLTEVRHCFAEPVGCAPFITLSGRLLLPMLGCAKA PNTQEEEEQMNVGERTEPRRFMGPGRHQDRGLDAYNPERHTGRCAQEPCAYGLRLTPSDS PTSPVRMGSNRCHQIRAPALCIWAPIDAIRFAHEPCLSMVTEPMSVSNIMAAGTKKTKMK HLRLFQLKQTNQNPTGDHPTPRAASGGETSWKCSGGGVAPARTHGPPNYYGVPAITFGAV WFDFRNRTSFPGWRGGLGRPRPAQCPAPARPEGGRITSSRLRLELLERDFLRRYTEKNLR TANFFMLRITRVTWCWPSAKAGSRAFPEYQHSKALPITGILLDAPETVSSQYEGGSLSDH VESLDRFHMDQLLTPWLQNRSGGWERLTPSPPDLLPQFRATGPKASVDVLQALPGKNYST NICISKKRNTSSLPETNDTEPLWEPGEGGVAQPAISQPWSCPSFHAVNQLPLKMFVAQSY LQSTYHTLLRTLLPTSTYSRSTRFKLPSIYMFVGLNPLCSSRGSSTLILAQNISHPPDWA STLHGEPNHGKAGFHKPPCKPTDRRRLPSTTA >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_5|2259_bp atgggaacagatggagaaaggacacggggaggggccacagatgctgagaaccagcccatg gtgttcacccacgaaaacgagaccaaagacaagaaatatccaatgggtgacggcatcatc gagaagagaatagaaacagatttccggagtccacgggctggtatatacccgagagaacct cccccaacacctgcccctacggtgctcaagggatggatgcacaagatcattcatggcagc actgctctcagaaacaccagaaggaaacaacaagcatccagccccagggcaactgacaca ggactgttggcacagtcagatgaaataccacgcagaagcaaacataggaaccaagcgcta cccatcagcatggatacagctcatgaatgcaacctggaacaagacgcattcatgcataca cgagatgactccatttatatgacacccactgggcagggatctcaggttacatcctatttg accatcttgggccacattctcaaccctgagctcatccctgtgaccaggaaagtgccatgg accagggcgaggaaaggagaaatgatgagctctgagctggggccttgctgcccgactgcc tccactaatcctgctcatctgaccgaggtcagacactgctttgcagagcccgtgggctgt gctccatttatcactctgtccgggcgtctgttgttacccatgctcgggtgcgccaaggcc cctaacacccaagaggaagaggaacaaatgaatgtgggagaaagaactgagcccagaagg ttcatgggtccaggaagacatcaggacagaggtctagacgcttacaaccctgagagacac actggacgctgcgcgcaagagccctgtgcatatggactccgattgacgccatcagattcg cccacaagccctgtgcgtatgggctccaatcgatgccatcagattcgcgcaccagccctg tgcatatgggctcccatcgatgccatcagattcgcgcacgagccctgtctctcaatggtt acagaacccatgtctgtgtctaatatcatggcagctgggacaaagaaaaccaagatgaaa catttgagattgttccagctgaaacaaaccaaccaaaaccccacaggggaccatcccaca ccccgtgcagccagcggcggagaaaccagctggaaatgcagtgggggtggagttgccccc gcgaggacccacggaccgcccaattactacggagtcccagcgataacgtttggagccgtt tggttcgattttcggaaccggaccagctttccagggtggcgaggcgggctcgggcgacca cggcccgctcagtgcccggcacctgcgcgccccgagggcggccgcattacctcgagccgg ctcaggctggagctcttggagcgcgacttcctgcgcaggtacacagagaagaacctgcgc acggcgaacttcttcatgctcaggatcaccagggtcacctggtgctggccatctgcgaaa gcaggaagtcgtgcttttccggagtatcagcattcgaaggctctgcccatcacaggcatc ctgctggatgctccagagacagtgtccagccagtacgagggagggagtctctcagatcac gtggaatctcttgatcggttccacatggaccagcttctgacaccctggctgcagaacaga agcggcggctgggagaggctgacgcccagccctccagacctcctaccccagttcagagcc acaggtcccaaagcctccgtggatgtcctccaggcactgccagggaagaattattccaca aatatctgtatatcaaaaaagaggaatacgtcctcattgcctgaaactaacgacacggaa cccttgtgggaaccaggggagggaggtgttgcccaacctgccatctctcagccctggtcc tgccccagtttccacgctgttaatcagctgcctttaaagatgttcgtggcacagtcgtac ctgcaaagcacttaccacactctgctgcgaacactgctcccaacgtctacatattcacgc tcaaccagatttaaacttccttcaatctacatgtttgtgggtctaaatcctctctgctcc tcacgaggctcttccacgcttatcctggctcagaacatctctcatccccctgactgggcc tccaccctccacggggaaccaaatcatgggaaagctggattccacaagccgccctgcaag cccactgaccgcagaaggttgccttctacaacagcttag >gi568815592r:166486124_166686477|GENSCAN_predicted_peptide_6|229_aa GMGPSSITMVHCLRDHSMVGVKSSDVDTMEIEAQTQLSRGARQQGGGVKLQTFVEMLQFT KAAQTQKVGTSKIYYKEQKNKPSTTWKGTRAARQKNSPSPHPSQKPSWLHLSLALPRDFA APSPGTPAAQRELVPHQAQQALANRAECGACRAALTWNPRQPVSTTRNPSSRPRRSLHTS PPAERAGSGLGQPQRWAPTAQQRAEGLLERGQIEERRQVEEVPRASQGC >gi568815592r:166486124_166686477|GENSCAN_predicted_CDS_6|690_bp ggcatgggcccatcgtccatcaccatggtgcattgtctgcgggatcactcgatggtagga gttaaaagcagtgatgtagataccatggagatagaggcacagacccagcttagccgggga gcaaggcagcaaggaggtggagtgaagctgcaaaccttcgtggagatgttacagttcaca aaagcggcacagacccaaaaagtgggcaccagcaagatttactacaaagagcaaaaaaac aaaccttccacaacgtggaaggggaccagagcagctagacagaaaaattctccaagtccc cacccatcccagaagcccagctggcttcacctctcactggcactgccgagggactttgcg gcacctagtccaggtactcctgcagcccagagggagctcgtcccccatcaagcccagcag gcgctggccaaccgcgctgagtgcggggcttgccgagccgcgctcacctggaacccgcgc cagccagtgagcaccactcgcaaccccagctcccgcccgcgccgctccctccacacctcc ccaccagcagagagagccggctccggcctcggccagccccagagatgggcccccacagca cagcagcgggctgaagggctcctcgagcgcggccagatcgaggagcgccgccaggttgag gaggtgccgagagcgagccagggctgctag