GENSCAN 1.0 Date run: 5-Nov-116 Time: 14:38:54 Sequence gi568815597f:212515022_212719552 : 204531 bp : 44.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 4189 4184 6 1.05 1.02 Term - 7905 7861 45 0 0 106 52 81 0.332 3.51 1.01 Init - 25559 25464 96 2 0 68 105 68 0.788 5.58 1.00 Prom - 26608 26569 40 -1.66 2.00 Prom + 37612 37651 40 -3.16 2.01 Init + 44859 44925 67 2 1 16 86 55 0.107 -0.67 2.02 Intr + 45197 45475 279 0 0 90 87 63 0.236 3.85 2.03 Intr + 67046 67084 39 0 0 96 115 21 0.039 3.80 2.04 Intr + 68933 68973 41 0 2 91 78 41 0.031 1.34 2.05 Term + 71399 71497 99 1 0 58 42 81 0.060 -1.37 2.06 PlyA + 71687 71692 6 1.05 3.00 Prom + 78411 78450 40 -2.16 3.01 Init + 82254 82315 62 2 2 81 65 131 0.714 10.82 3.02 Intr + 86806 86833 28 1 1 38 86 11 0.082 -5.98 3.03 Intr + 89286 89370 85 2 1 79 71 64 0.343 3.19 3.04 Intr + 92001 92111 111 1 0 21 90 87 0.305 2.55 3.05 Intr + 92256 92337 82 1 1 90 80 38 0.241 1.90 3.06 Intr + 94069 94245 177 1 0 27 60 147 0.220 4.83 3.07 Intr + 99997 100240 244 1 1 63 90 236 0.720 18.80 3.08 Intr + 103106 103213 108 1 0 103 87 185 0.999 20.38 3.09 Term + 104337 104534 198 2 0 111 47 243 0.999 19.90 3.10 PlyA + 105726 105731 6 1.05 4.00 Prom + 106508 106547 40 -7.16 4.01 Sngl + 109857 111641 1785 2 0 87 36 1252 0.966 114.42 4.02 PlyA + 111732 111737 6 1.05 5.05 PlyA - 112611 112606 6 1.05 5.04 Term - 115234 115074 161 2 2 101 47 49 0.029 0.20 5.03 Intr - 121432 121251 182 0 2 70 -7 109 0.073 -0.89 5.02 Intr - 122204 122082 123 1 0 60 106 38 0.026 2.50 5.01 Init - 136386 136316 71 0 2 100 106 35 0.759 7.02 5.00 Prom - 138266 138227 40 -4.26 6.00 Prom + 145455 145494 40 -4.96 6.01 Init + 146269 146845 577 0 1 27 -14 346 0.738 13.60 6.02 Term + 147295 147932 638 2 2 55 44 281 0.451 14.81 6.03 PlyA + 148314 148319 6 -0.45 7.00 Prom + 148682 148721 40 -2.46 7.01 Init + 165249 165315 67 2 1 73 94 41 0.720 4.43 7.02 Intr + 167503 167592 90 2 0 4 99 83 0.414 0.97 7.03 Intr + 168104 168163 60 0 0 100 93 18 0.595 2.21 7.04 Term + 168540 168616 77 1 2 86 42 90 0.726 2.20 7.05 PlyA + 168957 168962 6 1.05 8.06 PlyA - 171469 171464 6 1.05 8.05 Term - 171958 171770 189 1 0 104 48 271 0.999 22.15 8.04 Intr - 172566 172446 121 2 1 -1 109 130 0.607 6.70 8.03 Intr - 180783 180680 104 0 2 57 62 38 0.320 -2.93 8.02 Intr - 182044 181940 105 1 0 83 78 147 0.802 13.61 8.01 Init - 182220 182218 3 0 0 52 101 0 0.880 -2.30 8.00 Prom - 182515 182476 40 -7.96 9.00 Prom + 183080 183119 40 -6.26 9.01 Init + 184310 185498 1189 1 1 97 21 384 0.058 24.03 9.02 Intr + 195203 195256 54 0 0 78 111 31 0.256 3.35 9.03 Term + 196701 196909 209 1 2 106 39 45 0.185 -1.10 9.04 PlyA + 196912 196917 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_1|46_aa MSSPSCASRGAYSCLCLMLITTTTTTTQESSLILLMAFELLLNHLH >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_1|141_bp atgagctccccgtcgtgtgcaagtagaggagcctactcctgcctttgtctgatgctgatc accacgacaaccaccacgacccaggaaagctcactgattctgctcatggcctttgagctg ctgctgaaccacttgcattga >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_2|174_aa MNGELEVSPRSSDSEVHGFGDEALPDPPLTLCFLSGNSRGHFYLKSPHLAYQRPDLPKCR RNTWLSPLSSSPLEHKTTESFLSEQHACAKNKLETTTTSQNFEQNDGDRVLEMEQRLQNL ANENTSGPDTVMTKTDRISALMCRVNDVAVTPWGIFSTKSNAWHAVSTKGVFGE >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_2|525_bp atgaatggcgagctggaagtcagtcccaggtcatcagactccgaagtccatggttttggt gatgaagccctgccagacccacccttgactctgtgtttcctttctggaaactcgagaggc catttctacttaaagagtcctcacctggcctaccagcgaccagatctgcccaagtgcaga agaaacacttggttatctcctctttcatcgtccccgctcgagcataaaacaacagaaagt ttcctatcggaacagcatgcttgtgccaaaaacaaattagaaacaacaacaacttctcaa aattttgagcaaaatgatggtgaccgtgttctggaaatggagcaaagattgcaaaattta gcaaatgaaaatacatcaggcccagatacagtgatgaccaaaacagaccgtatctctgcc ctcatgtgcagagttaatgacgtagcagtgactccatggggcatcttcagcaccaagagc aatgcctggcatgcagtgagcactaaaggagtattcggtgaatga >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_3|364_aa MIIIFIIDCRHSAYNHKIEELFPYDLVASRDFILDSGVSGFNLSNGNICYLGFGPVDQGP EQDLGGSRVELQGSRVRRGAKTSEAALRVFPGPWREAGGKEQVFLPFTVPPHPAAAQVSL PGSHRCSSALRKVSRKAKELTFSFDLGRERFLVTEVGRPLCRQPRAPLLSADGAGLPGWE RAKMMLQHPGQVSASEVSASAIVPCLSPPGSLVFEDFANLTPFVKEELRFAIQNKHLCHR MSSALESVTVSDRPLGVSITKAEVAPEEDERKKRRRERNKIAAAKCRNKKKEKTECLQKE SEKLESVNAELKAQIEELKNEKQHLIYMLNLHRPTCIVRAQNGRTPEDERNLFIQQIKEG TLQS >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_3|1095_bp atgatcatcatcttcatcatcgactgtagacacagcgcttataatcacaaaattgaagaa ctgtttccatatgacttggtagcttcaagggacttcatcttggattctggggtctctgga ttcaatctctcaaatgggaatatctgctacctgggttttggccctgtagaccaaggtcca gagcaggatctcggaggatcccgcgtggaactccagggctcccgggtccgccggggcgca aagacttccgaggccgccctccgcgtgttcccaggcccgtggagagaggcaggtggaaag gagcaggtgtttctgcccttcaccgtgcccccacaccctgcggccgcgcaggtctccctc ccaggcagccaccgctgctcctcggcgcttcgaaaagtttcccgcaaagcgaaagaacta actttctcttttgacttggggcgcgaaaggttcctggtgacggaggtcgggcgtccactc tgccgccagcctcgagcgccgcttctctccgccgacggcgctggcttgcccggctgggag agggccaaaatgatgcttcaacacccaggccaggtctctgcctcggaagtgagtgcttct gccatcgtcccctgcctgtcccctcctgggtcactggtgtttgaggattttgctaacctg acgccctttgtcaaggaagagctgaggtttgccatccagaacaagcacctctgccaccgg atgtcctctgcgctggaatcagtcactgtcagcgacagacccctcggggtgtccatcaca aaagccgaggtagcccctgaagaagatgaaaggaaaaagaggcgacgagaaagaaataag attgcagctgcaaagtgccgaaacaagaagaaggagaagacggagtgcctgcagaaagag tcggagaagctggaaagtgtgaatgctgaactgaaggctcagattgaggagctcaagaac gagaagcagcatttgatatacatgctcaaccttcatcggcccacgtgtattgtccgggct cagaatgggaggactccagaagatgagagaaacctctttatccaacagataaaagaagga acattgcagagctaa >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_4|594_aa MNADFLLPYYTAQSGSSMSMFNTTMGKLQRQLYKGEYDIFKYAPIFESDFIQITKRGEVI DVHNRVRMVTMGIARTSPILPLPDVMLLARPATGCEEYAGHGQATKRKKRKAAKNLELTR LLPLRFVRISVQDHEKQQLRLKFATGRSCYLQLCPALDTRDDLFAYWEKLIYLLRPPMES NSSTCGIPAEDMMWMPVFQEDRRSLGAVNLQGKGDQDQVSIQSLHMVSEVCGATSAAYAG GEGLQNDFNKPTNVLNASIPKTSTELAEEPATGGIKEAAAAGAAAGAATGTVAGALSVAA ANSAPGQVSAAIAGAATIGAGGNKGNMALAGTASMAPNSTKVAVAGAAGKSSEHVSSASM SLSREGSVSLAIAGVVLTSRTAAEADMDAAAGPPVSTRQSKSSLSGQHGRERTQASAEGC KEGRERREKDRALGRSSHRRRTGESRHKTRGDKIAQKSSSRSSFSHRANRDDKKEKGCGN PGSSRHRDSHKGVSHTPISKESRTSHKSGRSLWTTSSGSSKGLGRVSSFLRNVRANLTTK VVGTPHGRDVNVMAKMAERSTNVAIAETAEGGQGLETVGSMTPDIMETVTFEAH >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_4|1785_bp atgaatgctgattttctgctgccgtattatacggcccagagtggctccagcatgagcatg ttcaacaccaccatggggaaactgcagcgacaactgtacaagggggagtacgatatattc aagtatgcaccgatatttgagagcgactttatccagatcaccaaaaggggagaagtgatt gatgtgcacaaccgtgtccgtatggtgaccatgggcattgcacgtaccagccccatcctc ccactcccagatgtcatgctactggcacgaccggccaccggctgcgaagagtatgctgga catggccaggccaccaagagaaaaaaacgcaaggcagcaaagaacttagagctcaccagg cttctgcccctgaggtttgtacggatctctgttcaagaccatgagaaacaacagctgcgc ctgaagttcgccactggcagatcttgctatctgcaattgtgtcccgctcttgacacacgg gatgacctctttgcctattgggaaaaactaatttacctcttgcggccacccatggagagt aacagcagtacctgtggcattccagctgaagacatgatgtggatgcctgtgtttcaggaa gacaggaggagcctgggagccgtgaaccttcaaggaaagggggatcaggaccaggtcagc atccaaagcctccacatggtctctgaggtgtgtggggccacctctgctgcttatgctgga ggggagggactccaaaatgactttaacaaacccactaatgtgctcaatgcatccatcccc aaaacatctacagaacttgctgaggagccagcaacaggggggattaaagaggcagcagca gcaggggcagctgcaggggcagcaacaggcaccgtagcaggtgccttgagtgtggcagca gccaattctgcccctggacaggtgagcgcagccatagctggggcggccaccatcggtgca ggaggaaacaaaggcaacatggcccttgcaggcactgccagcatggctccaaacagcacg aaggtggctgtggcaggggctgcaggcaagtcctcagagcatgtttccagcgcatccatg agcctttcccgagagggcagtgtgagcctggccattgcaggagtagtactgaccagcagg acagctgcagaagcagacatggatgcagcagcgggacctcccgtctccacccggcagagc aagagcagcctgagtggacagcatggaagggagcgaacccaggccagcgctgaaggctgc aaggaggggagggaaagaagggaaaaggacagggctctcggaaggagttcccatcgccgc aggacaggtgaaagccgccacaaaacaaggggagacaagattgcccaaaagtcctccagc aggtcctcattcagccacagagccaatagagatgacaaaaaggagaaaggctgtggcaac ccggggagcagcaggcacagggactcgcataaaggtgtcagccacacgcccatctcaaag gagtccaggacctctcacaaatctgggaggagcttatggaccaccagttccggttccagc aagggacttggcagggtcagctctttcctgaggaacgtcagagccaaccttactacaaaa gtagtgggcacaccacatggcagagatgtgaacgtcatggctaagatggcggagaggagc accaacgtggccatcgccgagacagcagagggtggccaggggctggagacggttggttct atgacaccggacatcatggagacagtgacctttgaagcccattaa >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_5|178_aa MINYQEASHDYMLHMILQVLLTAGVGWKPSKAVRSAESAHDGARSVGSSVTPLGFNPGCT SGGLGPPVRASGDLLEGGREEGMYGCHHAWMEDQKVFITSICGAPNMCQTVVDLDTQKHG HSPKGALTCSNSTGPPEALLGDSCPAEKGPVGEHLTQFPADEKGRRVQEASTLRGLQH >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_5|537_bp atgattaattaccaggaagcaagtcacgactatatgcttcatatgattttgcaagtttta ttaacagctggtgtaggttggaagccttctaaggccgtgcgttctgcagagtcagcccat gatggggccaggagtgtgggctccagtgtcacccccttgggatttaatcctggctgcacc tctgggggccttgggcctcctgtcagagctagcggagatctgctggaaggaggccgggag gaggggatgtacggatgccatcatgcctggatggaggaccagaaggtctttataacaagt atttgtggagctcctaatatgtgccaaactgtggtagacctggatacccagaaacacggc cactcccctaaaggagccctgacctgcagtaacagcacagggccaccagaggctctattg ggcgactcatgccccgcggagaaaggcccagtaggagagcatctgacccagttcccagca gatgagaaaggaagaagagttcaggaggcttctactctcagaggtctccagcactga >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_6|404_aa MKKKREKNQIDSIKNDEGDITTDLTEIQTTIREYYKHLYANKLENLEEMDKFLNTYTLPR LNQEELESLNRPITDSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIE KEGILPNSFYEASIILIPKPGRDTTTKENFRPISLMNIDAKILSKTLANRIQQHIKNLST MIKWASSLGCKAGNFSNVSGYKINVQKSQAFLYTKNRQTESQIMSEFPFTTASKRIKYLG IQLTRDVKDLFKENYKPLLNEIKKDTNKWKNIPCSSVGRINIMKMAILPKVIYRFNAIPI KLPMTFFTELEKTTLKFIWNQKKARIAKSILSQKNKAGDITLPDFKLYYKATVTKTAWYW YQNRDIDQWNRTEPSEIMLHIYNHLIFDKPDKNKKRGNDSLFNK >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_6|1215_bp atgaagaagaaaagagagaagaatcaaatagactcaataaaaaatgatgaaggggatatc accaccgatctcacagaaatacaaactaccatcagagaatactataaacacctctacgca aataaactagaaaatctagaagaaatggataaattcctcaacacatacaccctcccaaga ctaaaccaggaagaacttgaatctctgaatagaccaataacagactctgaaattgaggca ataattaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagctgaattc taccagaggtacaaggaggagctggtaccattccttctgaaactattccaatcaatagaa aaagagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcct ggcagagacacaacaacaaaagagaattttagaccaatatccctgatgaacatcgatgca aaaatcctcagtaaaacactggcaaaccgaatccagcaacacatcaaaaacttatccacc atgatcaagtgggcttcatccctgggatgcaaggctggcaacttcagcaacgtctcagga tacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaaaaacagacaaacagag agccaaatcatgagtgaattcccattcacaactgcttcaaagagaataaaatatctagga atccaacttacaagggatgtgaaggacctctttaaggagaactacaaaccactcctcaat gaaataaaaaaggatacaaacaaatggaagaacattccatgctcatcagtaggaagaatc aatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaac caaaaaaaagcccgcattgccaagtcaatcctaagccaaaagaacaaagctggagacatc acgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgctgcat atctacaaccatctgatctttgacaaacctgacaaaaacaagaaacggggaaacgattcc ctatttaacaaatga >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_7|97_aa MTGRNSSYEGLIQDKAAQDSRTVAFVLICDDISGLLKNIIPEKGDARTAKRTGLLHAARM TLPGLPGPQGREAQQAAALRLEMQMPSTSPGAQRQYE >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_7|294_bp atgacggggaggaatagcagctatgagggcctaattcaggacaaggcagcccaggacagc agaacagtagccttcgttctgatctgtgatgatatttctggtcttctcaagaatatcata ccagaaaaaggtgatgctagaacagccaagagaacaggcctccttcatgcagcacggatg accctgccaggacttcctggacctcaagggagggaagcccagcaggcagccgctctgaga cttgagatgcagatgcccagcacaagtcctggtgcccagcggcagtacgaatag >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_8|173_aa MSPEDDDRKVRRREKNRVAAQRSRKKQTQKADKLHELGRGAAPYQELALKASKDLSEEWA TIPCSCLCRGWSAQNKLDMDGFYIVWIGLRDLSDTLQFYASVNPDGEETEQEYESLEQEN TMLRREIGKLTEELKHLTEALKEHEKMCPLLLCPMNFVPVPPRPDPVAGCLPR >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_8|522_bp atgagccctgaggatgatgacaggaaggtccgaaggagagaaaaaaaccgagttgctgct cagagaagtcggaagaagcagacccagaaggctgacaagctccatgagttgggacgtgga gctgctccataccaggaactggccttgaaagcatccaaggatctatctgaggaatgggcc accattccatgttcttgtctttgtcggggttggagtgcccagaacaagttggatatggat ggtttctacattgtgtggattgggctaagggacctctctgataccctgcagttctacgcc tctgtaaaccctgatggagaggaaactgaacaggaatatgagagcctggagcaagaaaac accatgctgcggagagagatcgggaagctgacagaggagctgaagcacctgacagaggca ctgaaggagcacgagaagatgtgcccgctgctgctctgccctatgaactttgtgccagtg cctccccggccggaccctgtggccggctgcttgccccgatga >gi568815597f:212515022_212719552|GENSCAN_predicted_peptide_9|483_aa MGRPSGQCGAHVPGSTGHARPSQASAPSGFFPQRAQELAQEPFQEPRTLCARQGPWIAPI PQHPYAPTSACLRPHPPPLHLRSHHHGTPAPTSSPPAARRSPHPTALPEPLALYLLLRLR LRLVPGRRDAPLQDAAGGREPLRHAGRSSGPARPAPPARPARGAAYGRCPAAGILVPHPR PRAGPAPSGFPLSPLLHASAQCPRAPVPAAVLPVPRLPRPPSPTPASFPPLPALSPHLGV PNPTPAAAFRRRPALRGSHVVALKPRREREAHPREATPGGVGTRGAPCSLWGPSAAAGEL RLRNTVGVLGSVFFPGPIAASSCASATPCVRPPARGSQSRGTFSAAISHPGATLAPPAAP KPRLWGEDAGFTESHRAPRATGAPGTLQGVPALPVSGHVHIQSPLALHLGISMPDYKPYE DKGNVFIMLAPPNSVLAVVITSTRNDALKRRQLKKLAHVTQRSDWAGIQTQAEHALNTIL IGQ >gi568815597f:212515022_212719552|GENSCAN_predicted_CDS_9|1452_bp atggggcggccctcagggcagtgcggagcccacgtgccggggagcaccggccatgcccgg cccagccaggcgtcggcgccctcgggcttcttcccccagagggcgcaggagctggctcag gagccattccaggagccgcggacactgtgcgcacgacagggtccctggatcgcccccatt ccccaacatccctacgcccctacctctgcttgtctccggccgcacccgccacctctgcac ctccgcagtcaccaccacggaactccagcacccacctcctcgccccccgcggcgcgccgg tccccgcaccccacggccctccctgagcctctcgccctctacctgctgctgcggctgcgg ctgcggctggttcccgggcgccgcgacgctcctctgcaggacgctgccggcggccgggag cccttgcgacatgccgggcgctcctctggcccggcccgccccgccccgcccgcgcgccct gcccgtggggctgcctacgggcgctgtcccgccgccggcatcctcgtgccgcacccacgg cctcgcgccggccccgccccctcaggcttcccgctctccccgctactccacgcgtcggcc cagtgtccccgcgcgccagtcccggcagccgtccttcccgtcccacgtctgccccgccct ccttctcccacccccgcctcgtttcctcctctgcccgccctctctccgcacctcggggtc ccaaatcccacccctgccgcggctttccgccggcgccccgcgcttcgtggaagtcacgtg gtggctctgaaaccccggagggaacgcgaagcccacccacgcgaggcgacgcccgggggg gtggggaccaggggggcgccatgcagcctttggggtccttcggcggctgccggggaactc aggttgagaaacacagtgggtgttttggggtcggtgttttttcccggtcccatcgcggca tcttcctgcgcctcagccacgccctgcgtccgccccccggcgcgggggtcacagtcccgg gggactttctccgcggcgatcagtcatccgggcgccactctcgctcctccagcggccccg aagccgcggctctggggtgaagacgccggcttcaccgagtcccaccgcgcaccccgagcc acgggcgccccagggactctgcagggagtgcctgcccttcccgtctcgggacatgttcat atccaatcacctctggccctgcacctcggcatctccatgccagattacaaaccatatgag gacaaaggcaatgtcttcattatgctggcccctcccaactcagtgctagctgtagtaata accagcactagaaatgatgcattgaagcgcagacaattgaagaaacttgcccacgttacc cagagaagtgactgggctgggattcaaactcaggcagagcatgcacttaatacaatttta attggtcaataa