GENSCAN 1.0 Date run: 4-Nov-116 Time: 08:22:21 Sequence gi568815597r:212586794_212799762 : 212969 bp : 43.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6639 6678 40 -2.16 1.01 Init + 10482 10543 62 2 2 81 65 131 0.711 10.82 1.02 Intr + 15034 15061 28 1 1 38 86 11 0.081 -5.98 1.03 Intr + 17514 17598 85 2 1 79 71 64 0.343 3.19 1.04 Intr + 20229 20339 111 1 0 21 90 87 0.305 2.55 1.05 Intr + 20484 20565 82 1 1 90 80 38 0.241 1.90 1.06 Intr + 22297 22473 177 1 0 27 60 147 0.220 4.83 1.07 Intr + 28225 28468 244 1 1 63 90 236 0.720 18.80 1.08 Intr + 31334 31441 108 1 0 103 87 185 0.999 20.38 1.09 Term + 32565 32762 198 2 0 111 47 243 0.999 19.90 1.10 PlyA + 33954 33959 6 1.05 2.00 Prom + 34736 34775 40 -7.16 2.01 Sngl + 38085 39869 1785 2 0 87 36 1252 0.966 114.42 2.02 PlyA + 39960 39965 6 1.05 3.05 PlyA - 40839 40834 6 1.05 3.04 Term - 43462 43302 161 2 2 101 47 49 0.029 0.20 3.03 Intr - 49660 49479 182 0 2 70 -7 109 0.073 -0.89 3.02 Intr - 50432 50310 123 1 0 60 106 38 0.026 2.50 3.01 Init - 64614 64544 71 0 2 100 106 35 0.759 7.02 3.00 Prom - 66494 66455 40 -4.26 4.00 Prom + 73683 73722 40 -4.96 4.01 Init + 74497 75073 577 0 1 27 -14 346 0.738 13.60 4.02 Term + 75523 76160 638 2 2 55 44 281 0.451 14.81 4.03 PlyA + 76542 76547 6 -0.45 5.00 Prom + 76910 76949 40 -2.46 5.01 Init + 93477 93543 67 2 1 73 94 41 0.720 4.43 5.02 Intr + 95731 95820 90 2 0 4 99 83 0.414 0.97 5.03 Intr + 96332 96391 60 0 0 100 93 18 0.595 2.21 5.04 Term + 96768 96844 77 1 2 86 42 90 0.726 2.20 5.05 PlyA + 97185 97190 6 1.05 6.06 PlyA - 99697 99692 6 1.05 6.05 Term - 100186 99998 189 1 0 104 48 271 0.999 22.15 6.04 Intr - 100794 100674 121 2 1 -1 109 130 0.607 6.70 6.03 Intr - 109011 108908 104 0 2 57 62 38 0.320 -2.93 6.02 Intr - 110272 110168 105 1 0 83 78 147 0.802 13.61 6.01 Init - 110448 110446 3 0 0 52 101 0 0.880 -2.30 6.00 Prom - 110743 110704 40 -7.96 7.00 Prom + 111308 111347 40 -6.26 7.01 Init + 112538 113726 1189 1 1 97 21 384 0.064 24.03 7.02 Intr + 123431 123484 54 0 0 78 111 31 0.337 3.35 7.03 Intr + 124929 125059 131 1 2 106 -18 90 0.104 0.61 7.04 Term + 139754 139924 171 1 0 113 52 63 0.166 3.03 7.05 PlyA + 140977 140982 6 1.05 8.00 Prom + 150199 150238 40 -4.06 8.01 Init + 161713 161870 158 0 2 68 95 80 0.294 6.08 8.02 Term + 172876 172969 94 1 1 101 46 42 0.185 -1.40 8.03 PlyA + 174833 174838 6 1.05 9.06 PlyA - 175636 175631 6 1.05 9.05 Term - 193315 193221 95 2 2 84 49 39 0.456 -2.41 9.04 Intr - 195633 195579 55 0 1 38 109 52 0.675 0.85 9.03 Intr - 197700 197570 131 1 2 70 99 128 0.987 12.51 9.02 Intr - 200844 200766 79 0 1 48 71 15 0.224 -5.08 9.01 Intr - 204998 204737 262 1 1 91 55 318 0.213 26.29 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_1|364_aa MIIIFIIDCRHSAYNHKIEELFPYDLVASRDFILDSGVSGFNLSNGNICYLGFGPVDQGP EQDLGGSRVELQGSRVRRGAKTSEAALRVFPGPWREAGGKEQVFLPFTVPPHPAAAQVSL PGSHRCSSALRKVSRKAKELTFSFDLGRERFLVTEVGRPLCRQPRAPLLSADGAGLPGWE RAKMMLQHPGQVSASEVSASAIVPCLSPPGSLVFEDFANLTPFVKEELRFAIQNKHLCHR MSSALESVTVSDRPLGVSITKAEVAPEEDERKKRRRERNKIAAAKCRNKKKEKTECLQKE SEKLESVNAELKAQIEELKNEKQHLIYMLNLHRPTCIVRAQNGRTPEDERNLFIQQIKEG TLQS >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_1|1095_bp atgatcatcatcttcatcatcgactgtagacacagcgcttataatcacaaaattgaagaa ctgtttccatatgacttggtagcttcaagggacttcatcttggattctggggtctctgga ttcaatctctcaaatgggaatatctgctacctgggttttggccctgtagaccaaggtcca gagcaggatctcggaggatcccgcgtggaactccagggctcccgggtccgccggggcgca aagacttccgaggccgccctccgcgtgttcccaggcccgtggagagaggcaggtggaaag gagcaggtgtttctgcccttcaccgtgcccccacaccctgcggccgcgcaggtctccctc ccaggcagccaccgctgctcctcggcgcttcgaaaagtttcccgcaaagcgaaagaacta actttctcttttgacttggggcgcgaaaggttcctggtgacggaggtcgggcgtccactc tgccgccagcctcgagcgccgcttctctccgccgacggcgctggcttgcccggctgggag agggccaaaatgatgcttcaacacccaggccaggtctctgcctcggaagtgagtgcttct gccatcgtcccctgcctgtcccctcctgggtcactggtgtttgaggattttgctaacctg acgccctttgtcaaggaagagctgaggtttgccatccagaacaagcacctctgccaccgg atgtcctctgcgctggaatcagtcactgtcagcgacagacccctcggggtgtccatcaca aaagccgaggtagcccctgaagaagatgaaaggaaaaagaggcgacgagaaagaaataag attgcagctgcaaagtgccgaaacaagaagaaggagaagacggagtgcctgcagaaagag tcggagaagctggaaagtgtgaatgctgaactgaaggctcagattgaggagctcaagaac gagaagcagcatttgatatacatgctcaaccttcatcggcccacgtgtattgtccgggct cagaatgggaggactccagaagatgagagaaacctctttatccaacagataaaagaagga acattgcagagctaa >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_2|594_aa MNADFLLPYYTAQSGSSMSMFNTTMGKLQRQLYKGEYDIFKYAPIFESDFIQITKRGEVI DVHNRVRMVTMGIARTSPILPLPDVMLLARPATGCEEYAGHGQATKRKKRKAAKNLELTR LLPLRFVRISVQDHEKQQLRLKFATGRSCYLQLCPALDTRDDLFAYWEKLIYLLRPPMES NSSTCGIPAEDMMWMPVFQEDRRSLGAVNLQGKGDQDQVSIQSLHMVSEVCGATSAAYAG GEGLQNDFNKPTNVLNASIPKTSTELAEEPATGGIKEAAAAGAAAGAATGTVAGALSVAA ANSAPGQVSAAIAGAATIGAGGNKGNMALAGTASMAPNSTKVAVAGAAGKSSEHVSSASM SLSREGSVSLAIAGVVLTSRTAAEADMDAAAGPPVSTRQSKSSLSGQHGRERTQASAEGC KEGRERREKDRALGRSSHRRRTGESRHKTRGDKIAQKSSSRSSFSHRANRDDKKEKGCGN PGSSRHRDSHKGVSHTPISKESRTSHKSGRSLWTTSSGSSKGLGRVSSFLRNVRANLTTK VVGTPHGRDVNVMAKMAERSTNVAIAETAEGGQGLETVGSMTPDIMETVTFEAH >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_2|1785_bp atgaatgctgattttctgctgccgtattatacggcccagagtggctccagcatgagcatg ttcaacaccaccatggggaaactgcagcgacaactgtacaagggggagtacgatatattc aagtatgcaccgatatttgagagcgactttatccagatcaccaaaaggggagaagtgatt gatgtgcacaaccgtgtccgtatggtgaccatgggcattgcacgtaccagccccatcctc ccactcccagatgtcatgctactggcacgaccggccaccggctgcgaagagtatgctgga catggccaggccaccaagagaaaaaaacgcaaggcagcaaagaacttagagctcaccagg cttctgcccctgaggtttgtacggatctctgttcaagaccatgagaaacaacagctgcgc ctgaagttcgccactggcagatcttgctatctgcaattgtgtcccgctcttgacacacgg gatgacctctttgcctattgggaaaaactaatttacctcttgcggccacccatggagagt aacagcagtacctgtggcattccagctgaagacatgatgtggatgcctgtgtttcaggaa gacaggaggagcctgggagccgtgaaccttcaaggaaagggggatcaggaccaggtcagc atccaaagcctccacatggtctctgaggtgtgtggggccacctctgctgcttatgctgga ggggagggactccaaaatgactttaacaaacccactaatgtgctcaatgcatccatcccc aaaacatctacagaacttgctgaggagccagcaacaggggggattaaagaggcagcagca gcaggggcagctgcaggggcagcaacaggcaccgtagcaggtgccttgagtgtggcagca gccaattctgcccctggacaggtgagcgcagccatagctggggcggccaccatcggtgca ggaggaaacaaaggcaacatggcccttgcaggcactgccagcatggctccaaacagcacg aaggtggctgtggcaggggctgcaggcaagtcctcagagcatgtttccagcgcatccatg agcctttcccgagagggcagtgtgagcctggccattgcaggagtagtactgaccagcagg acagctgcagaagcagacatggatgcagcagcgggacctcccgtctccacccggcagagc aagagcagcctgagtggacagcatggaagggagcgaacccaggccagcgctgaaggctgc aaggaggggagggaaagaagggaaaaggacagggctctcggaaggagttcccatcgccgc aggacaggtgaaagccgccacaaaacaaggggagacaagattgcccaaaagtcctccagc aggtcctcattcagccacagagccaatagagatgacaaaaaggagaaaggctgtggcaac ccggggagcagcaggcacagggactcgcataaaggtgtcagccacacgcccatctcaaag gagtccaggacctctcacaaatctgggaggagcttatggaccaccagttccggttccagc aagggacttggcagggtcagctctttcctgaggaacgtcagagccaaccttactacaaaa gtagtgggcacaccacatggcagagatgtgaacgtcatggctaagatggcggagaggagc accaacgtggccatcgccgagacagcagagggtggccaggggctggagacggttggttct atgacaccggacatcatggagacagtgacctttgaagcccattaa >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_3|178_aa MINYQEASHDYMLHMILQVLLTAGVGWKPSKAVRSAESAHDGARSVGSSVTPLGFNPGCT SGGLGPPVRASGDLLEGGREEGMYGCHHAWMEDQKVFITSICGAPNMCQTVVDLDTQKHG HSPKGALTCSNSTGPPEALLGDSCPAEKGPVGEHLTQFPADEKGRRVQEASTLRGLQH >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_3|537_bp atgattaattaccaggaagcaagtcacgactatatgcttcatatgattttgcaagtttta ttaacagctggtgtaggttggaagccttctaaggccgtgcgttctgcagagtcagcccat gatggggccaggagtgtgggctccagtgtcacccccttgggatttaatcctggctgcacc tctgggggccttgggcctcctgtcagagctagcggagatctgctggaaggaggccgggag gaggggatgtacggatgccatcatgcctggatggaggaccagaaggtctttataacaagt atttgtggagctcctaatatgtgccaaactgtggtagacctggatacccagaaacacggc cactcccctaaaggagccctgacctgcagtaacagcacagggccaccagaggctctattg ggcgactcatgccccgcggagaaaggcccagtaggagagcatctgacccagttcccagca gatgagaaaggaagaagagttcaggaggcttctactctcagaggtctccagcactga >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_4|404_aa MKKKREKNQIDSIKNDEGDITTDLTEIQTTIREYYKHLYANKLENLEEMDKFLNTYTLPR LNQEELESLNRPITDSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIE KEGILPNSFYEASIILIPKPGRDTTTKENFRPISLMNIDAKILSKTLANRIQQHIKNLST MIKWASSLGCKAGNFSNVSGYKINVQKSQAFLYTKNRQTESQIMSEFPFTTASKRIKYLG IQLTRDVKDLFKENYKPLLNEIKKDTNKWKNIPCSSVGRINIMKMAILPKVIYRFNAIPI KLPMTFFTELEKTTLKFIWNQKKARIAKSILSQKNKAGDITLPDFKLYYKATVTKTAWYW YQNRDIDQWNRTEPSEIMLHIYNHLIFDKPDKNKKRGNDSLFNK >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_4|1215_bp atgaagaagaaaagagagaagaatcaaatagactcaataaaaaatgatgaaggggatatc accaccgatctcacagaaatacaaactaccatcagagaatactataaacacctctacgca aataaactagaaaatctagaagaaatggataaattcctcaacacatacaccctcccaaga ctaaaccaggaagaacttgaatctctgaatagaccaataacagactctgaaattgaggca ataattaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagctgaattc taccagaggtacaaggaggagctggtaccattccttctgaaactattccaatcaatagaa aaagagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaagcct ggcagagacacaacaacaaaagagaattttagaccaatatccctgatgaacatcgatgca aaaatcctcagtaaaacactggcaaaccgaatccagcaacacatcaaaaacttatccacc atgatcaagtgggcttcatccctgggatgcaaggctggcaacttcagcaacgtctcagga tacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaaaaacagacaaacagag agccaaatcatgagtgaattcccattcacaactgcttcaaagagaataaaatatctagga atccaacttacaagggatgtgaaggacctctttaaggagaactacaaaccactcctcaat gaaataaaaaaggatacaaacaaatggaagaacattccatgctcatcagtaggaagaatc aatatcatgaaaatggccatactgcccaaggtaatttatagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaac caaaaaaaagcccgcattgccaagtcaatcctaagccaaaagaacaaagctggagacatc acgctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataatgctgcat atctacaaccatctgatctttgacaaacctgacaaaaacaagaaacggggaaacgattcc ctatttaacaaatga >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_5|97_aa MTGRNSSYEGLIQDKAAQDSRTVAFVLICDDISGLLKNIIPEKGDARTAKRTGLLHAARM TLPGLPGPQGREAQQAAALRLEMQMPSTSPGAQRQYE >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_5|294_bp atgacggggaggaatagcagctatgagggcctaattcaggacaaggcagcccaggacagc agaacagtagccttcgttctgatctgtgatgatatttctggtcttctcaagaatatcata ccagaaaaaggtgatgctagaacagccaagagaacaggcctccttcatgcagcacggatg accctgccaggacttcctggacctcaagggagggaagcccagcaggcagccgctctgaga cttgagatgcagatgcccagcacaagtcctggtgcccagcggcagtacgaatag >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_6|173_aa MSPEDDDRKVRRREKNRVAAQRSRKKQTQKADKLHELGRGAAPYQELALKASKDLSEEWA TIPCSCLCRGWSAQNKLDMDGFYIVWIGLRDLSDTLQFYASVNPDGEETEQEYESLEQEN TMLRREIGKLTEELKHLTEALKEHEKMCPLLLCPMNFVPVPPRPDPVAGCLPR >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_6|522_bp atgagccctgaggatgatgacaggaaggtccgaaggagagaaaaaaaccgagttgctgct cagagaagtcggaagaagcagacccagaaggctgacaagctccatgagttgggacgtgga gctgctccataccaggaactggccttgaaagcatccaaggatctatctgaggaatgggcc accattccatgttcttgtctttgtcggggttggagtgcccagaacaagttggatatggat ggtttctacattgtgtggattgggctaagggacctctctgataccctgcagttctacgcc tctgtaaaccctgatggagaggaaactgaacaggaatatgagagcctggagcaagaaaac accatgctgcggagagagatcgggaagctgacagaggagctgaagcacctgacagaggca ctgaaggagcacgagaagatgtgcccgctgctgctctgccctatgaactttgtgccagtg cctccccggccggaccctgtggccggctgcttgccccgatga >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_7|514_aa MGRPSGQCGAHVPGSTGHARPSQASAPSGFFPQRAQELAQEPFQEPRTLCARQGPWIAPI PQHPYAPTSACLRPHPPPLHLRSHHHGTPAPTSSPPAARRSPHPTALPEPLALYLLLRLR LRLVPGRRDAPLQDAAGGREPLRHAGRSSGPARPAPPARPARGAAYGRCPAAGILVPHPR PRAGPAPSGFPLSPLLHASAQCPRAPVPAAVLPVPRLPRPPSPTPASFPPLPALSPHLGV PNPTPAAAFRRRPALRGSHVVALKPRREREAHPREATPGGVGTRGAPCSLWGPSAAAGEL RLRNTVGVLGSVFFPGPIAASSCASATPCVRPPARGSQSRGTFSAAISHPGATLAPPAAP KPRLWGEDAGFTESHRAPRATGAPGTLQGVPALPVSGHVHIQSPLALHLGISMPDYKPYE DKGNVFIMLAPPNSVLAVVITSTRNDALKRRQLKKLAHQMPLSTLSLGTSDDNVCGWFIE FVLGLCPPPTTWLISAPSGTVLPLRLAPLQCPDR >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_7|1545_bp atggggcggccctcagggcagtgcggagcccacgtgccggggagcaccggccatgcccgg cccagccaggcgtcggcgccctcgggcttcttcccccagagggcgcaggagctggctcag gagccattccaggagccgcggacactgtgcgcacgacagggtccctggatcgcccccatt ccccaacatccctacgcccctacctctgcttgtctccggccgcacccgccacctctgcac ctccgcagtcaccaccacggaactccagcacccacctcctcgccccccgcggcgcgccgg tccccgcaccccacggccctccctgagcctctcgccctctacctgctgctgcggctgcgg ctgcggctggttcccgggcgccgcgacgctcctctgcaggacgctgccggcggccgggag cccttgcgacatgccgggcgctcctctggcccggcccgccccgccccgcccgcgcgccct gcccgtggggctgcctacgggcgctgtcccgccgccggcatcctcgtgccgcacccacgg cctcgcgccggccccgccccctcaggcttcccgctctccccgctactccacgcgtcggcc cagtgtccccgcgcgccagtcccggcagccgtccttcccgtcccacgtctgccccgccct ccttctcccacccccgcctcgtttcctcctctgcccgccctctctccgcacctcggggtc ccaaatcccacccctgccgcggctttccgccggcgccccgcgcttcgtggaagtcacgtg gtggctctgaaaccccggagggaacgcgaagcccacccacgcgaggcgacgcccgggggg gtggggaccaggggggcgccatgcagcctttggggtccttcggcggctgccggggaactc aggttgagaaacacagtgggtgttttggggtcggtgttttttcccggtcccatcgcggca tcttcctgcgcctcagccacgccctgcgtccgccccccggcgcgggggtcacagtcccgg gggactttctccgcggcgatcagtcatccgggcgccactctcgctcctccagcggccccg aagccgcggctctggggtgaagacgccggcttcaccgagtcccaccgcgcaccccgagcc acgggcgccccagggactctgcagggagtgcctgcccttcccgtctcgggacatgttcat atccaatcacctctggccctgcacctcggcatctccatgccagattacaaaccatatgag gacaaaggcaatgtcttcattatgctggcccctcccaactcagtgctagctgtagtaata accagcactagaaatgatgcattgaagcgcagacaattgaagaaacttgcccaccaaatg cccctgtccaccttgtctctgggcacatctgatgataacgtctgtggctggtttattgaa tttgttctaggcttgtgtcccccaccgaccacctggctcatctcagctcccagcggcact gtgttacccctgaggttggctcctcttcaatgtccagaccgctga >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_8|83_aa MEYYTTITKNFCYIQNVDESHRNNTNLKKVTQECILYDSIYMKFKKRQKSMMMAQKPLHL HIPGALLTSPLHPPGELQWHNTD >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_8|252_bp atggaatactacacaacaataacaaagaacttctgctacattcaaaacgtggatgaatct cataggaataacaccaatttaaagaaggtaacacaagaatgcatactatatgattccatt tacatgaagttcaagaaaagacaaaaatctatgatgatggcccaaaagcccctacatctc cacatccctggagccttgctgacatccccactgcatccacctggagagctgcagtggcac aacactgactga >gi568815597r:212586794_212799762|GENSCAN_predicted_peptide_9|207_aa XVPRPQFRRKMAGSPELVVLDPPWDKELAAGTESQALVSATPREDFRVRCTSKRAVTEML QLCGRFVQKLGDALPEEIREPALRDAQWTFESAVQENISINGQAWQEASDNCFMDSDIKV LEDQFDEIIVDIATKRKQYPRKILECVIKTIKAKQEILKQYHPVVHPLDLKYDPDPDTAT IRFLNLFPTFPPFLFHKTAIVIMARSQ >gi568815597r:212586794_212799762|GENSCAN_predicted_CDS_9|624_bp nnagtgccccgcccacagttccgacgaaaaatggcggggtctcctgagttggtggtcctt gaccctccatgggacaaggagctcgcggctggcacagagagccaggccttggtctccgcc actccccgagaagactttcgggtgcgctgcacctcgaagcgggctgtgaccgaaatgcta caactgtgcggccgcttcgtgcaaaagctcggggacgctctgccggaggagattcgggag cccgctctgcgagatgcgcagtggacttttgaatcagctgtgcaagagaatatcagcatt aatgggcaagcatggcaggaagcttcagataattgttttatggattctgacatcaaagta cttgaagatcagtttgatgaaatcatagtagatatagccacaaaacgtaagcagtatccc agaaagatcctggaatgtgtcatcaaaaccataaaagcaaaacaagaaattctgaagcag taccaccctgttgtacatccactggacctaaaatatgaccctgatccagacacggcaacc atccgatttctcaatcttttccccacctttcccccctttctattccacaaaaccgccatt gtcatcatggcccgttctcaatga