GENSCAN 1.0 Date run: 6-Nov-116 Time: 04:37:07 Sequence gi568815588r:124347432_124549943 : 202512 bp : 49.97% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1133 1259 127 2 1 59 90 57 0.578 3.68 1.02 Intr + 5863 5974 112 0 1 74 98 65 0.176 6.05 1.03 Intr + 16366 16494 129 2 0 27 115 96 0.201 6.87 1.04 Intr + 18816 18896 81 1 0 119 100 17 0.823 5.61 1.05 Intr + 22075 22133 59 2 2 58 90 23 0.196 -1.90 1.06 Intr + 23512 23652 141 0 0 78 42 57 0.159 0.65 1.07 Intr + 27465 27567 103 2 1 59 77 71 0.602 2.85 1.08 Intr + 38183 38278 96 0 0 73 61 88 0.375 4.58 1.09 Intr + 41944 42068 125 2 2 64 1 139 0.540 3.10 1.10 Term + 46681 46740 60 0 0 122 54 33 0.507 1.10 1.11 PlyA + 48752 48757 6 1.05 2.12 PlyA - 49895 49890 6 1.05 2.11 Term - 50671 50511 161 2 2 63 55 172 0.999 9.50 2.10 Intr - 53553 53409 145 0 1 57 86 134 0.999 9.96 2.09 Intr - 54408 54295 114 0 0 81 88 141 0.999 14.04 2.08 Intr - 55624 55496 129 1 0 77 95 78 0.996 8.29 2.07 Intr - 58132 58005 128 2 2 104 53 146 0.999 13.10 2.06 Intr - 61206 61111 96 1 0 86 91 46 0.933 4.68 2.05 Intr - 61534 61310 225 2 0 105 98 112 0.998 11.86 2.04 Intr - 64769 64542 228 0 0 102 102 57 0.815 6.34 2.03 Intr - 71772 71699 74 2 2 43 92 51 0.526 0.05 2.02 Intr - 72360 72237 124 1 1 67 106 47 0.185 4.04 2.01 Init - 86824 86776 49 1 1 82 58 46 0.542 0.11 2.00 Prom - 95230 95191 40 -3.96 3.04 PlyA - 95447 95442 6 1.05 3.03 Term - 100716 99998 719 1 2 99 41 741 0.986 64.15 3.02 Intr - 102637 102299 339 2 0 101 99 199 0.813 17.75 3.01 Init - 104761 104722 40 1 1 106 86 -30 0.321 -2.56 3.00 Prom - 106503 106464 40 -3.26 4.00 Prom + 111254 111293 40 -6.46 4.01 Init + 114432 114556 125 2 2 106 109 352 0.977 36.74 4.02 Intr + 114705 114793 89 0 2 66 92 -26 0.040 -4.79 4.03 Intr + 121333 121461 129 2 0 77 36 105 0.014 4.87 4.04 Intr + 125698 125848 151 2 1 71 42 81 0.012 1.02 4.05 Intr + 127468 127585 118 1 1 107 49 49 0.012 3.37 4.06 Intr + 134282 134376 95 1 2 78 98 31 0.046 1.86 4.07 Intr + 136708 136895 188 1 2 91 78 257 0.948 24.33 4.08 Intr + 140991 141144 154 1 1 126 110 184 0.996 23.53 4.09 Intr + 147368 147498 131 2 2 52 86 63 0.184 2.84 4.10 Intr + 149433 149593 161 1 2 56 66 182 0.245 12.51 4.11 Intr + 150605 150697 93 1 0 98 96 49 0.975 6.86 4.12 Term + 157343 157519 177 1 0 78 54 113 0.817 4.59 4.13 PlyA + 157957 157962 6 1.05 5.00 Prom + 163812 163851 40 -4.16 5.01 Init + 169755 169840 86 2 2 85 82 138 0.454 13.24 5.02 Intr + 173859 173955 97 0 1 105 89 35 0.747 5.31 5.03 Term + 174868 175053 186 0 0 96 54 103 0.363 5.19 5.04 PlyA + 175682 175687 6 1.05 6.12 PlyA - 176638 176633 6 1.05 6.11 Term - 179863 179816 48 1 0 119 44 37 0.141 -0.30 6.10 Intr - 183563 183479 85 1 1 79 81 32 0.293 1.42 6.09 Intr - 184895 184703 193 0 1 76 89 76 0.726 5.05 6.08 Intr - 185402 185331 72 0 0 49 92 62 0.270 2.08 6.07 Intr - 192754 192683 72 2 0 45 97 89 0.876 4.88 6.06 Intr - 192957 192891 67 0 1 40 48 89 0.765 -1.42 6.05 Intr - 194527 194471 57 1 0 83 91 41 0.756 2.98 6.04 Intr - 195052 194974 79 0 1 48 94 58 0.359 1.95 6.03 Intr - 196022 195685 338 2 2 103 68 48 0.383 -1.38 6.02 Intr - 196703 196571 133 1 1 56 85 59 0.306 3.05 6.01 Init - 202341 202262 80 0 2 84 67 51 0.033 1.14 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_1|344_aa XAAWLRAAPKPFQQQNRSAKAQGGLTAQIQRVLPPPDTPSPLQAASGDPALKSPGMATCP SVPVGDPPSLSPMGAESTGQGKKKGVRCADSTGHMDEDQPRHFPPRKFGVFVAMQKSGSF LQPVCLSFLTAWQLVSKKARGDTAKPLRVQAPIIPTCPGKDLVGGNCIMEGPSTGGLGKQ SLDRPARILTPLPGKPLSAAGGKAKTGSIKMRIPVPAGVFEKQINPSDNRAAAWVLVYCS FNGVVTSLRGQAYLENVNGFTQCTEYVSEPASGHPRAEGTPISGRPGSNALRAAPPAVPT RVGPYGCEQFRPKAPGPGLGVIDLRGTKCRQQVPAHLFTTGHRP >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_1|1035_bp nnggccgcctggttgagagcagcaccaaagccattccaacagcagaaccgcagtgccaag gctcaaggtgggctaacagctcagattcagcgagttctacctcccccagacacccccagc cctctgcaggcagcttctggagatcccgccctcaagagtccaggcatggccacctgcccc tcagtccccgtgggagatcctccgagcctgagccccatgggtgcagagagtactggccaa ggaaagaagaaaggtgtgcgctgtgcagacagcaccggtcacatggatgaggaccaaccc agacatttcccaccacgcaaattcggggtctttgttgcaatgcagaagtcgggatctttc ctgcaaccagtctgcctcagcttccttacagcatggcagctggtttccaagaaagcaaga ggggacactgctaagcctcttcgagttcaagctcccataattcccacatgtcctgggaag gacctggtgggaggtaactgcatcatggagggtccaagcacaggtggacttggtaagcag agcctcgacagacctgcccgcatactgacacccctgccagggaagcccctcagtgcagcg gggggcaaagcgaaaacgggaagtatcaagatgaggattccagtccctgctggagtcttt gagaagcaaattaacccctcggacaacagggcagcagcctgggtacttgtttactgttcc ttcaatggcgttgtcacgtcccttcgtgggcaagcgtacctggagaatgtgaatggattt acacagtgcaccgagtacgtgagcgagcctgcctccggtcaccccagggctgaagggacc cccatctcaggacggccgggaagcaacgctttgcgggcagcgccccctgccgtcccaacg cgcgtggggccctatggctgcgagcagttccggccaaaagccccaggacccggacttggg gttattgacttgcgcggcactaagtgccgtcagcaggttcctgcccacctgttcaccact ggacataggccctga >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_2|490_aa MGFCYVGQAGLELLTSAGPPTGHCGLMECGAGSCQRLPSERGGEGHTGPDCPYCWMTGLL VSKGGQGLFSSRQRRRPRGLSSDLWFFYLKDTMFSKLAHLQRFAVLSRGVHSSVASATSV ATKKTVQGPPTSDDIFEREYKYGAHNYHPLPVALERGKGIYLWDVEGRKYFDFLSSYSAV NQGHCHPKIVNALKSQVDKLTLTSRAFYNNVLGEYEEYITKLFNYHKVLPMNTGVEAGET ACKLARKWGYTVKGIQKYKAKIVFAAGNFWGRTLSAISSSTDPTSYDGFGPFMPGFDIIP YNDLPALEVLFIADEIQTGLARTGRWLAVDYENVRPDIVLLGKALSGGLYPVSAVLCDDD IMLTIKPGEHGSTYGGNPLGCRVAIAALEVLEEENLAENADKLGIILRNELMKLPSDVVT AVRGKGLLNAIVIKETKDWDAWKVCLRLRDNGLLAKPTHGDIIRFAPPLVIKEDELRESI EIINKTILSF >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_2|1473_bp atgggcttttgctatgttggccaggctggtctcgaactcctgacctcagctgggcccccg actggtcattgtggtctaatggagtgtggtgccggcagctgccagcgcctgccctctgag aggggtggcgaggggcacacaggaccagactgcccctactgttggatgactggtcttctc gtcagcaagggcggacaaggactcttctcgtcccgccagaggaggcgaccgaggggcctg agctcagatctgtggtttttctacttgaaggacacaatgttttccaaactagcacatttg cagaggtttgctgtacttagtcgcggagttcattcttcagtggcttctgctacatctgtt gcaactaaaaaaacagtccaaggccctccaacctctgatgacatttttgaaagggaatat aagtatggtgcacacaactaccatcctttacctgtagccctggagagaggaaaaggtatt tacttatgggatgtagaaggcagaaaatattttgacttcctgagttcttacagtgctgtc aaccaagggcattgtcaccccaagattgtgaatgctctgaagagtcaagtggacaaattg accttaacatctagagctttctataataacgtacttggtgaatatgaggagtatattact aaacttttcaactaccacaaagttcttcctatgaatacaggagtggaggctggagagact gcctgtaaactagctcgtaagtggggctataccgtgaagggcattcagaaatacaaagca aagattgtttttgcagctgggaacttctggggtaggacgttgtctgctatctccagttcc acagacccaaccagttacgatggttttggaccatttatgccgggattcgacatcattccc tataatgatctgcccgcactggaggttctctttattgctgatgaaatacagacaggattg gccagaactggtagatggctggctgttgattatgaaaatgtcagacctgatatagtcctc cttggaaaggccctttctgggggcttataccctgtgtctgcagtgctgtgtgatgatgac atcatgctgaccattaagccaggggagcatgggtccacatacggtggcaatccactaggc tgccgagtggccatcgcagcccttgaggttttagaagaagaaaaccttgctgaaaatgca gacaaattgggcattatcttgagaaatgaactcatgaagctaccttctgatgttgtaact gccgtaagaggaaaaggattattaaacgctattgtcattaaagaaaccaaagattgggat gcttggaaggtgtgtctacgacttcgagataatggacttctggccaagccaacccatggc gacattatcaggtttgcgcctccgctggtgatcaaggaggatgagcttcgagagtccatt gaaattattaacaagaccatcttgtctttctga >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_3|365_aa MAHACNLSTLRTQASLGSADPAPGWKATAKPQPKASPSGAIPNPAPPTAGPWATGMLAWQ DGGAKAAPSHHKISFSVLDILDPQKFTRAALPAVRPAPREARKSLAEVEAGKDASSRDPV RQLETPDAAGPGAGQASPLEGSEAEEEEDAEDPRRPRLRERAARLLPGLARSPDAPAGAL ASGEPCEDGGGGPVRSPPGSPGSPRPRRRRLEPNCAKPRRARTAFTYEQLVALENKFRAT RYLSVCERLNLALSLSLTETQVKIWFQNRRTKWKKQNPGADGAAQVGGGAPQPGAAGGGG GGGSGGSPGPPGTGALHFQTFPSYSAANVLFPSAASFPLTAAAPGSPFAPFLGPSYLTPF YAPRL >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_3|1098_bp atggctcatgcctgtaatctcagcactttgagaacccaagcttcacttggcagcgcggac ccggctcctggctggaaagctaccgccaagccacagccgaaggcaagcccgagcggcgcc atccccaaccccgcgccgccgaccgccggcccgtgggcgacgggcatgctggcatggcag gacggcggggccaaggcggctccctcccaccacaagatttctttctctgtcctggacatc ctggacccacagaaattcacccgcgcagcgctccctgccgtgcgcccggctccccgggaa gccaggaaaagtttggcggaggtcgaagcggggaaagatgccagctccagggaccctgtc cgacagctggagacccctgatgctgcgggcccaggcgccggccaggcgtcccccctggag ggttccgaggcggaagaggaggaggatgcggaggatccgaggaggccgcggctgcgggag cgggctgcgcgcttgctgccgggcctagcgcgctcacctgacgccccggccggggcattg gcgtctggggagccctgcgaggacggcgggggcggccctgtgaggtcccccccgggatcc cccggctccccgcgtcccaggcgccggcgcctggagcccaactgcgccaagccgcggcgc gcgcgcaccgccttcacctacgagcagctggtggccttggagaacaagttccgggccacg cgctacctgtcagtgtgcgagcgcctgaacctcgcgctgtctctcagcctcaccgagacg caggtcaaaatctggttccagaatcgcaggaccaagtggaagaagcagaacccgggtgcc gacggcgcggcgcaggtggggggtggcgcgccccagccaggggcggcggggggcggcggc ggcggcggctcggggggcagtcctggccctcccggcaccggcgctctgcacttccagact ttcccctcctactccgcggccaatgtcctcttcccgtccgccgcctccttcccgctgacg gctgccgcccccgggagccctttcgcgccgttccttgggccttcctacctgacccccttc tacgccccgcgtctatga >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_4|536_aa MAPWGKRLAGVRGVLLDISGVLYDSGAGGGTAIAGSVEAVASADHGRPHCCPRRAPGLCW LGLRGQTGRPPGQGPSTLAQCLWGQIITLRAREAPYTLPRMPVGLVADAQSGCEGAASGW KPLANYKGSVAQVTKLMNGVLPLPPVPHNVQEAILAKQCHTCPLWAPGFLSECTCPNSGF VRDVVVIVTTTAITAIYLVFEMCQPHFLAPPSTYPTLLEFFLSERVGPVHTCLCARLKRS RLKVRFCTNESQKSRAELVGQLQRLGFDISEQEVTAPAPAACQILKEQGLRPYLLIHDGV RSEFDQIDTSNPNCVVIADAGESFSYQNMNNAFQVLMELEKPVLISLGKGLSVGFFASPP DCESLWSRLGLGVSLSPAKGCGASGIVNAKGDQASGTGPVLSSRYLASCGQLPVLSIPCS VLLSPRRYYKETSGLMLDVGPYMKALEYACGIKAEVVGKPSPEFFKSALQAIGVEAHQLF SQSASFTFRMRYSPPHRRKKTPHGGLVNLGLSFSDAGAVRLGGSGAVDSWWVTVAA >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_4|1611_bp atggcaccgtggggcaagcggctggctggcgtgcgcggggtgctgcttgacatctcgggc gtgctgtacgacagcggcgcgggcggcggcacggccatcgccggctcggtggaggcggtg gccagtgctgaccacggacgaccccactgttgcccccggcgagcaccaggactctgctgg ttagggctgcgcggtcagacagggcggccacctgggcaaggcccctccactctggcccag tgcctctggggccagatcatcaccctcagggcccgggaagctccgtacaccctccctcgc atgcctgtggggctcgtggcagatgcccagagtggctgcgaaggtgctgcctctggttgg aagcccttggccaattacaaggggagtgtggcacaagttacaaagctcatgaatggagtc ctgcccctacccccagtgccccacaacgtgcaagaagccatattggcaaagcagtgccac acctgcccactgtgggcacctggcttcctctcagaatgtacctgccccaacagtgggttt gtcagagacgtggttgtcattgtcaccaccacagcaattactgccatttatttagtattt gaaatgtgccagcctcatttcctggctccaccttccacataccccacgctgctcgagttc tttctatccgagagggtggggccagtgcacacctgcctgtgcgcaagactgaagcgttcc cggctgaaggtgaggttctgcaccaacgagtcgcagaagtcccgggcagagctggtgggg cagcttcagaggctgggatttgacatctctgagcaggaggtgaccgccccggcaccagct gcctgccagatcctgaaggagcaaggcctgcgaccatacctgctcatccatgacggagtc cgctcagaatttgatcagatcgacacatccaacccaaactgtgtggtaattgcagacgca ggagaaagcttttcttatcaaaacatgaataacgccttccaggtgctcatggagctggaa aaacctgtgctcatatcactgggaaaaggtttgtctgtgggcttctttgcttccccacca gactgtgagtccctgtggagcaggctggggctgggagtatccctgagtcctgcaaagggc tgtggagcatctgggatcgttaatgccaagggggaccaagccagcgggacaggcccggtg ctcagctcccgatacttagcatcctgcggtcagctccccgtgctcagcatcccgtgctcc gttctgctctctcctaggcgttactacaaggagacctctggcctgatgctggacgttggt ccctacatgaaggcgcttgagtatgcctgtggcatcaaagccgaggtggtggggaagcct tctcctgagtttttcaagtctgccctgcaagcgataggagtggaagcccaccagttattt tcacagtcagcttccttcacatttcgaatgagatacagcccacctcaccgccgaaagaag acccctcatgggggtctggtgaatctgggtttgtcgttttcggatgctggggcagtgcgt ctgggagggagcggggctgtggattcctggtgggtcaccgtggctgcttga >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_5|122_aa MIGDDIVGDVGGAQRCGMRALQVRTGKFRKYPQPGSMEQHLALPGPPPALWHLSTAPFCT ADSWRIGHGCSPRLLEVPSSHSVPAAQALDNSVSSVGGPVPWVANPHDAMSPAAPEGAAA LP >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_5|369_bp atgattggggacgatatcgtgggcgacgtcggcggtgcccagcggtgtggaatgagagcg ctgcaggtgcgcaccgggaagttcaggaaatacccccagccaggctctatggagcagcac ctcgcccttccagggccacccccagccctctggcacctcagcacggccccattctgcaca gcggactcctggcgcattggacatggctgctcccccagactcctcgaggtcccatccagc cacagcgttcctgctgcccaggccctggacaacagtgtctcctccgtggggggtcctgtg ccatgggtggcaaatcctcatgatgccatgagtcctgcagccccagagggtgctgctgcc ctgccgtga >gi568815588r:124347432_124549943|GENSCAN_predicted_peptide_6|407_aa MPFPAMPTPAMPTGPEAGPAWQPLPQISAGLSASAIRGASQGHPTAVTRLGCEARVRQLW EGTITVPGEELGTSAVPSQHITAATVQPPAPALSSLERWLWDAGRPRTWADRGCLGVWRT GSVYPQVGHKLLSLWGVRPANPCGGHTGRSSQGGGVWEDEGLYCPSWNGGIFSPSQPSDK ALSRETLDLSTQAGNRLPGPKSHLLAASPQGQHVGTGHSANDQMSASPQPGHEAKREKPG WVPGRKSLPPPASPPKRTDTETARRCMNGWARAPAGAIGTVGTDMLALSCYTLTSWVMLG PEQQCPETTQAHLGPWAEILNSALHWKELGLLGEMADSRAEAKTEQMNLKHLFGPKCKEG LKEREHWEAPNAQSVFLGLSLTRCQGDKKDGLAGKDVAIKGSKMQIP >gi568815588r:124347432_124549943|GENSCAN_predicted_CDS_6|1224_bp atgcctttcccggccatgcccaccccggccatgcccacagggccagaggcaggcccagcc tggcagccccttccccagatctctgctgggctctcagcatcagccatccgaggggcctcg cagggtcaccccactgctgtcaccaggcttggctgcgaggccagggtgcggcagctctgg gaaggcacaatcactgtcccaggagaggaactgggcacctctgccgttcccagtcagcac atcaccgcagccacggtacagccacctgcgcctgcactgtcctcgctggaaagatggctg tgggatgcgggcagaccccgcacctgggccgaccgggggtgtctaggggtctggaggaca ggcagtgtctaccctcaggttggccacaaactcctgtccctctggggagtgcgccctgcc aacccctgtgggggccatacaggaaggagcagccagggaggtggagtctgggaggatgaa ggcttgtactgcccgagctggaacgggggcattttctctcccagtcaaccctcggacaag gccctgtccagggagaccctggacctgagcacacaggctggaaacagactgccagggccc aagtcccacctgctggctgcatcccctcagggacagcatgtgggcacagggcactctgcg aatgaccagatgtcagcatctccccagcctggccacgaagcaaagcgggagaaaccaggc tgggtccccggccgcaagtccctgccgccccctgcatctccaccgaagcgcacagacaca gaaacagcacggcgctgcatgaacggctgggccagggcacctgcaggggccatagggaca gttgggacagacatgctggcactgtcctgctacacccttaccagctgggtgatgctgggg cctgagcagcaatgcccagagacaacacaagcacacctggggccctgggctgagattcta aacagtgccctccactggaaggaactggggctccttggagaaatggccgattccagggct gaagcaaagactgaacaaatgaacctgaaacatctttttgggccaaaatgcaaggaaggg ctcaaagaacgggaacactgggaggcccccaatgcccagagcgtgttcttaggcctctct ctgacaaggtgccagggagacaagaaggacgggctggctggaaaagatgtagcaatcaaa gggtcaaaaatgcagattccctga