GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:07:54 Sequence gi568815593r:134506953_134724019 : 217067 bp : 46.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1866 1933 68 2 2 93 61 56 0.243 3.94 1.02 Intr + 2034 2120 87 0 0 119 95 18 0.582 4.69 1.03 Intr + 31037 31131 95 2 2 141 90 168 0.971 21.91 1.04 Intr + 37193 37403 211 0 1 20 46 165 0.549 3.47 1.05 Intr + 37514 37586 73 2 1 60 105 -25 0.396 -4.19 1.06 Intr + 45100 45257 158 0 2 122 83 225 0.706 24.21 1.07 Intr + 48379 48464 86 1 2 99 72 34 0.422 2.36 1.08 Intr + 49188 49240 53 1 2 96 80 -13 0.371 -2.67 1.09 Intr + 52685 52803 119 1 2 80 53 58 0.589 0.76 1.10 Intr + 52878 53038 161 0 2 124 69 219 0.994 23.23 1.11 Intr + 53794 54005 212 2 2 126 80 412 0.999 42.83 1.12 Intr + 55248 55415 168 2 0 118 90 179 0.646 21.24 1.13 Intr + 57542 57658 117 1 0 67 91 141 0.993 12.96 1.14 Intr + 59164 59628 465 0 0 128 96 899 0.996 88.02 1.15 Intr + 62919 62984 66 2 0 113 82 7 0.736 1.70 1.16 Intr + 66693 66810 118 2 1 70 80 68 0.875 4.24 1.17 Intr + 69816 69944 129 1 0 79 92 120 0.950 12.17 1.18 Term + 71542 72365 824 2 2 108 35 823 0.612 72.26 1.19 PlyA + 72837 72842 6 1.05 2.03 PlyA - 73625 73620 6 -0.45 2.02 Term - 74798 74538 261 2 0 75 53 124 0.528 3.13 2.01 Init - 76155 76084 72 0 0 83 60 48 0.275 2.75 2.00 Prom - 96938 96899 40 -4.96 3.05 PlyA - 98920 98915 6 1.05 3.04 Term - 100114 99998 117 1 0 95 41 117 0.999 6.24 3.03 Intr - 101551 101420 132 1 0 80 93 101 0.998 10.64 3.02 Intr - 102722 102619 104 0 2 47 95 62 0.938 2.69 3.01 Init - 105523 105502 22 1 1 71 116 12 0.426 2.39 3.00 Prom - 127882 127843 40 -6.26 4.00 Prom + 129005 129044 40 -4.66 4.01 Init + 135650 135652 3 1 0 108 81 0 0.741 1.30 4.02 Intr + 141484 141642 159 0 0 49 41 134 0.199 4.98 4.03 Intr + 142056 142221 166 2 1 87 113 69 0.317 8.83 4.04 Intr + 159925 160044 120 2 0 50 115 55 0.355 4.87 4.05 Intr + 164857 164934 78 2 0 76 72 90 0.817 5.72 4.06 Intr + 167746 167823 78 2 0 49 97 62 0.757 2.72 4.07 Intr + 168103 168265 163 2 1 59 61 94 0.859 2.83 4.08 Intr + 169071 169173 103 0 1 54 68 89 0.987 3.68 4.09 Intr + 172650 172776 127 2 1 117 56 80 0.981 7.95 4.10 Intr + 185650 185705 56 2 2 102 94 -8 0.713 -0.10 4.11 Intr + 186775 186981 207 0 0 57 72 86 0.489 3.17 4.12 Intr + 190174 190294 121 0 1 44 106 17 0.455 -0.83 4.13 Intr + 190947 191105 159 1 0 98 97 117 0.976 13.56 4.14 Intr + 196807 196980 174 2 0 135 94 20 0.946 7.21 4.15 Intr + 198375 198485 111 1 0 88 86 30 0.849 3.15 4.16 Intr + 201761 201936 176 0 2 41 110 41 0.333 1.26 4.17 Intr + 211117 211221 105 0 0 84 84 77 0.381 7.31 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:134506953_134724019|GENSCAN_predicted_peptide_1|1069_aa MALAAPQFGMSIFRNSGNKCLLLWSLTAFEPHHLLGLPLCVSPVLDTYSADLSCDIYIRI KMLQTAQQHQVGLAPTERKEALRGSQSPQKSASGQRPGSLDRELLMLIKISKCLRVPSGP SSVLGTESTGPVPGELVLCQQDAGATQEGVCYGRELMICRCDSFPRLMEPWGVKGTPEVF RTDLITAMKIPDSYQLSPDDYYILADPWRQEWEKGVQVPAGAEAIPEPVVRVKTPSPPID PSQLTPSSQLTSAVLATAEALSHSFWQPCPRPQDASQSSESLQGGQMVEEEVKEGVEGGG PCVVSKHTCPSAPELDWILPPLEGPPAQASPSSTMLGEGSQPDWPGGSRYDLDEIDAYWL ELINSELKEMERPELDELTLERVLEELETLCHQNMARAIETQEGLGIEYDEDVVCDVCRS PEGEDGNEMVFCDKCNVCVHQACYGILKVPTGSWLCRTCALGVQPKCLLCPKRGGALKPT RSGTKWVHVSCALWIPEVSIGCPEKMEPITKISHIPASRWALSCSLCKECTGTCIQCSMP SCVTAFHVTCAFDHGLEMRTILADNDEVKFKSFCQEHSDGGPRNEPTSEPTEPSQAGEDL EKVTLRKQRLQQLEEDFYELVEPAEVAERLDLAEALVDFIYQYWKLKRKANANQPLLTPK TDEVDNLAQQEQDVLYRRLKLFTHLRQDLERGSPNPLVPLPHPSPVTTSSCSQVRNLCYM VTRRERTKHAICKLQEQIFHLQMKLIEQDLCRERSGRRAKGKKSDSKRKGCEGSKGSTEK KEKVKAGPDSVLGQLAGLSTSFPIDGTFFNSWLAQSVQITAENMAMSEWPLNNGHREDPA PGLLSEELLQDEETLLSFMRDPSLRPGDPARKARGRTRLPAKKKPPPPPPQDGPGSRTTP DKAPKKTWGQDAGSGKGGQGPPTRKPPRRTSSHLPSSPAAGDCPILATPESPPPLAPETP DEAASVAADSDVQVPGPAASPKPLGRLRPPRESKVTRRLPGARPDAGMGPPSAVAERPKV SLHFDTETDGYFSDGEMSDSDVEAEDGGVQRGPREAGAEEVVRMGVLAS >gi568815593r:134506953_134724019|GENSCAN_predicted_CDS_1|3210_bp atggctttagcagcacctcaatttgggatgtctattttccggaacagcgggaataagtgt ctgctgctctggtctctcactgccttcgagccccatcacctgctcgggctccccctctgt gtgtccccagttctggacacttattcagcagacctgtcatgcgacatctacatccgcatc aagatgctccaaactgcccagcagcaccaagtcgggctggccccgacagaacgaaaagaa gccctccgaggaagtcagagtccccagaagtcagccagtgggcagaggcctgggtctctg gaccgggagctgctgatgctgatcaaaatcagcaagtgcttaagagtacctagtgggcct agctcagtgctggggacagaaagcacaggccctgtccctggggagcttgtcctttgtcag caggatgcaggtgctacacaggaaggtgtctgttatgggagggagttgatgatctgtaga tgtgactccttcccgagactcatggagccctggggggtgaagggaaccccagaggttttc cggacagacttgatcacagccatgaagatcccggactcataccagctcagcccggatgac tactacatcctggcagacccatggcgacaggaatgggagaaaggtgtgcaggtgcctgcc ggggcagaggccatcccagagcccgtggtgagggtaaagaccccctcccctcccatagat cccagccagctgacgccctcctcccagctgacctccgccgtcctggctacagctgaggcg ctgagtcacagcttctggcagccttgcccccgaccccaggatgcctctcaatcatctgag tctctgcagggtgggcagatggtggaggaggaagtgaaggagggggtggaaggaggtggg ccctgtgtggtgtcaaagcacacttgtccatcagctcctgagctggactggatcctccca ccactggaaggcccccctgcccaggcatccccgagcagcaccatgcttggtgagggctcc cagcctgattggccagggggcagccgctatgacttggacgagattgatgcctactggctg gagctcatcaactcggagcttaaggagatggagaggccggagctggacgagctgacatta gagcgtgtgctggaggagctggagaccctgtgccaccagaatatggccagggccattgag acgcaggaggggctgggcatcgagtacgacgaggatgttgtctgcgacgtgtgtcgctct cctgagggcgaggatggcaacgagatggtcttctgtgacaagtgcaacgtctgtgtgcat caggcatgctacgggatcctcaaggtgcccacgggcagctggctgtgccggacgtgtgcc ctgggtgtccagccaaagtgcctgctctgccccaagcgaggaggagccttgaagcccact agaagtgggaccaagtgggtgcatgtcagctgtgccctatggattcctgaggtcagcatc ggctgcccagagaagatggagcccatcaccaagatctcgcatatcccagccagccgctgg gctctgtcctgcagcctctgcaaggaatgcacaggcacctgcatccagtgttccatgcct tcctgcgtcacagcgttccatgtcacatgcgcctttgaccacggcctggaaatgcggact atattagcagacaacgatgaggtcaagttcaagtcattctgccaggagcacagtgacggg ggcccacgtaatgagcccacatctgagcccacggaacccagccaggctggcgaggacctg gaaaaggtgaccctgcgcaagcagcggctgcagcagctagaggaggacttctacgagctg gtggagccggctgaggtggctgagcggctggacctggctgaggcactggtcgacttcatc taccagtactggaagctgaagaggaaagccaatgccaaccagccgctgctgacccccaag accgacgaggtggacaacctggcccagcaggagcaggacgtcctctaccgccgcctgaag ctcttcacccatctgcggcaggacctagagaggggctcccccaacccattagtgcccttg cctcacccaagcccagtcaccacctccagctgctcccaggttagaaatctgtgctacatg gtgacaaggcgcgagagaacgaaacacgccatctgcaaactccaggagcagatattccac ctgcagatgaaacttattgaacaggatctgtgtcgagagcggtctgggaggagagcaaag ggcaagaagagtgactcgaagaggaagggctgcgagggctccaagggcagcactgagaag aaagagaaagtgaaggcggggcctgactcagtcctggggcagctggcaggcctgtccacc tcattccccatcgatggcaccttcttcaacagctggctggcacagtcggtgcagatcaca gcagagaacatggccatgagcgagtggccactgaacaatgggcaccgcgaggaccctgct ccagggctgctgtcagaggaactgctgcaggacgaggagacactgctcagcttcatgcgg gacccctcgctgcgacctggtgaccctgctaggaaggcccgaggccgcacccgcctgcct gccaagaagaaaccaccaccaccaccaccgcaggacgggcctggttcacggacgactcca gacaaagcccccaagaagacctggggccaggatgcaggcagtggcaaggggggtcaaggg ccacctaccaggaagccaccacgtcggacatcttctcacttgccgtccagccctgcagcc ggggactgtcccatcctagccacccctgaaagccccccgccactggcccctgagaccccg gacgaggcagcctcagtagctgctgactcagatgtccaagtgcctggccctgcagcaagc cctaagcctttgggccggctccggccaccccgcgagagcaaggtaacccggagattgccg ggtgccaggcctgatgctgggatgggaccaccttcagctgtggctgagaggcccaaggtc agcctgcattttgacactgagactgatggctacttctctgatggggagatgagcgactca gatgtagaggccgaggacggtggggtgcagcggggtccccgggaggcaggggcagaggag gtggtccgcatgggcgtactggcctcctaa >gi568815593r:134506953_134724019|GENSCAN_predicted_peptide_2|110_aa MGLGTMQQRQHRIMQGTAGTGHTQQPASLTDIWVLQDTGSATVQSKATANEEGPESRGQQ CEQRLLEEHQPARGKGPDSLHPLPNSPPTPATREAQPLPQDGVKGQGAKC >gi568815593r:134506953_134724019|GENSCAN_predicted_CDS_2|333_bp atggggctgggaactatgcagcagcggcaacacagaatcatgcagggcacagcgggcact ggccacacccagcagccagccagcctaacagacatctgggttctgcaggacacaggctca gcaacagtccagtccaaagcaacagccaatgaagaagggccagaatccagaggccaacaa tgtgagcagaggctcttggaagaacaccagccagcccgaggcaaggggccagactcgttg caccctctccctaattctccacccacccctgccaccagggaggcccagcccctgccccag gatggtgtgaaggggcaaggggccaagtgctga >gi568815593r:134506953_134724019|GENSCAN_predicted_peptide_3|124_aa MGFRHVQARRVWKNYLPAINGIVFLVDCADHERLLESKEELDSLMTDETIANVPILILGN KIDRPEAISEERLREMFGLYGQTTGKGSISLKELNARPLEVFMCSVLKRQGYGEGFRWMA QYID >gi568815593r:134506953_134724019|GENSCAN_predicted_CDS_3|375_bp atgggatttcgccacgttcaagctcgaagagtgtggaaaaactaccttcctgctatcaat ggcattgtatttctggtggattgtgcagaccacgaaaggctgttagagtcaaaagaagaa cttgattcactaatgacagatgaaaccattgctaatgtgcctatactgattcttgggaat aagatcgacagacctgaagccatcagtgaagagaggttgcgagagatgtttggtttatat ggtcagacaacaggaaaggggagtatatctctgaaagaactgaatgcccgacccttagaa gttttcatgtgtagtgtgctcaaaagacaaggttacggagaaggcttccgctggatggca cagtacattgattaa >gi568815593r:134506953_134724019|GENSCAN_predicted_peptide_4|702_aa MVIFTKPEERNCQSHPTKAATESQIAPRSWLCERGHVHNGGNFRSGDTRHFRPAQWSFSS LLVRCCRPRPAPSNPVIMSQPGIPASGGAPASLQAQNGAALASGSPYTNGAPHGPPPAGG PPPVRALTPLTSSYRDVPQPLFNSAVNQEGITSNTNNGSMVVHSSYDEIEGGGLLEHNTT WCNWSTTLFLELPKWATSLYSGANHLTTSMSGLSLQPEGLRVVNLLQERNMLPSTPLKPP VPNLHEDIQKLNCNPELFRCTLTSIPQTQALLNKAKLPLGLLLHPFKDLVQLPVVTSSTI VRCRSCRTYINPFVSFLDQRRWKCNLCYRVNDDVFIPMPENLLVNLNESKELVQDLLKTL PQMFTKTLETQSALGPALQAAFKLMSPTGGRMSVFQTQLPTLGVGALKPREEPNHRSSAK DIHMTPSTDFYKKLALDCSGQQVAVDLFLLSGQYSDLASLGCISRYSAGSVYYYPSYHHQ HNPVQVQKLQKELQRYLTRKIGFEAVMRIRCTKGLSIHTFHGNFFVRSTDLLSLPNVNPD AGYAVQMSVEESLTDTQLVSFQSALLYTSSKGERRIRVHTLCLPVVSTLNDVFLGADVQA ISGLLANMAVDRSMTASLSDARDALVNAVIDSLSAYRSSVLSNQQPGLMVPFSLRLFPLF VLALLKQGALNISDRTIPQPPILQLSVEKLSRDGAFLMDAGS >gi568815593r:134506953_134724019|GENSCAN_predicted_CDS_4|2106_bp atggtaatcttcacaaaacccgaggagaggaactgtcagtcccaccccacgaaagcggct actgaatctcagattgccccacgatcctggctctgcgagcgaggtcacgttcacaacggc gggaatttccgtagcggtgacacacggcacttccggccggcccagtggtctttcagctct cttcttgtgcgctgttgtcgaccccgaccagccccttccaacccagtcatcatgtcccag ccgggaataccggcctccggcggcgccccagccagcctccaggcccagaacggagccgcc ttggcctcggggtctccctacaccaacggagctcctcatgggccccctccagctggaggc ccacccccagtgagggccctcacgcccctgacatcatcatatagagatgtaccccagccc ttatttaattcagctgtcaaccaagaaggtattacatcaaataccaataacggatctatg gtggtccacagtagttacgacgagattgaaggaggtggcttattggaacacaacaccacc tggtgcaactggagtaccaccctcttccttgaattacccaagtgggccacaagcctttac tcaggtgctaatcatttaaccacaagcatgagtggattaagtctacaaccagagggtcta agagttgtcaatcttcttcaagaaagaaacatgcttccgtcaacacctttgaagcctcca gttccaaatttgcatgaagacatccagaaactcaactgtaacccagagttatttcgatgc acgctgactagcattcctcagacgcaggccttattgaataaagccaaacttcctttgggg ctgctgcttcatcctttcaaagacttagtgcaattgcctgtggttacctccagtacaatt gtgagatgccgttcatgcaggacgtacatcaatcctttcgtcagctttcttgatcaaagg agatggaagtgtaacttatgttatcgagtcaatgatgatgtttttatacctatgccagag aacttattagtaaacttaaatgaaagtaaagagctcgtgcaagatttactgaaaactttg ccacaaatgtttaccaagactctggagacccagagtgccttgggtcctgcactgcaggct gcctttaagctgatgtctccaactggtggtcgaatgtctgtctttcaaacacaactccca actcttggagtgggagccctgaaaccacgagaggaaccaaaccacaggtcatctgctaag gatatacacatgacaccatccactgacttctataagaaattagccttggactgttctggt cagcaagttgctgttgacttattccttctcagtggacagtattctgatttggcttctctg ggttgtatttctcggtattcagcaggtagtgtctattactatccctcttaccatcatcag cacaacccagtccaagtacagaaattacagaaggaactacagagataccttactcggaag attggctttgaggcagtcatgaggattcggtgcaccaaaggtctttccattcatactttc catggaaacttctttgttaggtcaaccgacttactgtctttgcctaacgtcaacccagac gctgggtatgcagtacagatgtcagtggaagagagtcttactgacactcagttggtttct tttcagtcagcactcttgtatacatccagcaaaggcgaaagaagaattcgtgttcatact ttgtgtttgccagtagtttcgactctgaatgatgtctttcttggagctgatgttcaagca atttcagggttattggccaatatggctgttgacagatctatgactgccagtctgagtgac gctcgggatgctctagtgaatgcagtcattgactccctttcagcttaccgttcttcagtc ttaagtaaccagcagcctggactcatggttcctttttctttgcggcttttcccacttttt gtgttggctctccttaaacagggagcactcaacatcagtgatagaaccatacctcagccc cccattcttcagctttcagtggagaagctgagcagagatggagctttcctcatggatgca ggctct