GENSCAN 1.0	Date run: 3-Nov-116	Time: 07:38:25

Sequence gi568815589r:33988353_34226431 : 238079 bp : 45.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.14 Intr -    774    621  154  0  1   94  101   124 0.997  13.95
 1.13 Intr -   7981   7871  111  1  0   69   98    91 0.997   8.78
 1.12 Intr -  10512  10435   78  0  0   71   91    73 0.960   5.65
 1.11 Intr -  28783  28698   86  2  2   15  111    41 0.001  -1.36
 1.10 Intr -  60435  60318  118  0  1  136   36    84 0.074   8.04
 1.09 Intr -  83008  82924   85  0  1   46   61    71 0.006   0.02
 1.08 Intr -  95769  95739   31  1  1   70   45    41 0.002  -4.81
 1.07 Intr - 100156 100048  109  1  1   93   15   153 0.858   8.36
 1.06 Intr - 101238 101060  179  1  2  135   78   174 0.989  20.74
 1.05 Intr - 105096 104934  163  0  1   95   83    83 0.995   8.05
 1.04 Intr - 110165 109972  194  0  2   98   86   219 0.995  21.91
 1.03 Intr - 118142 118082   61  2  1   76   63    51 0.973  -0.29
 1.02 Intr - 119213 119007  207  2  0   92   95   251 0.762  25.47
 1.01 Init - 121285 121178  108  1  0   78  -25    99 0.366  -2.18
 1.00 Prom - 122814 122775   40                              -9.16

 2.00 Prom + 125294 125333   40                              -7.46
 2.01 Init + 126256 126314   59  0  2   72  121    42 0.953   6.40
 2.02 Intr + 137296 137660  365  1  2   82   75   156 0.295   8.53
 2.03 Intr + 138494 138612  119  0  2   29   53   129 0.226   3.68
 2.04 Term + 139420 139569  150  0  0   62   40    69 0.369  -2.59
 2.05 PlyA + 140740 140745    6                               1.05

 3.06 PlyA - 141162 141157    6                               1.05
 3.05 Term - 145551 144943  609  0  0   52   45   970 0.845  83.60
 3.04 Intr - 146226 145613  614  1  2   71    2   845 0.790  66.30
 3.03 Intr - 146461 146320  142  1  1    4   31   164 0.459   2.23
 3.02 Intr - 163329 163148  182  1  2   36   64   121 0.431   4.09
 3.01 Init - 170795 170786   10  2  1   79   89    -3 0.243  -0.47
 3.00 Prom - 184127 184088   40                              -4.06

 4.00 Prom + 184152 184191   40                              -5.36
 4.01 Init + 190709 190772   64  1  1   99   26   125 0.344   6.75
 4.02 Term + 221857 221939   83  2  2   91   48    95 0.459   3.66
 4.03 PlyA + 222065 222070    6                               1.05

 5.02 PlyA - 222791 222786    6                               1.05
 5.01 Term - 236048 235871  178  1  1    6   41   221 0.418   6.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:33988353_34226431|GENSCAN_predicted_peptide_1|562_aa
MVPKPIQRMFDAKNMMAAWDPPLWLLLSSCHVQGPYLFVVDVQTSQITKIPILKDREPGG
VTQQGCGIHAIELNPSRTLLATGGDNPNSLAIYRLPTLDPVCVGDDGHKDWIFSIAWISD
TMAVSGSRDGSMGLWEVTDDVLTKSDARHNVSRVPVYAHITHKALKDIPKEDTNPDNCKV
RALAFNNKNKLLSTKLPYCRENVCLAYGSEWSVYAVGSQAHVSFLDPRQPSYNVKSVCSR
ERGSGIRSVSFYEHIITVGTGQGSLLFYDIRAQRFLEERLSACYGSKPRLAGENLKLTTG
KGWLNHDETWRNYFSDIDFFPNAVYTHCYDSSGTKLFVAGAPPKKNPMKEIIFGSKVIKG
DNNSNSYGIQPGGLGLNGNGPERCSVCFLILGSARLKPHRLQTAPAPTREDRAELHSPVS
SDHCRGAREKPQISAAQSTQPQKQVVQATAEQMRLAQVIFDKNDSDFEAKVKQLMEVTGK
NQDECIVALHDCNGDVNKAINILLEGNSDTTSWETVGCKKKNFAKENSENKENREKKSEK
ESSRGRGNNNRKGRGGNRGREX

>gi568815589r:33988353_34226431|GENSCAN_predicted_CDS_1|1686_bp
atggtgcccaagcccatccagcggatgtttgatgccaagaatatgatggctgcgtgggac
ccccctctgtggctgttgcttagcagctgccatgttcaggggccgtacctatttgtcgta
gatgtccagacaagccagatcaccaagatccccattctgaaagaccgggagcctggaggt
gtgacccagcagggctgtggtatccatgccatcgagctgaatccttctagaacactgcta
gccactggaggagacaaccccaacagtcttgccatctatcgactacctacgctggatcct
gtgtgtgtaggagatgatggacacaaggactggatcttttccatcgcatggatcagcgac
actatggcagtgtctggctcacgtgatggttctatgggactctgggaggtgacagatgat
gttttgaccaaaagtgatgcgagacacaatgtgtcacgggtccctgtgtatgcacacatc
actcacaaggccttaaaggacatccccaaagaagacacaaaccctgacaactgcaaggtt
cgggctctggccttcaacaacaagaacaagctcctctccaccaaactgccatattgccgt
gagaatgtgtgtctggcttatggtagtgaatggtcagtttatgcagtgggctcccaagct
catgtctccttcttggatccacggcagccatcatacaacgtcaagtctgtctgttccagg
gagcgaggcagtggaatccggtcagtgagtttctacgagcacatcatcactgtgggaaca
gggcagggctccctgctgttctatgacatccgagctcagagatttctggaagagaggctc
tcagcttgttatgggtccaagcccagactagcaggggagaatctgaaactaaccactggc
aaaggctggctgaatcatgatgaaacctggaggaattacttttcagacattgacttcttc
cccaatgctgtttacacccactgctacgactcgtctggaacgaaactctttgtggcagga
gctcctccaaagaagaatccaatgaaagaaataatctttggttctaaggtgataaaaggt
gataataattctaactcctatggaattcaaccaggaggattgggcctgaatggaaacggc
cctgagcgctgcagcgtgtgctttctaattctgggttcagctcgtctgaaaccccaccgc
ctgcagactgccccggcgcccacccgtgaggaccgggccgagcttcactcgccagtgagc
agtgaccattgtcgaggtgctcgggaaaaaccacagatttcagcagcacaatcaacgcaa
ccacagaaacaagtggtacaggcaacagctgaacagatgcgtctcgctcaagtgatcttt
gataagaatgattcagattttgaagctaaagttaagcagcttatggaagtgacagggaaa
aatcaggatgaatgcatagtggccctacatgattgtaatggagatgtgaacaaagctatc
aatatattgctggaagggaattcagacacaacttcatgggagactgtagggtgtaagaaa
aagaattttgcaaaagaaaattcagaaaacaaagagaatagagagaagaaaagcgagaaa
gaatcgagtcgtggacgtggaaacaacaaccggaaaggaagaggcggcaatcgtggcaga
gaatnn

>gi568815589r:33988353_34226431|GENSCAN_predicted_peptide_2|230_aa
MRLLHINLLGSLPETSSAASLYSHWSTEDTVQGARPGSQVFQGCIINDRSSVIMPALQAV
AEFEKSTANGVVGPREIRGHIYVQEGLPLQRGRHRDDTWELLFLLGSKRRKAQLRVGYTY
RETTSFFQEGRNHPSFLGPMEGHFQRLFGYCWMEVDELGPRESGFWPPTSDPSPPIRRHP
EASGLRPHTSKELCWTISSSVNKEIWICFPSRIKDQIEAGSPAIPSRHYH

>gi568815589r:33988353_34226431|GENSCAN_predicted_CDS_2|693_bp
atgaggctgctccacattaacctactgggatccctgccagaaactagcagcgcggcaagt
ctgtactcccactggagcacggaagatacagtccaaggagccagacctggctcccaggtc
ttccagggatgcatcatcaatgaccgttcctctgtcatcatgcccgcgctccaggcagtt
gcagaattcgaaaaatcaaccgccaacggggtagtaggcccccgggaaatcagaggccat
atctatgtccaagaaggccttcccctccaaagggggcggcatagagatgatacctgggaa
ctgttgtttcttttggggtccaaacgccgaaaggcccagctcagggttgggtatacatat
agggagactacttccttcttccaggagggacgcaatcatcccagcttcctagggcccatg
gaaggccattttcaacggttgtttggctactgctggatggaagtggacgagctggggcca
agggaatctggcttctggccgccgacctccgaccccagcccgcctatccgcaggcacccg
gaagcttcaggcctcaggcctcatacctccaaggagctctgttggaccatctcaagttct
gtcaacaaggagatttggatttgttttccaagccgaatcaaagaccagattgaagcaggg
tctcctgccatcccaagtcggcactaccactaa

>gi568815589r:33988353_34226431|GENSCAN_predicted_peptide_3|518_aa
MDSGPFTSDCCCYWQGQELSSVLITEKSKASLNSWTLCPNGSSQLRYMTYSGKEGALWMS
ADKEARRTRASSVAAAWQQRGSGGGGGGRSSGCFSRVAGSPSSVVDYLISGGLIDFIAEV
DLTSALTRKITLKTPLISSPMDTVTEADVAIAMALMGDTGFIHHNCTPELQAKELRKVKK
FEQGFITDPTVLSPSHTVGDVLEAKMRHGFSGIPITETGTMGSKLVGIVTSRDINFLAEK
DHTTLLSEVMTPRLELVVAPACVTLKEANEILQRSKKGKLPIINDRNELVAIITGSDLKK
NQDYPLASKDSQQAAAVGVDIIVLDSSQGNLVYQIAMVHYIKQKYPHLQVIGRNVVTAAQ
AKNLIDAGVDGLCVGMGCGSICVTREVMACGQPQGTAVYKEAKYARRFGVPIIANGSIQT
MGHIVKALALGVSTVMMGSLLAITMEAPGEYFSDEVRLKKYWGMGSLDAMEKSSSSQKRY
FSKGDKVKIVQGVLGSIQDKGSIQKFVPYLIVGIQHGC

>gi568815589r:33988353_34226431|GENSCAN_predicted_CDS_3|1557_bp
atggatagtggcccctttacctctgactgctgctgctactggcaaggccaagagctgagt
tctgtgctgatcacagagaagagcaaggcctcactgaattcatggactttatgtcccaac
gggagctctcaactgaggtacatgacttactccggcaaagaaggggcactctggatgtct
gcagataaggaggcccgccggacccgcgctagcagcgtggcagcagcgtggcagcagcgc
ggcagcggcggcggcggcggcgggcggtccagcgggtgtttctctcgggtcgcagggtct
cccagcagcgtggtggactacctgatcagcggtggactcatagacttcatagctgaggtg
gaccttacctcagccctgacccggaagatcacgctgaagacaccgctgatctcctccccc
atggacactgtgacagaggctgacgtggccatcgcgatggctctgatgggagatactggt
ttcattcaccacaactgcaccccagagctccaggccaaggagctacggaaggtcaagaag
tttgaacagggcttcatcacggaccccacggtgctgagcccctcccacactgtaggtgat
gtgctggaggccaagatgcggcatggcttctctggcatccccatcactgagacgggcacc
atgggcagcaagctggtgggcatcgtcacctcccgagacatcaactttcttgctgagaag
gaccacaccaccctcctcagtgaggtgatgacgccaaggctcgagctggtggtagctcca
gcatgtgtgacgttgaaagaggcaaatgagatcctgcagcgtagcaagaaagggaagctg
cctatcatcaatgatcgcaatgagctggtggccattatcaccggcagcgacctgaagaag
aaccaagactaccctctggcctccaaggattcccaacaagcagctgctgtgggcgtcgac
atcatagtcttggactcgtcccaagggaacttggtgtatcagatcgccatggtgcattac
atcaaacagaagtacccccacctccaggtgattggcaggaacgtggtgacagcagcccag
gccaagaacctgattgacgctggtgtggacgggctgtgtgtgggcatgggctgcggctcc
atctgcgtcacccgggaagtgatggcttgtggtcagccccagggcactgctgtgtacaag
gaggccaagtatgcccggcgctttggtgtgcccatcatagccaatggcagcatccagacc
atggggcacatagtcaaggccctggcccttggagtctccacagtgatgatgggctccctg
ctggccatcaccatggaggcccctggtgagtacttctcagacgaggtgcggctcaagaag
tactggggcatgggctcactggatgccatggagaagagcagcagcagccagaaacgatac
ttcagcaagggggataaggtgaagatcgtgcagggtgtcttgggctccatccaggacaaa
gggtccattcagaagttcgtgccctacctcatagtgggcatccagcacggctgctag

>gi568815589r:33988353_34226431|GENSCAN_predicted_peptide_4|48_aa
MAAAALAVATVTAWPGAGRVGAKYADVQRTVIRGLSFANTELVDSDRS

>gi568815589r:33988353_34226431|GENSCAN_predicted_CDS_4|147_bp
atggcggctgcggcactggcggtggctacggtgacggcctggcccggagcgggcagagtt
ggagccaagtacgccgatgtgcagcggacagtaataagggggctaagttttgcaaacaca
gagcttgtggacagtgatagatcctga

>gi568815589r:33988353_34226431|GENSCAN_predicted_peptide_5|59_aa
XIIDVVYNAFNNELVHNKTLVKNCFMLIDSTPYSSTAHRMDSGTSPTMRYPWATRKETS

>gi568815589r:33988353_34226431|GENSCAN_predicted_CDS_5|180_bp
nngatcatcgatgttgtctacaatgcgtttaataacgagctggtccataacaagaccctg
gtgaagaattgcttcatgctcattgacagcacaccatactcatcgacagcacaccgtatg
gacagtggtacgagtcccactatgcgctacccctgggccacaagaaaggagacaagctga