GENSCAN 1.0 Date run: 4-Nov-116 Time: 21:08:03 Sequence gi568815593r:172668765_172870952 : 202188 bp : 49.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1021 1566 546 2 0 121 69 1251 0.976 119.35 1.02 Intr + 11579 11676 98 0 2 77 80 57 0.008 3.53 1.03 Intr + 14657 15374 718 1 1 16 109 1464 0.001 132.31 1.04 Intr + 17407 17532 126 2 0 88 82 152 0.998 15.25 1.05 Term + 17917 18161 245 2 2 97 38 411 0.997 33.06 1.06 PlyA + 20770 20775 6 -0.45 2.00 Prom + 22615 22654 40 -6.96 2.01 Init + 27182 27232 51 1 0 90 58 45 0.736 2.86 2.02 Intr + 28818 29176 359 2 2 44 40 250 0.121 9.85 2.03 Intr + 37202 37385 184 2 1 50 92 115 0.830 7.89 2.04 Intr + 42851 42991 141 1 0 54 94 65 0.891 4.25 2.05 Intr + 47323 47474 152 0 2 94 100 24 0.500 3.16 2.06 Intr + 50418 50455 38 0 2 108 99 19 0.180 2.91 2.07 Intr + 56505 56705 201 1 0 13 72 147 0.097 4.86 2.08 Intr + 58243 58355 113 2 2 36 89 137 0.953 8.70 2.09 Intr + 59897 60047 151 1 1 21 97 84 0.738 2.34 2.10 Intr + 64077 64235 159 1 0 90 56 77 0.906 4.66 2.11 Intr + 72100 72236 137 2 2 83 24 98 0.125 3.19 2.12 Term + 79816 80121 306 0 0 33 48 212 0.344 6.82 2.13 PlyA + 81991 81996 6 1.05 3.03 PlyA - 83075 83070 6 1.05 3.02 Term - 93935 93750 186 2 0 98 45 227 0.869 16.89 3.01 Init - 94494 94048 447 0 0 62 -33 488 0.539 28.07 3.00 Prom - 96092 96053 40 -3.16 4.09 PlyA - 96678 96673 6 1.05 4.08 Term - 100368 99998 371 1 2 86 48 321 0.939 22.71 4.07 Intr - 101030 100811 220 2 1 57 67 384 0.998 31.07 4.06 Intr - 101542 101397 146 2 2 38 92 72 0.994 2.60 4.05 Intr - 102469 101822 648 2 0 77 107 966 0.366 89.32 4.04 Intr - 104036 103977 60 0 0 85 70 68 0.344 3.41 4.03 Intr - 105149 105038 112 2 1 81 60 36 0.059 0.05 4.02 Intr - 120661 120521 141 1 0 48 37 130 0.108 4.45 4.01 Init - 125151 125047 105 0 0 17 42 139 0.088 2.42 4.00 Prom - 127209 127170 40 -4.86 5.06 PlyA - 128154 128149 6 1.05 5.05 Term - 149271 149127 145 2 1 95 45 76 0.007 1.38 5.04 Intr - 175418 175242 177 1 0 107 85 74 0.360 8.03 5.03 Intr - 181035 180892 144 2 0 104 83 58 0.214 6.30 5.02 Intr - 183988 183953 36 0 0 84 127 10 0.068 1.88 5.01 Init - 190092 189965 128 0 2 92 95 69 0.117 5.68 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 14655 15374 720 1 0 106 109 1465 0.821 141.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:172668765_172870952|GENSCAN_predicted_peptide_1|577_aa XPSPPARLLATRPCCGPGPERRPVLGEAPRFHAQAKGKNVRLDGHSRRATRRNSFCNGVT FTQRPIRLYEQVRLRLVAVRPGWSGALRFGFTAHDPSLMSAQDIPKYACPDLVTRPGYWA KALPENLALRDTVLAYWADRHGRVFYSVNDGEPVLFHCGVAVGGPLWALIDVYGITDEVQ LLGHALNKDAALMGSCVEPLGDKRRRVGGGEGAGQSAFADTLTPARLSQARFSACLPPSS HDAANFDNNELENNQVVAKLGHLALGRAPGPPPADAAAAAIPCGPRERPRPASSPALLEA DLRFHATRGPDVSLSADRKVACAPRPDGGRTLVFSERPLRPGESLFVEVGRPGLAAPGAL AFGITSCDPGVLRPNELPADPDALLDRKEYWVVARAGPVPSGGDALSFTLRPGGDVLLGI NGRPRGRLLCVDTTQALWAFFAVRGGVAGQLRLLGTLQSSPATTTPSGSLSGSQDDSDSD MTFSVNQSSSASESSLVTAPSSPLSPPVSPVFSPPEPAGIKNGECTVCFDGEVDTVIYTC GHMCLCHSCGLRLKRQARACCPICRRPIKDVIKIYRP >gi568815593r:172668765_172870952|GENSCAN_predicted_CDS_1|1734_bp nacccgagcccaccggcgcgcctcctggccacccggccgtgctgcggccccggccccgag cgacgcccggtcctgggcgaggcgccgcgcttccacgcgcaggccaaaggcaagaacgtg cggctggacggccactcgcgccgggccacacggcgcaacagcttctgcaatggcgtcacg ttcacgcagcggcccatccggctgtacgagcaggtgcggctgcgcctggtggccgtgcgc cctggctggagcggcgcgctgcgcttcggcttcaccgcgcacgatccgtcgctcatgagc gcccaggacatccccaagtacgcctgcccggacctggtcacgcggccgggctactgggcc aaggcactgcccgagaacctggcgctgcgcgacacggtgctggcctactgggccgaccgc cacggccgcgtgttctacagcgtgaacgacggcgagccggtgctcttccactgcggcgtg gccgtgggcggcccgctctgggcgctcattgatgtctacggcatcaccgacgaggtgcag cttctgggccatgctcttaataaggatgctgcactgatgggctcctgtgtggagccgctg ggggacaagaggaggagagttggcgggggggaaggagcagggcaaagcgccttcgctgac acgctgacgcccgcgcgcctcagccaggcccgcttcagcgcctgcctgccgcccagcagc cacgacgcggccaacttcgacaacaacgagctcgagaacaaccaggtggtggccaagctg ggccacctggcgctgggccgcgccccgggcccaccgccagccgacgccgcggccgccgcc attccgtgcgggccccgtgagcgcccgcggcccgcgtcgtcgccggcgctactggaggcc gacctgcgcttccacgcaacacgcgggcccgacgtgagcctgtcggccgaccgcaaagtg gcctgcgcaccgcggcccgacggcggccgcacgctggtcttctccgagcgcccgctgcgg cccggcgagagcctcttcgtggaggtgggccgtccggggctggcggcgcccggcgcgctg gccttcggcatcacgtcgtgcgacccgggcgtgctacggcccaacgagctgcccgccgac ccagacgcgctgctcgaccgcaaagagtactgggtggtggcgcgcgccgggcccgtgccg agcggcggcgacgcgctcagcttcacgctgcggcccggcggcgacgtgctcctgggcatc aacgggcgtccgcgcggccgcctgctgtgcgtcgacaccacgcaggcgctctgggccttc ttcgccgtgcgcggcggcgtcgcgggccagctgcgtctcctcggtaccctgcagtccagc cctgcgaccacgactccatcagggtccctcagcggctcccaggacgatagtgattcagat atgaccttcagtgtcaaccagtcctcctcggcatctgagtcatccctggtgacggccccc agctccccgctgagccccccggtgtcccccgtgttctccccaccggagccggcaggcatc aagaatggcgagtgcacggtgtgcttcgatggcgaggtggacacggtcatctacacgtgt ggacacatgtgcctgtgccacagctgcggcctgcggctcaagcgacaggcccgggcctgc tgccccatctgccggcggcccatcaaggacgtcattaagatctacaggccatag >gi568815593r:172668765_172870952|GENSCAN_predicted_peptide_2|663_aa MGMPPSEPPPRGPYVGPGVKLKVLKFREVARQAPRQKARGLDGFFLEDGALSLSSGHGEK WQRPEQKDIGQGPGAGQIYTPYHPPERILQKPQRPGSQTPANGLVDLGNLEEARQGDIES GSGVKSAFGGGLELRVCMCADEDHLKKKALVMWKGEKTKQARLPRKQEEIQGPKAVVAVV GRRTFAAVWQAERRCRKEPLDSSSKPPPRNCKPTLNPSTSTHQSITQSVNQSNNNTLVLI SRSLQAPKLERVDIGHGDLCECLPSLLQSTPCITQSTRPGPGVTLSSAQHCQEGRRKEVR IKDGSSDPDDFFISIAPQRGLATVMCTNAEQIHNTVTYNWGSVNQPSDGPCHQSIEPASS QQRRGWQWKPDPKGKVPSDALQQVLAELVLEELENSPFLEACSLQSPPHWSSSSIGNELL GAVAGSKENRPMTPAICQVRGNGSEQGNKLEGERRNWTPEISEGGTQHYQENHCSLRTYY DARPIVRDLLDIHDGCDRHWNAHKDTAPSRAFSSSAVPTLPGCCQLPNARAATWGPGDTR AHTVVMCLDGECVRCQRYQARRPEGGRWEREEAPALPAPSLTRGLRKEQRVSWGQTHSIH APAGSTHVRTSAPSAQAHWRSGPDLLRERETWMEKSDEQIPLDIPNPPFREPCEDARISP FYG >gi568815593r:172668765_172870952|GENSCAN_predicted_CDS_2|1992_bp atggggatgcccccttctgagcctccccctcgtggcccctacgtgggccctggagtgaaa ctgaaagtcctgaagttcagggaggtggctcgccaggctcctcggcagaaggcgcgggga ttggacggtttctttctagaagatggagcactgtcgctctctagtggccacggtgagaag tggcaaagacctgagcagaaagacattggtcaaggtccaggtgctggacagatttacacc ccctaccatccccccgagcgcatcctgcagaagccgcaacgaccgggaagccagactcct gcaaatggccttgtggatctgggtaaccttgaggaggcacgtcagggtgacattgagtct gggtctggggtcaagtctgcgtttgggggtggactggagctgagggtgtgcatgtgtgct gatgaggatcacctaaagaagaaggcactggtgatgtggaagggagagaaaaccaaacaa gcaaggcttccaagaaagcaggaggagatccagggcccaaaagctgtggtggctgtggtg gggagaaggacatttgctgctgtatggcaggcagagagaagatgcaggaaggagccactg gacagctcctccaagccaccacctaggaactgcaaacccaccctcaatcccagtacctcc acccatcaatcaattactcaatctgtcaatcaatcaaataacaacaccctggtcctcatc tcacgttctctccaggcccccaaacttgagagggttgacataggccatggagacctgtgc gaatgtctgccttccctactccagtcgaccccctgcatcacccagagcaccaggccaggg cctggagtcacgctctcctctgcccagcactgccaggaaggcagaaggaaggaggtcaga ataaaagatggcagcagtgacccagatgactttttcatctccatcgccccgcaaaggggg ttggccacagtgatgtgcaccaatgctgagcaaattcacaacaccgtcacttacaactgg ggatcagttaatcagccatcagatgggccttgtcaccagtcaatagagccagcgtcaagt caacagaggcgaggctggcaatggaagccagacccaaagggcaaagtgcccagcgacgcc ctgcagcaggtgttggccgagcttgtgttggaggagctggagaacagccccttcctggaa gcctgcagcctacagagcccacctcactggtcctcctcgagcattgggaatgagctattg ggagctgtggcggggagcaaagaaaacaggccgatgacccctgcaatatgccaggtgaga ggcaatggctctgagcagggaaataaactggaaggagaaagacgcaactggactccggaa atatctgaaggtgggacgcaacactatcaggaaaaccactgctccctgaggacctactat gatgccaggcctattgtcagagacctgctagacatccacgacggatgtgataggcattgg aatgcacacaaagatactgccccttccagagccttctctagctctgctgtgcccaccctg ccaggctgctgccagctcccaaatgccagagctgccacctggggccctggagacaccagg gcccacactgttgtgatgtgtttggatggggagtgcgttcggtgccagcgttaccaggct cgaaggcccgaagggggccgctgggaacgcgaggaggccccggcccttcctgctccgtcg ctgacgcgcggcctgcggaaggaacagcgcgtctcttggggccaaacgcattccattcat gccccggcgggcagcacgcacgtccgcacctcggctcccagcgcacaggctcactggcga tccgggcccgacttgcttagggagagagagacttggatggagaagtcagatgagcagatc ccgcttgatatcccgaaccctcccttccgggagccctgcgaggatgcgcgcatcagccca ttctacggctaa >gi568815593r:172668765_172870952|GENSCAN_predicted_peptide_3|210_aa MEAPEGGGGGPAARGPEGQPAPEARVHFRVARFIMEAGVKLGMRSIPIATACTIYHKFFC NTNLDAYDPYLIAVSSIYLAGKAEKQHLRTHDIINVSNRYFNPSGEPLELESRFWELRDS IVQCELLMLRVLRFQVSFQLPHKYLLHYLAQHIAVAVLYLALQVYGVEVPAEVKAEKPWW QVFGEDLTKPIIDNIVSDLIQIYTMDTEIP >gi568815593r:172668765_172870952|GENSCAN_predicted_CDS_3|633_bp atggaagccccggagggcggcggaggggggcctgcagcgcggggccccgaggggcaaccg gcgcccgaagccagggtgcacttccgagtggcgaggttcatcatggaggcaggtgtcaag ctagggatgaggtccattcccattgccactgcttgcaccatttaccataagttcttttgc aataccaacctggacgcttatgacccttacctgattgccgtgtcttccatttacttggcc ggcaaagcggaaaagcagcacctgcggactcatgacatcatcaatgtgtccaacaggtac tttaacccgagcggtgagcccctggaattggaatcccgcttctgggagctccgggacagc attgtgcagtgtgagcttctcatgctgagagttctgcgcttccaggtctccttccagctt ccacacaagtacctgctccactacctggcccagcacatcgcggtggcggtgctctacctg gccctgcaggtctacggagttgaggtgcccgccgaggttaaggctgagaagccgtggtgg caggtgtttggtgaagaccttaccaagccaatcattgataatattgtgtctgatctcatt cagatttataccatggacacagagatcccctaa >gi568815593r:172668765_172870952|GENSCAN_predicted_peptide_4|600_aa MIPCLDKPFDLNFLICKMKLLIPSQEIVTGHKEDNQPKDPELALTECHVAFTLDNTALGA FSYRCLEFSLKHPEAGPGGYVSRPDYAGLSLAPSSVRSSISTAWIGGLGGYTFTTCRRAG RALPLSEDLFAVLDQCSNEEAAYKRAPRARLAAKDIWAVCATRVGGAVGGTAKKPRSPEP RVTLLSQSKSGFWFGAERPGGLAFPRKAPPCPWPREQTKSTAGPITLGALRPAMVMEVGT LDAGGLRALLGERAAQCLLLDCRSFFAFNAGHIAGSVNVRFSTIVRRRAKGAMGLEHIVP NAELRGRLLAGAYHAVVLLDERSAALDGAKRDGTLALAAGALCREARAAQVFFLKGGYEA FSASCPELCSKQSTPMGLSLPLSTSVPDSAESGCSSCSTPLYDQGGPVEILPFLYLGSAY HASRKDMLDALGITALINVSANCPNHFEGHYQYKSIPVEDNHKADISSWFNEAIDFIDSI KNAGGRVFVHCQAGISRSATICLAYLMRTNRVKLDEAFEFVKQRRSIISPNFSFMGQLLQ FESQVLAPHCSAEAGSPAMAVLDRGTSTTTVFNFPVSIPVHSTNSALSYLQSPITTSPSC >gi568815593r:172668765_172870952|GENSCAN_predicted_CDS_4|1803_bp atgattccctgcttggataaaccattcgacctcaatttcctcatctgcaaaatgaagctc ctcatcccgagccaggaaattgtcacaggccacaaagaagataaccagcccaaggaccct gagctggccctcacagagtgtcatgtggcttttaccctagataacacagcccttggagcg ttctcctaccgctgcctggagttttctctgaaacacccagaggcggggcctggtggctat gtcagtagaccagattatgctgggctctctttggctccatcctcagtcagaagcagcatc tccactgcctggattgggggcctaggtggctacacgtttaccacttgtcgccgggcaggg agggcgttgcccttatctgaggacctctttgctgtcctcgaccaatgttcaaatgaagag gccgcatataaacgcgctccccgggccaggctcgctgcgaaggacatttgggctgtgtgt gcgacgcgggtcggaggggcagtcgggggaaccgcgaagaagccgaggagcccggagccc cgcgtgacgctcctctctcagtccaaaagcggcttttggttcggcgcagagagacccggg ggtctagcttttcctcgaaaagcgccgccctgcccttggccccgagaacagacaaagagc accgcagggccgatcacgctgggggcgctgaggccggccatggtcatggaagtgggcacc ctggacgctggaggcctgcgggcgctgctgggggagcgagcggcgcaatgcctgctgctg gactgccgctccttcttcgctttcaacgccggccacatcgccggctctgtcaacgtgcgc ttcagcaccatcgtgcggcgccgggccaagggcgccatgggcctggagcacatcgtgccc aacgccgagctccgcggccgcctgctggccggcgcctaccacgccgtggtgttgctggac gagcgcagcgccgccctggacggcgccaagcgcgacggcaccctggccctggcggccggc gcgctctgccgcgaggcgcgcgccgcgcaagtcttcttcctcaaaggaggatacgaagcg ttttcggcttcctgcccggagctgtgcagcaaacagtcgacccccatggggctcagcctt cccctgagtactagcgtccctgacagcgcggaatctgggtgcagttcctgcagtacccca ctctacgatcagggtggcccggtggaaatcctgccctttctgtacctgggcagtgcgtat cacgcttcccgcaaggacatgctggatgccttgggcatcactgccttgatcaacgtctca gccaattgtcccaaccattttgagggtcactaccagtacaagagcatccctgtggaggac aaccacaaggcagacatcagctcctggttcaacgaggccattgacttcatagactccatc aagaatgctggaggaagggtgtttgtccactgccaggcaggcatttcccggtcagccacc atctgccttgcttaccttatgaggactaatcgagtcaagctggacgaggcctttgagttt gtgaagcagaggcgaagcatcatctctcccaacttcagcttcatgggccagctgctgcag tttgagtcccaggtgctggctccgcactgttcggcagaggctgggagccccgccatggct gtgctcgaccgaggcacctccaccaccaccgtgttcaacttccccgtctccatccctgtc cactccacgaacagtgcgctgagctaccttcagagccccattacgacctctcccagctgc tga >gi568815593r:172668765_172870952|GENSCAN_predicted_peptide_5|209_aa MDFGPKHLAGRGLGILGRALTSSQPCFLSVDNSGPPPMPAGGWRKVPLPSPLMTRNKLSA DSVGEVLCYAPGQRGDPQLLSLGVCEDLTFTKTHMTPAINWKTLVFDVYGCVGTTKGEGS NSGSDHLRCSHVIVSVPCKPRRPWFYHVHLPEDPLLPSITARASKPWAHGSAKTIFNSLA RDQLTPSGKIFYEYLLSDYDGPEITLTLN >gi568815593r:172668765_172870952|GENSCAN_predicted_CDS_5|630_bp atggactttggccccaaacatctggctggcaggggccttggaatcttgggcagggccctc acaagctcccagccctgttttctttctgtggacaacagcggacccccacccatgccagcg gggggctggaggaaggtaccattaccatccccgctgatgacacgaaacaagttgagtgca gacagcgtgggtgaggtcttgtgctatgcaccaggacagagaggagacccacagctgctg tcattgggggtatgtgaggacctaacattcaccaagacacatatgactcctgccatcaac tggaaaacactggtgtttgacgtctatggctgtgtgggtaccacaaagggagaggggtct aactctgggtctgaccatctcaggtgttcccatgtgattgtatctgtgccctgcaaaccc cggaggccctggttttaccacgtccacctaccagaagacccactactgccatctatcacg gctcgggcgtcaaagccttgggctcatggttctgccaagacaatcttcaattcacttgca agagatcaactgaccccttcagggaagatcttctacgaatatttattgagtgactatgac gggccagaaataactctgaccttgaactga