GENSCAN 1.0	Date run: 6-Nov-116	Time: 12:36:16

Sequence gi568815595r:156613887_156814928 : 201042 bp : 39.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3373   3518  146  0  2   81   56   131 0.653   8.84
 1.02 Intr +   4772   4860   89  2  2  107   68    73 0.900   5.90
 1.03 Intr +   8262   9161  900  1  0   74  -32   579 0.514  34.83
 1.04 Intr +   9357   9603  247  1  1   19   52   270 0.924  12.10
 1.05 Intr +  11178  11450  273  0  0   74   31   205 0.387   9.13
 1.06 Term +  18931  19033  103  1  1   80   52    94 0.267   1.77
 1.07 PlyA +  19038  19043    6                               1.05

 2.00 Prom +  21745  21784   40                              -1.55
 2.01 Init +  47464  47569  106  0  1   62   57    96 0.181   4.33
 2.02 Intr +  54945  55009   65  1  2  101   87    50 0.234   3.72
 2.03 Intr +  55311  55362   52  2  1   81  106     0 0.173  -1.34
 2.04 Intr +  58249  58438  190  2  1   68   65    73 0.164   0.72
 2.05 Intr +  60650  60910  261  2  0   73  103   261 0.998  21.78
 2.06 Intr +  61288  61505  218  1  2   -3   83   181 0.642   5.72
 2.07 Intr +  63771  64728  958  1  1  134   71   433 0.815  35.40
 2.08 Intr +  80134  80302  169  1  1   86   53    90 0.649   4.33
 2.09 Intr +  81979  82139  161  0  2   97   98    83 0.972   8.06
 2.10 Intr +  88113  88232  120  0  0    7   19   167 0.544   0.39
 2.11 Intr +  89538  89816  279  0  0  116  113   141 0.999  15.07
 2.12 Term +  90798  91245  448  0  1   77   48   283 0.993  16.90
 2.13 PlyA +  92842  92847    6                               1.05

 3.10 PlyA -  93098  93093    6                               1.05
 3.09 Term - 100407  99998  410  1  2    9   38   330 0.711  14.49
 3.08 Intr - 101016 100451  566  2  2   50   -7   304 0.438   8.61
 3.07 Intr - 104286 104246   41  0  2  106   92    59 0.628   4.20
 3.06 Intr - 105705 105577  129  0  0   59   73    86 0.755   4.17
 3.05 Intr - 127063 126885  179  2  2   47   68   126 0.135   5.22
 3.04 Intr - 127714 127550  165  2  0   43   49   173 0.548   7.91
 3.03 Intr - 128068 127741  328  1  1  101   68   148 0.611   8.45
 3.02 Intr - 137297 137209   89  0  2   63  101    56 0.013   3.17
 3.01 Init - 157614 157521   94  0  1   55   62    82 0.039   2.89
 3.00 Prom - 183857 183818   40                              -4.15

 4.06 PlyA - 184942 184937    6                               1.05
 4.05 Term - 185151 184999  153  0  0   59   42   130 0.353   2.44
 4.04 Intr - 192844 192746   99  1  0  101    3   143 0.525   6.49
 4.03 Intr - 193309 193241   69  1  0   58  103    39 0.628   0.86
 4.02 Intr - 196452 195482  971  1  2   -5  -18   931 0.319  62.98
 4.01 Init - 196846 196756   91  1  1   67   32   146 0.582   7.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:156613887_156814928|GENSCAN_predicted_peptide_1|585_aa
MGGKERKGMKEQVAELYREVIADLTDKETLEQSFEGGNESAKQTFERRKIPPTIYSEAPE
VQCNKKCFIGVAKWEPPAGGHTEITETIKNLEEVQIVCGTHSPYNSPVWPVRKPNGTCWM
MVDYWELNKVTPPLHAAVLSIMDLMDCLMTELGQYHYEVDLANAFFSIDIAPETQEQFAF
MWDRQQWTFTVLLQGYVHSPTICPGLVATDLAAWQCPEGVHLFHYIDDIMLTSDSLADLE
VVAPLLQQHVAAYGWAINESKVQGPGLAAKFLGVIWSGKTKAIPEAMIDKIQAYPWPTTV
KKLQIFVGLLGYQRAFVSFLAQMIKLLHWLTKKGAPWDWDDAAETAFLAAKLAIQQAQAL
WVVDLGHPFELDVHVTTDVRWVHSWVTTPQTGTVQTSTLAKWGAYVEQQGTLSTNPLAAE
LQEVLGPVVLMQDKAMGPEAALDPEPSPFKEGHPSIPDGAWLTCSNCIVDWAHTYAEVTN
VSNSLICTTLPAAAADCLPWHIHPVSSVNWTWLETWGPMADASNAMQQALDKGCHKAHGA
PTPWLAHSVYNGSSSPSTKSLSHGELKPSPEADAGTMLLVQPAEP

>gi568815595r:156613887_156814928|GENSCAN_predicted_CDS_1|1758_bp
atggggggaaaagaaagaaaaggcatgaaggaacaagttgcagaattatacagagaggtc
atagcagacctcactgacaaggaaacattagaacaaagctttgaaggagggaatgagtca
gccaagcagacatttgaaagaagaaagattcctccgacaatatacagtgaagctcctgaa
gttcaatgcaataagaagtgcttcattggcgtggcaaagtgggaacctcctgcaggtggg
catacagagataactgagacaattaaaaacctggaggaggtgcagatagtgtgtggcacc
cacagcccctacaattctccagtgtggccagttagaaagcctaatggaacttgttggatg
atggtggactattgggaactgaataaagtaacaccccctttgcatgcagctgtactgtca
atcatggatttgatggactgtttgatgacagaactgggacagtatcactatgaagtggac
ttggccaatgcatttttctccattgacattgctccagagacccaggaacagtttgccttc
atgtgggataggcaacaatggactttcacagtgttgctgcagggctatgtgcatagcccc
accatatgtcctggtctagttgcaacagatttagctgcctggcaatgtccagaaggggtc
cacctattccattatattgatgatatcatgttaacctctgattctcttgcagatttagaa
gtggtggcacccctcttgcagcaacatgtagcagcatacggttgggccatcaatgaatcc
aaggtccaagggcctggattagctgccaaattcttgggagttatctggtcaggtaagaca
aaggccatcccagaggctatgattgataagattcaggcatatccctggcccaccacagtg
aagaagctgcagatttttgtggggctcctgggatatcagcgggcatttgtgtccttttta
gctcaaatgataaaattgttgcattggttaacaaagaagggagctccctgggattgggat
gatgcggctgagaccgccttcctggcagccaagctggctattcagcaggcacaagcccta
tgggtagttgacttggggcacccatttgagctggatgtgcatgtgaccacagatgtgaga
tgggtgcattcatgggtaacgaccccccagactgggacggtgcagacatccactttggca
aagtggggtgcctacgtggagcagcagggtacactgagtacaaaccccttagcagcagag
ttgcaagaggtcttgggacctgtagtcctaatgcaagataaggccatggggcctgaggca
gccctagaccctgagccttcaccatttaaggaagggcatccctcaattcctgatggggca
tggttaacatgctccaactgcattgtggactgggcccacacctacgctgaagtgaccaat
gtttccaactctttgatctgtaccacccttccagcagcagctgcggactgcttgccttgg
cacatacaccccgtgtcttcagtgaactggacatggttggagacttggggccccatggct
gatgcctcgaatgcaatgcagcaagctttggacaaaggctgccacaaggcccatggtgca
cccaccccctggctggcccatagtgtttataatgggtcctcttccccttctaccaagagc
ttgtcccatggtgagctgaagccctcaccagaagcagatgctggtaccatgcttcttgta
cagcctgcagaaccgtga

>gi568815595r:156613887_156814928|GENSCAN_predicted_peptide_2|1008_aa
MKSSNTGCEIQIKHRVTETIVGYGNIDTVIIRKTLGLLTLTFPKQKHSGALSEAEKKNPT
SAELLLREIGNLPLEERKKGRNSRGDSIRKGNAFPEISSMCLEVYPMAIHNPMGSREMNF
SARHTVIPPSKKDGPRPRLRLRDGPSGGGDRKPLKAEIEVTGELAGLGARSRLSAFAAAE
WPVRGSPRRGSVVPRRRLPPSALISGAGTAFHSAPVVRSLRLPFRPRGGGHGSYGSLPAT
PRFGLQPRSLEDVFTLTLRAHKSSSSSPPPREALAAFPPRFAAQFCEPPRPVGRSSGGFL
DSEEQLELIHIMEMETTEPEPDCVVQPPSPPDDFSCQMRLSEKITPLKTCFKKKDQKRLG
TGTLRSLRPILNTLLESGSLDGVFRSRNQSTDENSLHEPMMKKAMEINSSCPPAENNMSV
LIPDRTNVGDQIPEAHPSTEAPERVVPIQDHSFPSETLSGTVADSTPAHFQTDLLHPVSS
DVPTSPDCLDKVIDYVPGIFQENSFTIQYILDTSDKLSTELFQDKSEEASLDLVFELVNQ
LQYHTHQENGIEICMDFLQGTCIYGRDCLKHHTVLPYHWQIKRTTTQKWQSVFNDSQEHL
ERFYCNPENDRMRMKYGGQEFWADLNAMNVYETTEFDQLRRLSTPPSSNVNSIYHTVWKF
FCRDHFGWREYPESVIRLIEEANSRGLKEVRFMMWNNHYILHNSFFRREIKRRPLFRSCF
ILLPYLQLCVDDNAVVVVVVVVVVVVVVVVVVVVVVVVVMVVSGKTRTLGGVPTQAPPPL
EATSSSQIICPDGVTSANFYPETWVYMHPSQDFIQVPVSAEDKSYRIIYNLFHKTVPEFK
YRILQILRVQNQFLWEKYKRKKEYMNRKMFGRDRIINERHLFHGTSQDVVDGICKHNFDP
RVCGKHATMFGQGSYFAKKASYSHNFSKKSSKGVHFMFLAKVLTGRYTMGSHGMRRPPPV
NPGSVTSDLYDSCVDNFFEPQIFVIFNDDQSYPYFVIQYEEVSNTVSI

>gi568815595r:156613887_156814928|GENSCAN_predicted_CDS_2|3027_bp
atgaaaagtagtaacactggatgtgaaattcagatcaaacatagagttacagaaacgata
gttggctatggcaatattgacactgtgatcattagaaagactctaggccttttaaccctc
acctttcctaaacagaagcactctggtgccttgtcagaggcagagaaaaaaaatcccacg
agtgcagaactccttctcagggaaataggaaatcttcctttggaagaaagaaaaaaagga
agaaattcaaggggtgactctattaggaaaggaaacgcttttccagaaatctcaagcatg
tgtctagaagtgtatcccatggccatccataaccccatgggctctagggaaatgaatttt
tcagctaggcacactgtcatccctccttcaaaaaaagatgggccacgtccgaggctccgc
ctccgcgacgggccgtcgggaggaggcgaccggaagccacttaaagcagagatcgaggtg
acaggcgagctggctggactcggagcgcggtcgaggctttctgcgttcgcggcggcggaa
tggcccgtgcgcggctcgccgcgtcgcggctctgtggtccctagacgtcggctcccgccc
tcggcgctgatctccggcgcgggcactgctttccactcggctcctgtcgtccgttctctc
aggctcccgttcagaccccgggggggagggcacggcagctacgggagccttccggctacc
ccgcgtttcgggctgcagcccagaagtttggaagatgttttcacattaactttgagagcg
cacaagtcttcgtcttcctccccgccgccgcgggaagcgctcgccgcctttcccccgcgc
ttcgcggctcagttctgcgagcccccaagacccgttggacgctcctcgggaggattttta
gactctgaggagcagttggagctaatccacattatggaaatggaaaccaccgaacctgag
ccagactgtgtagtgcagcctccctctcctcctgatgacttttcatgccaaatgagactc
tctgagaagatcactccattgaagacttgttttaagaaaaaggatcagaaaagattggga
actggaaccctgaggtctttgaggccaatattaaacactcttctagaatctggctcactt
gatggggtttttagatctaggaaccagagtacagatgagaacagcttacatgaacctatg
atgaagaaagccatggaaatcaattcatcatgcccaccagcagaaaataatatgtctgtt
ctgattcctgataggacaaatgttggggaccagataccggaagcccatccttccactgaa
gctccagaacgagtggttccaatccaagatcacagctttccatcagaaaccctcagtggg
acggtggcagattccacaccagctcacttccagactgatcttttgcacccagtttcaagt
gatgttcctactagtcctgactgcttagacaaagtcatagattatgttccaggcattttc
caagaaaacagttttacaatccaatacattctggacaccagtgataagctgagtactgag
ctctttcaggacaaaagtgaagaggcttcccttgacctcgtgtttgagctggtgaaccag
ttgcagtaccacactcaccaagagaacggaattgaaatttgcatggactttctgcaaggc
acttgtatttatggcagggattgtttgaagcaccacactgtcttgccatatcattggcag
atcaaaaggacaactactcaaaagtggcagagtgtattcaatgattctcaggagcacttg
gaaagattttactgtaacccagaaaatgatagaatgagaatgaagtatggaggacaagaa
ttttgggcagatttgaatgccatgaacgtgtatgaaacaactgaatttgaccaactacga
aggctgtccacaccaccctctagcaatgtcaactctatttaccacacagtctggaaattc
ttctgtagggaccactttggatggagagagtatcccgagtctgtcattcgattgattgaa
gaagccaactctcggggtctgaaagaggttcgatttatgatgtggaataaccactacatc
ctccacaattcattcttcaggagagagataaaaaggagacccctcttccgctcctgtttt
atactgcttccatatttacaactgtgtgttgatgataatgcggtggtggtggtggtggtg
gtggtggtggtggtggtggtggtggtggtggtggttgtggtggtggtggtggtggtgatg
gtggtgtcaggcaaaactaggacacttggtggggttcccacacaagctcctccacctctt
gaagcaacttcatcatcacaaattatctgcccagatggggtcacttcagcaaacttttac
cctgaaacttgggtttatatgcatccatctcaggacttcatccaagtccctgtttctgca
gaggataaaagttatcggatcatttacaatctttttcataagactgtgcctgagtttaaa
tacagaattttgcagatattgagagtccaaaaccagtttctttgggagaaatataaaagg
aaaaaggaatatatgaacaggaaaatgtttggccgtgacaggataataaatgagagacat
ttatttcatggaacatcccaggatgtggtagatggaatctgcaaacacaactttgaccct
cgagtctgtggaaagcatgctacaatgtttggacaaggcagttattttgcaaagaaggca
agctactctcataacttttctaagaagtcctccaaaggagtccacttcatgtttctggcc
aaagtgctgacgggcagatacacaatgggcagtcatggcatgagaaggcccccgccagtc
aatcctggcagtgtcaccagtgacctttatgactcttgtgtggataatttctttgagcct
cagatttttgtcatttttaatgatgaccagagttacccttattttgttatccaatatgaa
gaagtcagtaacactgtttccatttga

>gi568815595r:156613887_156814928|GENSCAN_predicted_peptide_3|666_aa
MSVGYLTGDIKMSVGHLTGGMKTGEIRVDVIDSFGSSAFLSEGLKSLSRISQEKGMRMRV
MTPPSRFASPGKLNCPWSVIPRGQATLGWNFLHCPPVSQQKKPRALPSDVGQHAESPSSC
PSVSGIGGFLVSLTLTMKPRTLAVSVTALKVVRLELVPSDVQMCSEFLPSGVKLQTFAGS
VTALKAVRLEFFIPPGGLVVALASEVKLQTFVVSVTAHKSSVDPKPLGWSMGLGAMEQGV
ALVGEARAAQEPTEGMGGSGMAAAGPEPCPTGRQLRTGEELSAAPKVLSEYIESAITSYH
KLGNYKQQKFISHSLKSGKSKIEVLADLPTQREDDEPHNFVRMYKECLSCWLESGIPNLG
VWPKRIHTTAEKYREYEAREQTDQTQVQELHRSQDRDFETMAKLHIPVMVDEVVHCLSPQ
KGQIFLDMTFGSGGHTKAILQKESDIVLYALDRDPTAYALAEHLSELYPKQIRAMLGQFS
QAEALLMKAGVQPGTFDGVLMDLGCSSMQLDTPERGSSLRKDGPLDIRMDDLLQQSTYIA
TKTFQALRIFVNNELNELYTGLKTAQKFLRLGGRLVALSFHSLEDRIVKRFLLGISMTER
FNLSVRQQVMKTSQLGSDHENTEEVSMRRAPLMWELIHKKVFSPQDQDVQDNPRGRSAKL
RAAIKL

>gi568815595r:156613887_156814928|GENSCAN_predicted_CDS_3|2001_bp
atgtctgttggatatctgactggagatataaagatgtctgttggacatctgactggaggt
atgaagactggagaaataagagtggatgtgattgactcatttggaagcagtgcattttta
tcagagggactgaaaagcctttcgagaatcagtcaagaaaaaggaatgcgcatgagagtc
atgaccccaccttcaaggtttgcttcaccagggaaactcaactgcccctggtctgtgatc
ccaaggggtcaggccactttaggctggaacttcctccactgcccacctgtgagtcaacaa
aagaagccaagagctttaccctcagacgtaggccagcatgctgaaagcccgtcttcctgc
cctagtgtgtctggaattggtgggttcttggtctcactgactttaacaatgaagccgcgg
accctcgcggtgagtgttacagctcttaaggtggtgcgtctggagcttgttccttctgat
gttcagatgtgttcggagtttcttccttctggagtgaagctgcagacctttgcggggagt
gttacagctcttaaggcagtgcgtctggagtttttcattcctcccggtgggctcgtggtc
gcgctggcttcagaagtgaagctacagaccttcgtggtgagtgttacagctcataaaagc
agtgtggacccaaagccccttgggtggtcgatgggactgggtgccatggagcagggggtg
gcactcgtcggggaggctcgggccgcacaggagcccacggagggtatgggaggctcaggc
atggcggctgcaggtcccgagccctgccccacgggaaggcagctaaggacaggcgaggaa
ttgagcgcagcgccgaaagtcctgtctgagtacattgagagtgctataacaagttaccat
aaactgggaaattataaacaacagaaatttatttctcacagtttgaagtctggaaagtcc
aagattgaggtgctggcagatttgcctactcaacgtgaagatgatgagcctcataacttt
gtgagaatgtataaagaatgcctttcatgttggttggaatctggcatacctaatttaggt
gtctggccaaaaagaatacatactacagcagaaaaatatagagaatatgaagcccgggag
caaacagatcaaactcaagtccaggagttacacagatctcaagatagagattttgaaact
atggctaaattacatattccagtaatggtggatgaagttgttcattgtttgtcaccacaa
aaaggacagatttttctagatatgacatttggttcgggagggcacacaaaagccattctg
cagaaggagtcagatattgttctctatgccttggacagagacccaacagcttatgcatta
gctgaacatctttcagagttgtatcctaaacaaatccgagctatgctgggccagttcagc
caggcagaagccttattaatgaaagctggagtgcagccaggaacttttgatggagttctt
atggatcttgggtgttcctccatgcaacttgatactcctgaaagaggttcttcccttcgg
aaagatggccctttggacataagaatggatgacttactacagcaatctacctatattgcc
accaagactttccaggctcttcgcatatttgtgaacaatgagctcaatgaactctacacg
ggactgaagacagctcagaagtttctgagacttggtggtcgtcttgttgccctctccttc
cattcactagaggatcgcatcgtcaaaagatttttgcttggaataagcatgacagaaaga
tttaacctaagtgttagacaacaagtgatgaaaacatctcaattgggttcagatcatgaa
aacacggaagaagtctctatgagaagagctcctttaatgtgggaactgatacacaagaag
gtatttagtccacaagatcaggatgtacaagataaccccagagggcgctcagccaagctt
agagcagctatcaaattataa

>gi568815595r:156613887_156814928|GENSCAN_predicted_peptide_4|460_aa
MSGKDEQQEQTIAEDLVVTKYKMEGDIANRGDQVTGRKADVIKAAHLCAEAALRLVKPGN
QNTQVTQAWNKVAHSFNCTPVEGMLSHQLKQHVIDGEKTIIQNPTDQQKKDHEKAEFDIH
KVYAVDVLVSSGEGKPKDAGQRTTIYKQDPSKQYGLKMKTSRAFFSEVETRFDAMPFTLR
AFEDEKKAQMGVVECAKHELLQPFNVLYEKEGEFVAQFKFTVLLMPSGPMRITSGPFEPD
LCKSEMEVQDAELKALLQSSTSRKTQKKKTKKASKTAENVASGETLEENEAGDRWVPSPQ
LAAPASSPSHHTPDSVKRSSSSPPRTASRAGVSLPPPQFPNPLPSNNNQLQLTLEEAAVH
LRIGVELGEDKKRSWQQGRKIILEMSEEVVGGGGGGGSGRGGGGGVVCTCVPNIGQGATL
HEHLLNAIDLLNELIFLFKEKEDNCEGLKEGKVNYQKKAL

>gi568815595r:156613887_156814928|GENSCAN_predicted_CDS_4|1383_bp
atgtcgggcaaggatgagcagcaggagcaaactatcgctgaggacctggtcgtgaccaag
tataagatggagggtgacatcgccaacaggggggaccaagtaacagggaggaaggcagat
gttattaaggcagctcacctttgtgctgaagctgccctacgcctggtcaaacctggaaat
cagaacacacaagtgacacaagcctggaacaaagttgcccactcatttaactgcacgcca
gtagaaggtatgctgtcacaccagttgaagcagcatgtcattgatggagaaaaaaccatt
atccagaatcccacagaccagcagaagaaggaccatgaaaaagctgaatttgacatacat
aaagtgtatgctgtggatgttcttgtcagctcaggagagggcaagcccaaggatgcagga
cagagaaccactatttacaaacaagacccttctaaacagtatggactgaaaatgaaaact
tcacgtgccttcttcagtgaggtggaaacgcgttttgatgccatgccgtttactttaaga
gcatttgaagatgagaagaaggctcagatgggtgtggtggagtgcgccaaacatgaactg
ctgcaaccatttaatgttctctatgagaaggagggtgaatttgttgcccagtttaaattt
acagttctgctcatgcccagtggccccatgcggataaccagtggtcccttcgagcctgac
ctctgcaagtctgagatggaggtccaggatgcagagctaaaggccctcctccagagttct
acaagtcgaaaaacccagaaaaagaaaacaaagaaggcctccaagactgcagagaatgtc
gccagtggggaaacattagaagaaaatgaagctggggacaggtgggtcccatctccccag
cttgctgctcctgcctcatccccttcccaccacaccccggactctgtgaagcgcagttct
tcttctccacctaggaccgccagcagagcaggggtctccctgcccccaccccagttcccc
aacccactcccttccaacaacaaccagctccaactgactctggaagaagcagcagtacac
ttaagaattggagtagagttgggggaagataaaaagagaagttggcagcaggggagaaaa
attatcttagaaatgtctgaagaagtagtaggaggaggaggaggaggaggaagtgggaga
ggaggaggaggaggagtagtgtgcacgtgtgtcccaaatataggccaaggtgccacatta
catgagcatctactaaatgcaattgatcttctaaatgaattgatcttcttgtttaaagaa
aaggaagacaactgtgaaggtttgaaggaaggcaaagtgaactaccagaaaaaggcattg
tga