GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:27:13 Sequence gi568815595f:156577698_156805128 : 227431 bp : 39.57% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 26 21 6 1.05 1.03 Term - 5300 5136 165 2 0 55 45 205 0.680 9.83 1.02 Intr - 13976 13528 449 0 2 67 86 201 0.085 9.94 1.01 Init - 16605 16554 52 0 1 51 32 92 0.371 1.27 1.00 Prom - 18412 18373 40 -6.85 2.00 Prom + 23083 23122 40 -6.85 2.01 Init + 25915 26041 127 0 1 49 83 93 0.896 5.27 2.02 Intr + 26269 26410 142 2 1 71 83 67 0.993 2.99 2.03 Term + 28384 28726 343 1 1 89 43 176 0.452 6.20 2.04 PlyA + 30528 30533 6 1.05 3.00 Prom + 32863 32902 40 -8.05 3.01 Init + 39562 39707 146 0 2 81 56 131 0.652 8.84 3.02 Intr + 40961 41049 89 2 2 107 68 73 0.898 5.90 3.03 Intr + 44451 45350 900 1 0 74 -32 579 0.514 34.83 3.04 Intr + 45546 45792 247 1 1 19 52 270 0.924 12.10 3.05 Intr + 47367 47639 273 0 0 74 31 205 0.387 9.13 3.06 Term + 55120 55222 103 1 1 80 52 94 0.267 1.77 3.07 PlyA + 55227 55232 6 1.05 4.00 Prom + 57934 57973 40 -1.55 4.01 Init + 83653 83758 106 0 1 62 57 96 0.181 4.33 4.02 Intr + 91134 91198 65 1 2 101 87 50 0.234 3.72 4.03 Intr + 91500 91551 52 2 1 81 106 0 0.173 -1.34 4.04 Intr + 94438 94627 190 2 1 68 65 73 0.164 0.72 4.05 Intr + 96839 97099 261 2 0 73 103 261 0.998 21.78 4.06 Intr + 97477 97694 218 1 2 -3 83 181 0.642 5.72 4.07 Intr + 99960 100917 958 1 1 134 71 433 0.815 35.40 4.08 Intr + 116323 116491 169 1 1 86 53 90 0.649 4.33 4.09 Intr + 118168 118328 161 0 2 97 98 83 0.972 8.06 4.10 Intr + 124302 124421 120 0 0 7 19 167 0.544 0.39 4.11 Intr + 125727 126005 279 0 0 116 113 141 0.999 15.07 4.12 Term + 126987 127434 448 0 1 77 48 283 0.993 16.90 4.13 PlyA + 129031 129036 6 1.05 5.10 PlyA - 129287 129282 6 1.05 5.09 Term - 136596 136187 410 1 2 9 38 330 0.711 14.49 5.08 Intr - 137205 136640 566 2 2 50 -7 304 0.438 8.61 5.07 Intr - 140475 140435 41 0 2 106 92 59 0.628 4.20 5.06 Intr - 141894 141766 129 0 0 59 73 86 0.755 4.17 5.05 Intr - 163252 163074 179 2 2 47 68 126 0.135 5.22 5.04 Intr - 163903 163739 165 2 0 43 49 173 0.548 7.91 5.03 Intr - 164257 163930 328 1 1 101 68 148 0.611 8.45 5.02 Intr - 173486 173398 89 0 2 63 101 56 0.013 3.17 5.01 Init - 193803 193710 94 0 1 55 62 82 0.039 2.89 5.00 Prom - 220046 220007 40 -4.15 6.02 PlyA - 221131 221126 6 1.05 6.01 Term - 221340 221188 153 0 0 59 42 130 0.264 2.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 13976 13312 665 0 2 67 41 261 0.820 12.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_1|221_aa MSEACWESEEEQLLLAAMLEVLARAIRQEKEIKGIQLGNEEVKLSLFADDMIVYLENPIV SAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIATRRIKYLGIQL TRDVKDFFKENYKPLLNEIKEDTNKWKNIPCSWIGRINMVKIAILPKILHMTELLPSRDL EDESNYYELVATGSSGGRKGSRGAQFSRDSDCRSSQILVEY >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_1|666_bp atgagtgaagcctgctgggaaagtgaagaggaacagctgctccttgcagcaatgttggaa gttctggccagggcaatcagacaagagaaagaaataaagggtattcaattaggaaatgag gaagtcaaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccattgtc tcagcccaaaatctccttaagctgataagcaacttcagcaaagtctcagggtacaaaatc aatgtgcaaaaatcacaagcattcctatacaccaataacagacaaacagagagccaaatc atgagtgaactcccattcacaattgctacaaggagaataaaatacctaggaatccaactt acaagggatgtgaaggacttcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggatacaaacaaatggaagaacattccatgctcatggataggaagaatcaatatggtg aaaatagccatactgcccaagattcttcacatgaccgagcttctgccaagcagagatttg gaagatgaaagcaactactatgagctggtggccacaggttcttctggtggccggaagggc agtagaggtgctcagttctctagggacagtgattgcaggagttctcaaattctggtggag tattga >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_2|203_aa MLVVQLAGSLSTKCSTGENKPKVLQECKNGNAQCRPEKRESRGLFHLIGDYNQSPHLRLP VNGRPQLNTFLAATATQFTTSLDQTVQLARVVGTQDVPNVRLLEGYYQKQGDVLRVRRRQ GKDISKQPKQLATGDKPFHQGGGEPAGRPAGRPGVRVPRSCAGSVCRRSGCAARVLKFQS AALHGGWKRWKKDKIKRNFPTHR >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_2|612_bp atgctggtagttcagttggcaggatcattaagtactaaatgctccacaggagagaacaag cccaaagtgctccaggagtgcaaaaatgggaatgctcagtgcagacctgagaagagagag tccagagggttgttccacctcattggagattacaaccaaagtccacatttgaggctacca gtgaatggtaggccccagctaaacaccttcctggccgccactgccacacagttcaccacc agtttggaccaaacagtccaactggcaagagtggtcggtacacaggacgtgccgaacgtg cgattattagaaggctattaccagaagcaaggggatgtgctcagggtgaggcgcaggcaa ggaaaagacatttccaaacaacccaaacagcttgcgactggggacaaaccatttcaccag ggcggtggggagccagccggccggccggccggccgacccggggtgcgtgtgcccagaagc tgtgcgggctcagtgtgtagacgctctggctgcgcagcgcgcgtgctcaagttccaaagc gctgcactccacggcggctggaaaaggtggaagaaggacaaaataaaacgaaatttcccc acacatcgctaa >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_3|585_aa MGGKERKGMKEQVAELYREVIADLTDKETLEQSFEGGNESAKQTFERRKIPPTIYSEAPE VQCNKKCFIGVAKWEPPAGGHTEITETIKNLEEVQIVCGTHSPYNSPVWPVRKPNGTCWM MVDYWELNKVTPPLHAAVLSIMDLMDCLMTELGQYHYEVDLANAFFSIDIAPETQEQFAF MWDRQQWTFTVLLQGYVHSPTICPGLVATDLAAWQCPEGVHLFHYIDDIMLTSDSLADLE VVAPLLQQHVAAYGWAINESKVQGPGLAAKFLGVIWSGKTKAIPEAMIDKIQAYPWPTTV KKLQIFVGLLGYQRAFVSFLAQMIKLLHWLTKKGAPWDWDDAAETAFLAAKLAIQQAQAL WVVDLGHPFELDVHVTTDVRWVHSWVTTPQTGTVQTSTLAKWGAYVEQQGTLSTNPLAAE LQEVLGPVVLMQDKAMGPEAALDPEPSPFKEGHPSIPDGAWLTCSNCIVDWAHTYAEVTN VSNSLICTTLPAAAADCLPWHIHPVSSVNWTWLETWGPMADASNAMQQALDKGCHKAHGA PTPWLAHSVYNGSSSPSTKSLSHGELKPSPEADAGTMLLVQPAEP >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_3|1758_bp atggggggaaaagaaagaaaaggcatgaaggaacaagttgcagaattatacagagaggtc atagcagacctcactgacaaggaaacattagaacaaagctttgaaggagggaatgagtca gccaagcagacatttgaaagaagaaagattcctccgacaatatacagtgaagctcctgaa gttcaatgcaataagaagtgcttcattggcgtggcaaagtgggaacctcctgcaggtggg catacagagataactgagacaattaaaaacctggaggaggtgcagatagtgtgtggcacc cacagcccctacaattctccagtgtggccagttagaaagcctaatggaacttgttggatg atggtggactattgggaactgaataaagtaacaccccctttgcatgcagctgtactgtca atcatggatttgatggactgtttgatgacagaactgggacagtatcactatgaagtggac ttggccaatgcatttttctccattgacattgctccagagacccaggaacagtttgccttc atgtgggataggcaacaatggactttcacagtgttgctgcagggctatgtgcatagcccc accatatgtcctggtctagttgcaacagatttagctgcctggcaatgtccagaaggggtc cacctattccattatattgatgatatcatgttaacctctgattctcttgcagatttagaa gtggtggcacccctcttgcagcaacatgtagcagcatacggttgggccatcaatgaatcc aaggtccaagggcctggattagctgccaaattcttgggagttatctggtcaggtaagaca aaggccatcccagaggctatgattgataagattcaggcatatccctggcccaccacagtg aagaagctgcagatttttgtggggctcctgggatatcagcgggcatttgtgtccttttta gctcaaatgataaaattgttgcattggttaacaaagaagggagctccctgggattgggat gatgcggctgagaccgccttcctggcagccaagctggctattcagcaggcacaagcccta tgggtagttgacttggggcacccatttgagctggatgtgcatgtgaccacagatgtgaga tgggtgcattcatgggtaacgaccccccagactgggacggtgcagacatccactttggca aagtggggtgcctacgtggagcagcagggtacactgagtacaaaccccttagcagcagag ttgcaagaggtcttgggacctgtagtcctaatgcaagataaggccatggggcctgaggca gccctagaccctgagccttcaccatttaaggaagggcatccctcaattcctgatggggca tggttaacatgctccaactgcattgtggactgggcccacacctacgctgaagtgaccaat gtttccaactctttgatctgtaccacccttccagcagcagctgcggactgcttgccttgg cacatacaccccgtgtcttcagtgaactggacatggttggagacttggggccccatggct gatgcctcgaatgcaatgcagcaagctttggacaaaggctgccacaaggcccatggtgca cccaccccctggctggcccatagtgtttataatgggtcctcttccccttctaccaagagc ttgtcccatggtgagctgaagccctcaccagaagcagatgctggtaccatgcttcttgta cagcctgcagaaccgtga >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_4|1008_aa MKSSNTGCEIQIKHRVTETIVGYGNIDTVIIRKTLGLLTLTFPKQKHSGALSEAEKKNPT SAELLLREIGNLPLEERKKGRNSRGDSIRKGNAFPEISSMCLEVYPMAIHNPMGSREMNF SARHTVIPPSKKDGPRPRLRLRDGPSGGGDRKPLKAEIEVTGELAGLGARSRLSAFAAAE WPVRGSPRRGSVVPRRRLPPSALISGAGTAFHSAPVVRSLRLPFRPRGGGHGSYGSLPAT PRFGLQPRSLEDVFTLTLRAHKSSSSSPPPREALAAFPPRFAAQFCEPPRPVGRSSGGFL DSEEQLELIHIMEMETTEPEPDCVVQPPSPPDDFSCQMRLSEKITPLKTCFKKKDQKRLG TGTLRSLRPILNTLLESGSLDGVFRSRNQSTDENSLHEPMMKKAMEINSSCPPAENNMSV LIPDRTNVGDQIPEAHPSTEAPERVVPIQDHSFPSETLSGTVADSTPAHFQTDLLHPVSS DVPTSPDCLDKVIDYVPGIFQENSFTIQYILDTSDKLSTELFQDKSEEASLDLVFELVNQ LQYHTHQENGIEICMDFLQGTCIYGRDCLKHHTVLPYHWQIKRTTTQKWQSVFNDSQEHL ERFYCNPENDRMRMKYGGQEFWADLNAMNVYETTEFDQLRRLSTPPSSNVNSIYHTVWKF FCRDHFGWREYPESVIRLIEEANSRGLKEVRFMMWNNHYILHNSFFRREIKRRPLFRSCF ILLPYLQLCVDDNAVVVVVVVVVVVVVVVVVVVVVVVVVMVVSGKTRTLGGVPTQAPPPL EATSSSQIICPDGVTSANFYPETWVYMHPSQDFIQVPVSAEDKSYRIIYNLFHKTVPEFK YRILQILRVQNQFLWEKYKRKKEYMNRKMFGRDRIINERHLFHGTSQDVVDGICKHNFDP RVCGKHATMFGQGSYFAKKASYSHNFSKKSSKGVHFMFLAKVLTGRYTMGSHGMRRPPPV NPGSVTSDLYDSCVDNFFEPQIFVIFNDDQSYPYFVIQYEEVSNTVSI >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_4|3027_bp atgaaaagtagtaacactggatgtgaaattcagatcaaacatagagttacagaaacgata gttggctatggcaatattgacactgtgatcattagaaagactctaggccttttaaccctc acctttcctaaacagaagcactctggtgccttgtcagaggcagagaaaaaaaatcccacg agtgcagaactccttctcagggaaataggaaatcttcctttggaagaaagaaaaaaagga agaaattcaaggggtgactctattaggaaaggaaacgcttttccagaaatctcaagcatg tgtctagaagtgtatcccatggccatccataaccccatgggctctagggaaatgaatttt tcagctaggcacactgtcatccctccttcaaaaaaagatgggccacgtccgaggctccgc ctccgcgacgggccgtcgggaggaggcgaccggaagccacttaaagcagagatcgaggtg acaggcgagctggctggactcggagcgcggtcgaggctttctgcgttcgcggcggcggaa tggcccgtgcgcggctcgccgcgtcgcggctctgtggtccctagacgtcggctcccgccc tcggcgctgatctccggcgcgggcactgctttccactcggctcctgtcgtccgttctctc aggctcccgttcagaccccgggggggagggcacggcagctacgggagccttccggctacc ccgcgtttcgggctgcagcccagaagtttggaagatgttttcacattaactttgagagcg cacaagtcttcgtcttcctccccgccgccgcgggaagcgctcgccgcctttcccccgcgc ttcgcggctcagttctgcgagcccccaagacccgttggacgctcctcgggaggattttta gactctgaggagcagttggagctaatccacattatggaaatggaaaccaccgaacctgag ccagactgtgtagtgcagcctccctctcctcctgatgacttttcatgccaaatgagactc tctgagaagatcactccattgaagacttgttttaagaaaaaggatcagaaaagattggga actggaaccctgaggtctttgaggccaatattaaacactcttctagaatctggctcactt gatggggtttttagatctaggaaccagagtacagatgagaacagcttacatgaacctatg atgaagaaagccatggaaatcaattcatcatgcccaccagcagaaaataatatgtctgtt ctgattcctgataggacaaatgttggggaccagataccggaagcccatccttccactgaa gctccagaacgagtggttccaatccaagatcacagctttccatcagaaaccctcagtggg acggtggcagattccacaccagctcacttccagactgatcttttgcacccagtttcaagt gatgttcctactagtcctgactgcttagacaaagtcatagattatgttccaggcattttc caagaaaacagttttacaatccaatacattctggacaccagtgataagctgagtactgag ctctttcaggacaaaagtgaagaggcttcccttgacctcgtgtttgagctggtgaaccag ttgcagtaccacactcaccaagagaacggaattgaaatttgcatggactttctgcaaggc acttgtatttatggcagggattgtttgaagcaccacactgtcttgccatatcattggcag atcaaaaggacaactactcaaaagtggcagagtgtattcaatgattctcaggagcacttg gaaagattttactgtaacccagaaaatgatagaatgagaatgaagtatggaggacaagaa ttttgggcagatttgaatgccatgaacgtgtatgaaacaactgaatttgaccaactacga aggctgtccacaccaccctctagcaatgtcaactctatttaccacacagtctggaaattc ttctgtagggaccactttggatggagagagtatcccgagtctgtcattcgattgattgaa gaagccaactctcggggtctgaaagaggttcgatttatgatgtggaataaccactacatc ctccacaattcattcttcaggagagagataaaaaggagacccctcttccgctcctgtttt atactgcttccatatttacaactgtgtgttgatgataatgcggtggtggtggtggtggtg gtggtggtggtggtggtggtggtggtggtggtggttgtggtggtggtggtggtggtgatg gtggtgtcaggcaaaactaggacacttggtggggttcccacacaagctcctccacctctt gaagcaacttcatcatcacaaattatctgcccagatggggtcacttcagcaaacttttac cctgaaacttgggtttatatgcatccatctcaggacttcatccaagtccctgtttctgca gaggataaaagttatcggatcatttacaatctttttcataagactgtgcctgagtttaaa tacagaattttgcagatattgagagtccaaaaccagtttctttgggagaaatataaaagg aaaaaggaatatatgaacaggaaaatgtttggccgtgacaggataataaatgagagacat ttatttcatggaacatcccaggatgtggtagatggaatctgcaaacacaactttgaccct cgagtctgtggaaagcatgctacaatgtttggacaaggcagttattttgcaaagaaggca agctactctcataacttttctaagaagtcctccaaaggagtccacttcatgtttctggcc aaagtgctgacgggcagatacacaatgggcagtcatggcatgagaaggcccccgccagtc aatcctggcagtgtcaccagtgacctttatgactcttgtgtggataatttctttgagcct cagatttttgtcatttttaatgatgaccagagttacccttattttgttatccaatatgaa gaagtcagtaacactgtttccatttga >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_5|666_aa MSVGYLTGDIKMSVGHLTGGMKTGEIRVDVIDSFGSSAFLSEGLKSLSRISQEKGMRMRV MTPPSRFASPGKLNCPWSVIPRGQATLGWNFLHCPPVSQQKKPRALPSDVGQHAESPSSC PSVSGIGGFLVSLTLTMKPRTLAVSVTALKVVRLELVPSDVQMCSEFLPSGVKLQTFAGS VTALKAVRLEFFIPPGGLVVALASEVKLQTFVVSVTAHKSSVDPKPLGWSMGLGAMEQGV ALVGEARAAQEPTEGMGGSGMAAAGPEPCPTGRQLRTGEELSAAPKVLSEYIESAITSYH KLGNYKQQKFISHSLKSGKSKIEVLADLPTQREDDEPHNFVRMYKECLSCWLESGIPNLG VWPKRIHTTAEKYREYEAREQTDQTQVQELHRSQDRDFETMAKLHIPVMVDEVVHCLSPQ KGQIFLDMTFGSGGHTKAILQKESDIVLYALDRDPTAYALAEHLSELYPKQIRAMLGQFS QAEALLMKAGVQPGTFDGVLMDLGCSSMQLDTPERGSSLRKDGPLDIRMDDLLQQSTYIA TKTFQALRIFVNNELNELYTGLKTAQKFLRLGGRLVALSFHSLEDRIVKRFLLGISMTER FNLSVRQQVMKTSQLGSDHENTEEVSMRRAPLMWELIHKKVFSPQDQDVQDNPRGRSAKL RAAIKL >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_5|2001_bp atgtctgttggatatctgactggagatataaagatgtctgttggacatctgactggaggt atgaagactggagaaataagagtggatgtgattgactcatttggaagcagtgcattttta tcagagggactgaaaagcctttcgagaatcagtcaagaaaaaggaatgcgcatgagagtc atgaccccaccttcaaggtttgcttcaccagggaaactcaactgcccctggtctgtgatc ccaaggggtcaggccactttaggctggaacttcctccactgcccacctgtgagtcaacaa aagaagccaagagctttaccctcagacgtaggccagcatgctgaaagcccgtcttcctgc cctagtgtgtctggaattggtgggttcttggtctcactgactttaacaatgaagccgcgg accctcgcggtgagtgttacagctcttaaggtggtgcgtctggagcttgttccttctgat gttcagatgtgttcggagtttcttccttctggagtgaagctgcagacctttgcggggagt gttacagctcttaaggcagtgcgtctggagtttttcattcctcccggtgggctcgtggtc gcgctggcttcagaagtgaagctacagaccttcgtggtgagtgttacagctcataaaagc agtgtggacccaaagccccttgggtggtcgatgggactgggtgccatggagcagggggtg gcactcgtcggggaggctcgggccgcacaggagcccacggagggtatgggaggctcaggc atggcggctgcaggtcccgagccctgccccacgggaaggcagctaaggacaggcgaggaa ttgagcgcagcgccgaaagtcctgtctgagtacattgagagtgctataacaagttaccat aaactgggaaattataaacaacagaaatttatttctcacagtttgaagtctggaaagtcc aagattgaggtgctggcagatttgcctactcaacgtgaagatgatgagcctcataacttt gtgagaatgtataaagaatgcctttcatgttggttggaatctggcatacctaatttaggt gtctggccaaaaagaatacatactacagcagaaaaatatagagaatatgaagcccgggag caaacagatcaaactcaagtccaggagttacacagatctcaagatagagattttgaaact atggctaaattacatattccagtaatggtggatgaagttgttcattgtttgtcaccacaa aaaggacagatttttctagatatgacatttggttcgggagggcacacaaaagccattctg cagaaggagtcagatattgttctctatgccttggacagagacccaacagcttatgcatta gctgaacatctttcagagttgtatcctaaacaaatccgagctatgctgggccagttcagc caggcagaagccttattaatgaaagctggagtgcagccaggaacttttgatggagttctt atggatcttgggtgttcctccatgcaacttgatactcctgaaagaggttcttcccttcgg aaagatggccctttggacataagaatggatgacttactacagcaatctacctatattgcc accaagactttccaggctcttcgcatatttgtgaacaatgagctcaatgaactctacacg ggactgaagacagctcagaagtttctgagacttggtggtcgtcttgttgccctctccttc cattcactagaggatcgcatcgtcaaaagatttttgcttggaataagcatgacagaaaga tttaacctaagtgttagacaacaagtgatgaaaacatctcaattgggttcagatcatgaa aacacggaagaagtctctatgagaagagctcctttaatgtgggaactgatacacaagaag gtatttagtccacaagatcaggatgtacaagataaccccagagggcgctcagccaagctt agagcagctatcaaattataa >gi568815595f:156577698_156805128|GENSCAN_predicted_peptide_6|50_aa VPNIGQGATLHEHLLNAIDLLNELIFLFKEKEDNCEGLKEGKVNYQKKAL >gi568815595f:156577698_156805128|GENSCAN_predicted_CDS_6|153_bp gtcccaaatataggccaaggtgccacattacatgagcatctactaaatgcaattgatctt ctaaatgaattgatcttcttgtttaaagaaaaggaagacaactgtgaaggtttgaaggaa ggcaaagtgaactaccagaaaaaggcattgtga