GENSCAN 1.0	Date run: 4-Nov-116	Time: 19:05:49

Sequence gi568815575f:103509065_103709664 : 200600 bp : 39.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  21314  21458  145  1  1   70   73    50 0.561   2.03
 1.02 Intr +  22102  22241  140  2  2   79   11   147 0.211   5.36
 1.03 Term +  26689  26889  201  0  0   66   50   139 0.324   4.31
 1.04 PlyA +  27824  27829    6                               1.05

 2.00 Prom +  31618  31657   40                              -4.65
 2.01 Sngl +  35255  35581  327  1  0   91   43   150 0.945   6.56
 2.02 PlyA +  36359  36364    6                               1.05

 3.00 Prom +  41311  41350   40                              -4.05
 3.01 Init +  42927  42983   57  2  0   62   76    18 0.593  -0.64
 3.02 Term +  45502  45918  417  0  0   50   45   216 0.774   7.79
 3.03 PlyA +  47540  47545    6                               1.05

 4.03 PlyA -  48209  48204    6                               1.05
 4.02 Term -  53155  53056  100  0  1   89   50    76 0.472   0.52
 4.01 Init -  61664  61504  161  2  2   89  116   169 0.244  19.15
 4.00 Prom -  62988  62949   40                              -5.75

 5.00 Prom +  66351  66390   40                              -6.25
 5.01 Init +  67406  67468   63  1  0   79   96    36 0.095   2.71
 5.02 Intr +  68030  68149  120  1  0   13   99    73 0.054   0.67
 5.03 Intr +  76561  76696  136  0  1   74   44   101 0.030   3.52
 5.04 Intr +  77155  77237   83  2  2  125   40   105 0.048   7.84
 5.05 Term +  77585  78259  675  1  0  106   36   542 0.099  43.53
 5.06 PlyA +  78642  78647    6                               1.05

 6.00 Prom +  91690  91729   40                              -6.25
 6.01 Init +  98605  98652   48  0  0   77  103    59 0.818   7.30
 6.02 Intr +  98995  99058   64  0  1   36    5   102 0.268  -5.93
 6.03 Intr +  99540 100562 1023  1  0  136   75   723 0.366  65.44
 6.04 Term + 115837 116144  308  2  2  -14   36   245 0.342   3.39
 6.05 PlyA + 117325 117330    6                              -0.45

 7.00 Prom + 117773 117812   40                              -7.65
 7.01 Init + 120853 121269  417  0  0  105   52   380 0.321  32.58
 7.02 Intr + 122208 122302   95  2  2   57   82    72 0.194   1.34
 7.03 Intr + 124221 124311   91  0  1   81   71    56 0.191   2.28
 7.04 Term + 129789 130097  309  2  0  -24   35   288 0.272   6.38
 7.05 PlyA + 130893 130898    6                               1.05

 8.00 Prom + 130927 130966   40                              -6.05
 8.01 Sngl + 135048 135470  423  2  0   68   42   210 0.936  10.44
 8.02 PlyA + 135474 135479    6                               1.05

 9.02 PlyA - 136776 136771    6                               1.05
 9.01 Sngl - 142956 142567  390  0  0    5   43   260 0.945   9.17
 9.00 Prom - 143301 143262   40                              -5.85

10.00 Prom + 146293 146332   40                              -5.75
10.01 Init + 157178 157486  309  1  0   54  115   146 0.797  11.26
10.02 Term + 159354 159524  171  2  0   73   39   213 0.999  11.74
10.03 PlyA + 162307 162312    6                               1.05

11.02 PlyA - 165222 165217    6                               1.05
11.01 Sngl - 167963 167097  867  2  0   13   53   795 0.593  64.14
11.00 Prom - 175309 175270   40                              -4.55

12.03 PlyA - 176372 176367    6                               1.05
12.02 Term - 179166 178982  185  1  2   51   42   114 0.207  -0.18
12.01 Init - 181740 181653   88  0  1   64  106    55 0.172   5.75
12.00 Prom - 194151 194112   40                              -5.75

13.03 PlyA - 195414 195409    6                               1.05
13.02 Term - 198450 198280  171  0  0   86   52   182 0.983  11.24
13.01 Intr - 199503 199406   98  1  2   78   60    99 0.810   4.91

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_1|161_aa
MWNKKEQRREQKTAGPPGITSTSLCLCSFPHPICPLLFVTVWDVTGIQGKIVTVHWRNMT
NTILNQMMKVNITSDIMQASYTPRYDATRRAPELVGSANSCGPTPRSLGTKDKISSPLSS
SSPRGVVEWRTRGATIAGEWNLAIFQALEAHLLEGKLPRVL

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_1|486_bp
atgtggaacaagaaagaacagcgaagagaacagaaaacagcaggccctccagggatcacc
tctacctctctctgtctttgctcctttccccaccccatttgtcccctgctctttgtgaca
gtatgggatgtaacaggaatccaaggaaaaatagttactgtacactggagaaacatgaca
aacaccatccttaaccaaatgatgaaggttaacatcaccagtgacatcatgcaggcatcg
tatactcccagatatgatgcgacaagaagggcaccggaactagtgggatcagccaactca
tgtggccctacacccaggtctcttggtacaaaggacaagatttcatctcctctgtcatct
tcatcacccagaggggttgtggagtggaggacccgtggggccacaatagcaggggagtgg
aatttggccatttttcaagccctggaagcccatcttctagaagggaagctgcccagggtc
ctctag

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_2|108_aa
MEKKKEITHRESISGENSILETPGDSYEQRRKVPCHPCTGNCRGTQRSAPQRTPVSLATE
VTSSHSYQGNPERETQLHTSPSPQEVATPLSCFRKGTTTSSNPEHSLT
>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_2|327_bp
atggagaaaaaaaaggagattacccacagagaaagtatttctggtgagaacagcatactt
gagacaccaggagacagctatgagcaaagaagaaaggtgccatgtcatccatgcacaggg
aactgtcgtggtacccagaggagtgctccacagaggacaccagtatcccttgccactgag
gtaaccagcagccattcctaccagggaaatccagagagggagacacagctgcatacctct
ccctctccacaagaagtggccaccccattgagctgcttcaggaaaggaaccaccacctct
tccaaccctgagcattccttaacatag

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_3|157_aa
MQRQFSAERSSFQQLVLEQGYLPACTCITEASSNPVGEHFYPHFTEEDARLRESVYLSHF
LAPCLYLESSSGTPLEQDTTFTGAKGVLLLPASSDSEGNLSQSGNKATWIWDLFLSLHRR
APAATVESKHLKTKPCSPHQAKSVSIKKSSWLSQSPK

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_3|474_bp
atgcaaagacaattcagtgcagaaagatcatcctttcaacaacttgtgctggaacagggc
tacctaccggcttgcacctgtatcactgaagcttccagcaatcctgtgggtgagcatttt
tatccccatttcacagaggaagacgctaggctcagagagagtgtttacctgtctcatttt
ttggcaccttgcctttacttggaaagtagctccgggacacccctggagcaggacaccaca
ttcactggggccaagggagtcctacttctccctgcctcatcagacagcgagggcaacttg
tcccaaagcgggaacaaggcaacttggatatgggatctcttcctctctctacatcgcaga
gcacccgcagccacagttgagtctaaacacctgaagacgaaaccctgctctcctcaccag
gccaagtcggtctccattaaaaagtcgtcatggttgtcacagtcgccaaaatga

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_4|86_aa
MNDFLIDIFEHITTKAGCQVATLSAPLSPPERSRPPEPDAVWGYPQTRCIENQSECVARG
EEAWRAVDPSQNLGLALTILREDKGH

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_4|261_bp
atgaatgattttcttatcgacatctttgagcacatcaccaccaaggctggctgccaggtt
gctacactgagtgctccactatcacctccagagagatccagaccacccgagcctgatgct
gtctggggatatccacaaacacgctgtattgaaaaccaaagtgagtgtgttgcccgaggt
gaagaggcttggagggcagttgatccctcacagaatttaggtctggctctaaccatatta
agagaggacaaaggtcactga

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_5|358_aa
MPAGGQAPQLTPVIPAFWEARGLPNSKGRDKVHICQCEEWKGHISYVERDITTSEIKSLP
QVSVRAFHAASGTLEWRKGNGYGRGKGWGWGRVESREAGPSVDRDGGLRVCCSQRSAKPE
KEEQPVQNPRRSVKDRKRRGNLDMEKLYSENEGMASNQGKMENEEQPQDERKPEVTCTLE
DKKLENEGKTENKGKTGDEEMLKDKGKPESEGEAKEGKSEREGESEMEGGSEREGKPEIE
GKPESEGEPGSETRAAGKRPAEDDVPRKAKRKTNKGLAHYLKEYKEAIHDMNFSNEDMIR
EFDNMAKVQDEKRKSKQKLGAFLWMQRNLQDPFYPRGPREFRGGCRAPRRDIEDIPYV

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_5|1077_bp
atgccagctggaggccaggcgccgcagctcacgcctgtaatcccagcgttttgggaggcc
aggggcctcccaaattcaaaaggaagagacaaagtccacatctgtcaatgcgaagaatgg
aaaggtcacatatcatatgtggagcgggacataacaacttcggaaattaaatctcttccc
caggtcagtgtgcgggccttccacgctgccagcggaacactggaatggcggaaggggaac
gggtacggcagggggaaaggctgggggtggggtcgggtcgagtcgagggaagctggcccg
agtgtggacagagatggcggtctgcgcgtctgttgttcccagcgctctgcgaagcctgaa
aaggaggagcaacctgtccagaatccccgcaggtccgtgaaagacaggaaaaggagggga
aatctcgacatggaaaaactctacagtgaaaatgaaggaatggcttcaaaccaaggaaag
atggaaaatgaagaacagccacaagacgagagaaagccagaagtaacttgtactctggaa
gacaagaagttagaaaacgagggaaagacagaaaacaagggcaaaacaggagatgaggaa
atgttaaaggataaaggaaagccagagagtgagggagaggcaaaagaaggaaagtcagag
agggagggagagtcagagatggagggaggatcagagagagagggaaaaccagagatagag
ggaaagccagagagtgaaggagagccagggagtgaaacaagggctgcaggaaagcgccca
gctgaggatgatgtacccaggaaagccaaaagaaaaactaataaggggctggctcattac
ctcaaggagtataaagaggccatacatgatatgaatttcagcaatgaggacatgataaga
gaatttgacaatatggctaaggtgcaggatgagaagagaaaaagcaaacagaaattgggg
gcgtttttgtggatgcaaagaaatttacaggaccccttctaccctagaggtccaagggaa
ttcaggggtggctgcagggccccacgaagggacattgaagacattccttatgtgtag

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_6|480_aa
MASERDTLRGCREMRQVSVKVGAAGGTTEGVEVAGSRSLRVCSQRSARPKKEEQPVQNPC
RSVKVVVGELNGQGPIRVESVEGPARAELGPLPIPHSHEHTPYQAEPVQRRWQGLPGNGW
LTCVGGVAGYVVGRRPSWGPCQLRGTLTRGEMQVLSRPAFHPMPSVSHVSYLSPLPSIPI
PQDRKRRGNLDMEKPYNKNEGNLENEGKPEDEVEPDDEGKSDEEEKPDVEGKTECEGKRE
DEGEPGDEGQLEDEGSQEKQGRSEGEGKPQGEGKPASQAKPESQPRAAEKRPAEDYVPRK
AKRKTDRGTDDSPKDSQEDLQERHLSSEEMMRECGDVSRAQEELRKKQKMGGFHWMQRDV
QDPFAPRGQRGVRGVRGGGPGLRAGTAISAIRKECRGSSHRGNVWAGEGRTPQVQADRAC
KVKPRGLAKKEGSVPATEVQPPQQQESLEEKPTLGFLRSPRERPHHLETVVMAAAAQCCV

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_6|1443_bp
atggcatcagagagagataccctgcgaggatgccgtgagatgcgccaggtcagtgtgaag
gtgggcgctgccggcggaaccacggaaggggtggaggtggctggcagtaggagtctgcgc
gtctgttcccagcgctctgcgaggcctaaaaaggaggagcaacctgtccagaatccctgc
aggtctgtgaaggtggttgtgggggagctcaatggacagggccctataagggttgagagt
gtggagggcccggccagggccgaactggggcccctgcccatcccccactcacatgaacac
acaccttaccaggctgaaccagtgcagagaaggtggcagggactgccagggaatggctgg
ctcacgtgtgtgggaggagtggcaggctacgtggtgggcagaaggccctcttgggggccc
tgccagcttaggggaaccctcaccaggggtgagatgcaagtactatccagacctgcattc
caccccatgccctcagtctcccatgtctcttatttgtctcctcttccctccattcccatc
ccccaggacaggaaaaggaggggaaatctcgacatggaaaaaccctacaataaaaatgaa
ggaaacctggaaaacgagggaaagccagaagatgaagtagagcctgatgatgaaggaaag
tcagacgaggaagaaaagccagacgtggaggggaagacagaatgcgagggaaagagagag
gatgagggagagccaggtgatgagggacaactggaagatgagggaagccaggaaaagcag
ggcaggtccgaaggtgagggcaagccacaaggcgagggcaagccagcctcccaggcaaag
ccagagagccagccgcgggccgccgaaaagcgcccggctgaagattatgtgccccggaaa
gcaaaaagaaaaacggacagggggacggacgattcccccaaggactctcaggaggactta
caggaaaggcatctgagcagtgaggagatgatgagagaatgtggagatgtgtcaagggct
caagaggagctaaggaaaaaacagaaaatgggtggttttcattggatgcaaagagatgta
caggatccattcgccccaaggggacaacggggtgtcaggggagtgaggggtggaggtcca
ggtctgagagcaggtactgccattagtgccattaggaaagaatgcagaggcagcagccac
agagggaacgtctgggctggagagggcagaactcctcaagtccaggcagacagagcctgc
aaagtgaaaccaaggggactcgcaaagaaggagggttctgttcctgccacagaagtgcag
cctccccagcaacaggaatccctggaggagaaacccaccctgggctttctcagatccccc
agagaaaggccacaccacttggagactgttgtcatggcagcagctgcccagtgctgtgtg
taa

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_7|303_aa
MDKPRKENEEEPQSAPKTDEERPPVEHSPEKQSPEEQSSEEQSSEEEFFPEELLPELLPE
MLLSEERPPQEGLSRKDLFEGRPPMEQPPCGVGKHKLEEGSFKERLARSRPQFRGDIHGR
NLSNEEMIQAADELEEMKRIPTVSARHSCSFSWVLILPLALIPADVLDCLSLQQAQGPLR
LMAKLPCTDTPGGGELTALLQSTGPGLRAGTAISAIRKECRGSSHRGNVWAGEGRTPQVQ
ADRACKVKPRGLAKKEGSVPATEVQPPQQQESLEKKPTLGFLRSPRERPHPLDTVVMAAA
AQC

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_7|912_bp
atggacaaaccacgcaaagaaaatgaagaagagccgcagagcgcgcccaagaccgatgag
gagaggcctccggtggagcactctcccgaaaagcagtcccccgaggagcagtcttcggag
gagcagtcctcggaggaggagttctttcctgaggagctcttgcctgagctcctgcctgag
atgctcctctcggaggagcgccctccgcaggagggtctttccaggaaggacctgtttgag
gggcgccctcccatggagcagcctccttgtggagtaggaaaacataagcttgaagaagga
agctttaaagaaaggttggctcgttctcgcccgcaatttagaggggacatacatggcaga
aatttaagcaatgaggagatgatacaggcagcagatgagctagaagagatgaaaagaata
cctaccgtatctgctaggcacagctgctccttctcctgggtcctgatccttccactggct
ctcatccctgctgacgtgctggactgcttaagcctacagcaggcacagggccctcttagg
ctaatggccaagctgccctgcacagacacacctggaggaggagaactgactgccctattg
cagtcaacaggtccaggtctgagagcaggtactgccattagtgccattaggaaagaatgc
agaggcagcagccacagagggaacgtctgggctggagagggcagaactcctcaagtccag
gcagacagggcctgcaaagtgaaaccaaggggactcgcaaagaaggagggttctgttcct
gccacagaagtgcagcctccccagcaacaggaatccctggagaagaaacccaccctgggc
tttctcagatcccccagagaaaggccacaccccctggatactgttgtcatggcagcagct
gcccagtgctaa

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_8|140_aa
MVIQRILPDLIQDHQGSICKSLQKPQHYWAQGLSPFKYLESLPKKDRCKQTQTVKTTTKT
YLYDSQTSPNNHKHQDHPGKHDLTQLNTAPRTYTGETEICDLSEREFKIAVFRKLKKIQD
STEKEFRILSHTFNKDSKII

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_8|423_bp
atggtaatccagagaattcttccagatcttatccaagatcaccaaggtagtatctgtaaa
agtctgcaaaaaccacagcattattgggctcagggcctaagtcccttcaaatacctggaa
agccttcccaagaaagacaggtgcaaacaaacccagactgtgaagactacaacaaaaacc
tatctctatgattcccagacatcaccaaacaaccacaagcaccaagatcatccaggaaaa
catgacctcactcaactaaatacggcaccaaggacctatactggagaaacagagatatgt
gacctctcagaaagagaattcaaaatagctgtattcaggaaactcaaaaaaattcaagat
agcacagagaaggaattcagaattctatcacatacatttaacaaagacagtaaaataatt
taa

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_9|129_aa
MLENASSSGICLYATGAWNWLIDPETQKVSFFTSLWNHPFFTISYITLIGLFFAGIHKRV
VAPSIIAAQRQTILAEYNMSCDDTGKLILKPRPHVQCQSSLIAIGRKTALLRISDTAKSH
KGFLLQLDM

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_9|390_bp
atgttggagaatgcttcttctagtggtatctgtctgtacgctactggtgcctggaactgg
ttgatagatccagagacacaaaaggtgtccttcttcacatcattatggaatcatccattt
tttaccattagctatatcactctaataggcttgttctttgctggaatacacaagagagta
gttgcaccatcaattatagctgctcaacgtcaaacgatattagcagaatacaatatgtct
tgtgatgatacaggaaaactaattttgaaacctaggcctcatgttcaatgccaatcttca
ctaattgctattggacgtaaaacagcccttcttcgaataagtgatacagcaaaaagccat
aaaggattccttttgcagttggatatgtaa

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_10|159_aa
MLFKALVVEGQQHPEPASVGPSPPAYPHRRLVCQSHIQAVTYLANSRAESTEQPEPWRES
SVLTRIPLLLGDSAKECEPLWWGSIEGIRLRVFAKFSGLSQQKFVGGKRQIGVSHVQEVG
HEGEQAALKESCLGVLQIPAEKTLWNGQLKNSELKEKQQ

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_10|480_bp
atgctttttaaggcattggtggtggagggtcagcagcatccagagcctgcttctgtgggc
ccctcaccacctgcctacccacacagaaggctggtatgtcaatcgcatatccaagcagtg
acttatctggccaattccagagcagaaagcacggaacagccagaaccttggagagagagc
tctgtgctaaccaggatccccctgctcttaggagacagtgcaaaagagtgtgagcccctc
tggtggggcagcatagaagggatcaggctcagggtctttgcaaaattttcagggttgagt
cagcagaagtttgttgggggcaaaagacagataggtgtttcccatgtacaagaagtgggc
cacgaaggagagcaagctgctctcaaagagagctgtttgggggtgctacagattccagct
gaaaaaaccctatggaatggacagctgaaaaactcagagctgaaggagaaacagcaatag

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_11|288_aa
MSSRKQGSQPRGQQSAEEENFKKPTRSNMQRSKMRGASSGKKTAGPQQKNLEPALPGRWG
GRSAENPPSGSVRKTRKNKQKTPGNGDGGSTSEAPQPPRKKRARADPTVESEEAFKNRME
VKVKIPEELKPWLVEDWDLVTRQKQLFQLPAKKNVDAILEEYANCKKSQGNVDNKEYAVN
EVVAGIKEYFNVMLGTQLLYKFERPQYAEILLAHPDAPMSQVYGAPHLLRLFVRIGAMLA
YTPLDEKSLALLLGYLHDFLKYLAKNSASLFTASDYKVASAEYHRKAL

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_11|867_bp
atgagttccagaaagcagggttctcaacctcgtggacagcaatctgcagaagaagagaac
ttcaaaaaaccaactagaagcaacatgcagagaagtaaaatgagaggggcctcctcagga
aagaagacagctggtccacagcagaaaaatcttgaaccagctctcccaggaagatggggt
ggtcgctctgcagagaaccccccttcaggatccgtgaggaagaccagaaagaacaagcag
aagactcctggaaacggagatggtggcagtaccagcgaagcacctcagccccctcggaag
aaaagggcccgggcagaccccactgttgaaagtgaggaggcgtttaagaatagaatggag
gttaaagtgaagattcctgaagaattaaaaccatggcttgttgaggactgggacttagtt
accaggcagaagcagctgtttcaactccctgccaagaaaaatgtagatgcaattctggag
gagtatgcaaattgcaagaaatcgcagggaaatgttgataataaggaatatgcggttaat
gaagttgtggcaggaataaaagaatatttcaatgtgatgttgggcactcagctgctctac
aaatttgagaggccccagtatgctgaaatcctcttggctcaccctgatgctccaatgtcc
caggtttatggagcaccacacctactgagattatttgtaagaattggagcaatgttggcc
tatacgccccttgatgagaaaagccttgcattattgttgggctatttgcatgatttccta
aaatatctggcaaagaattctgcatctctctttactgccagtgattacaaagtggcttct
gctgagtaccaccgcaaagccctgtga

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_12|90_aa
MMGILDRMLEQELGDVGPGARYIPYHTTLGMNDIHDAYRYYRSANQKGEGGGKQNPRIPG
TKQPILCWDLAASSGRLERYKPYSQNFLEN

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_12|273_bp
atgatgggtatactagacagaatgttagagcaagagttaggagatgtgggtcccggtgcc
agatatatcccctatcacactaccttgggaatgaacgatatccatgatgcctatagatat
taccgttcagcaaatcagaaaggggaggggggagggaaacaaaatcccagaatcccaggg
accaaacagccaatcctttgctgggatttggcggcatctagtggcagactcgagaggtat
aaaccttactcacagaactttctggaaaattaa

>gi568815575f:103509065_103709664|GENSCAN_predicted_peptide_13|89_aa
XSALKDPSLRNTFRSGQTEMTVCIVLQEAIALQEEDIIQESRFYFRGYGLGHCLQARDGG
PMEGSGIYSPQPPAPLLREGETTRKLYVD

>gi568815575f:103509065_103709664|GENSCAN_predicted_CDS_13|270_bp
ntttcagccctgaaagacccaagtcttaggaacaccttccgttctggacaaaccgagatg
acagtttgcatcgtcctccaagaggccattgcacttcaggaggaagatatcatccaagaa
agtcgtttctatttccgtggctatggcttgggccactgcctgcaggcaagagatggaggt
ccaatggaaggttctggcatttatagtccccaacctccagcccctcttctaagggaagga
gaaaccacgcggaaactctacgtggactga