GENSCAN 1.0	Date run: 4-Nov-116	Time: 00:29:25

Sequence gi568815595r:63910671_64123419 : 212749 bp : 41.25% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1929   2253  325  2  1   54  113   477 0.982  42.34
 1.02 Term +   2487   2740  254  1  2   98   39   146 0.542   5.52
 1.03 PlyA +   6788   6793    6                               1.05

 2.03 PlyA -   7939   7934    6                               1.05
 2.02 Term -   8425   8232  194  2  2   -5   42   247 0.313   7.40
 2.01 Init -  13680  13539  142  0  1   86   26   138 0.535   7.74
 2.00 Prom -  16478  16439   40                              -3.75

 3.00 Prom +  37723  37762   40                              -3.75
 3.01 Init +  41711  41813  103  1  1   88  115    33 0.753   6.45
 3.02 Intr +  45179  45284  106  0  1   35   85   114 0.089   4.15
 3.03 Intr +  53765  53790   26  2  2  112   30    30 0.004  -3.65
 3.04 Intr +  55846  55913   68  2  2  101   37    77 0.001   1.61
 3.05 Intr +  62383  62433   51  0  0   69   38    98 0.003   1.09
 3.06 Intr +  64011  64080   70  2  1   30  100    77 0.165   0.94
 3.07 Intr +  69245  69497  253  0  1  112   82    91 0.408   6.47
 3.08 Intr +  71516  71775  260  2  2   78   93   149 0.386  10.58
 3.09 Intr +  72269  72351   83  0  2   80  100    69 0.997   5.74
 3.10 Intr +  77389  77654  266  0  2  112  121   150 0.999  16.09
 3.11 Intr +  78510  78629  120  0  0   59   75    70 0.736   1.49
 3.12 Intr +  79506  79704  199  0  1   45   84   215 0.962  15.23
 3.13 Intr +  80068  80189  122  0  2   76   81    69 0.990   3.37
 3.14 Intr +  84835  85813  979  1  1  104   91   548 0.929  46.70
 3.15 Intr +  86951  87017   67  1  1   88   54    50 0.868  -0.84
 3.16 Intr +  89938  89999   62  2  2   80  101   110 0.660   9.03
 3.17 Term +  99027  99080   54  2  0   94   42    37 0.058  -3.72
 3.18 PlyA +  99312  99317    6                               1.05

 4.08 PlyA -  99907  99902    6                               1.05
 4.07 Term - 100094  99998   97  1  1   78   42    63 0.368  -2.84
 4.06 Intr - 102937 102769  169  2  1   67   89   238 0.859  19.88
 4.05 Intr - 108037 107929  109  1  1   42   82    96 0.962   3.34
 4.04 Intr - 108367 108148  220  0  1   89   90   179 0.994  15.58
 4.03 Intr - 108771 108626  146  0  2   91   75   107 0.997   7.86
 4.02 Intr - 111853 111648  206  2  2   64   87   366 0.995  32.10
 4.01 Init - 112749 112605  145  0  1  105   51   313 0.995  29.63
 4.00 Prom - 121462 121423   40                              -6.05

 5.00 Prom + 124319 124358   40                              -4.35
 5.01 Init + 125957 126041   85  1  1   43   64    52 0.036  -0.67
 5.02 Intr + 128675 128805  131  0  2  109   85    33 0.029   4.59
 5.03 Intr + 129421 129628  208  0  1  102   47    78 0.033   2.83
 5.04 Term + 151593 151711  119  1  2  118   48    61 0.739   2.82
 5.05 PlyA + 151844 151849    6                               1.05

 6.05 PlyA - 151952 151947    6                               1.05
 6.04 Term - 167913 167714  200  1  2    0   48   211 0.850   4.88
 6.03 Intr - 168565 168445  121  1  1   52   73    75 0.470   1.55
 6.02 Intr - 176326 176207  120  1  0   81  110    51 0.579   6.37
 6.01 Init - 181039 180956   84  1  0   59  116    53 0.905   6.07
 6.00 Prom - 187008 186969   40                              -6.15

 7.04 PlyA - 187730 187725    6                              -0.45
 7.03 Term - 189255 188381  875  1  2  121   34   948 0.850  84.48
 7.02 Intr - 198415 198269  147  2  0   37  -14   189 0.001   2.89
 7.01 Init - 200638 200590   49  1  1   84   61    33 0.003   1.36

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  49761  49549  213  0  0   79   49   152 0.817   6.65
S.002 Init - 126516 126442   75  0  0   53   70   101 0.921   5.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_1|192_aa
MSERAADDVRGEPRRAAAAAGGAAAAAARQQQQQQQQQQPPPPQPQRQQHPPPPPRRTRP
EDGGPGAASTSAAAMATVGERRPLPSPEVMLGQSWNLWVEASKLPGKDGTELDESFKEFG
KNREVMGLCREGESSPPDGVCTNPWEVSLTVHWDREHQPTIPTPRLPVPAKMLPEEGGRG
QSAWKVWFGGLL

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_1|579_bp
atgtcggagcgggccgcggatgacgtcaggggggagccgcgccgcgcggcggcggcggcg
ggcggagcagcggccgcggccgcccggcagcagcagcagcagcagcagcagcagcagccg
ccgcctccgcagccccagcggcagcagcacccgccaccgccgccacggcgcacacggccg
gaggacggcgggcccggcgccgcctccacctcggccgccgcaatggcgacggtcggggag
cgcaggcctctgcccagtcctgaagtgatgctgggacagtcgtggaatctgtgggttgag
gcttccaaacttcctgggaaggacgggacagaattggacgaaagtttcaaggagtttggg
aaaaaccgcgaagtcatggggctctgtcgggaaggtgagtccagcccccctgatggagtt
tgtacaaacccctgggaagtttcattgacagttcactgggaccgggaacatcagcccacc
ataccgactccccgactccccgtgcctgcgaagatgctgcctgaggagggagggaggggg
cagagcgcttggaaagtttggtttgggggcctcctgtaa

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_2|111_aa
MLQIVYSTADTTAAHECPLSTSKAVWPKSELLTLSLPHSPAFLSSLSGKIWPTRCNWGKV
PATGGSFWVSGGSCTYALVKSLQGRRHAQELSARTMPTVTATLILIYGGVS

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_2|336_bp
atgctccagattgtgtattcaactgctgacaccacagccgcacatgaatgcccactaagt
acgtcaaaagcagtgtggccaaaatcagagctgctgaccctaagtctcccacattcgcca
gcattcctcagttcattaagtggcaagatttggcccacacgctgcaactggggaaaagtc
cctgctacaggagggtctttctgggtctctgggggttcctgcacctacgcgttagtgaag
agcttacaaggaaggagacacgcacaggagctgtcagcaaggacaatgcccactgtcacg
gccacgctgattctcatctatggaggtgtttcttaa

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_3|962_aa
MPIFGFCPAHDDFYLVVCNDCNQVVKPQAFQSHYAGSSVKTTATRTRKHATVAASLVKSV
SVGGKTILNCWYDWSPYKVSSKGFHSFGLYERYCKESGGKKQPCKVGLATEDAANEDGHG
AWHVVSTQYTVAAVVDVVVLEERRHSSSSKPPLAVPPTSVFSFFPSLSKSKGGSASGSNR
SSSGGVLSASSSSSKLLKSPKEKLQLRGNTRPMHPIQQSRVPHGRIMTPSVKVEKIHPKM
DGTLLKSAVGPTCPATVSSLVKPGLNCPSIPKPTLPSPGQILNGKGLPAPPTLEKKPEDN
SNNRKFLNKRLSEREFDPDIHCGVIDLDTKKPCTRSLTCKTHSLTQRRAVQGRRKRFDVL
LAEHKNKTREKELIRHPDSQQPPQPLRDPHPAPPRTSQEPHQNPHGVIPSESKPFVASKP
KPHTPSLPSMLEEPSEEAPQSVVRTWPEISSPSPFFGHLLIEVFASRKMPPGCPAQQGGS
APIDPPPVHESPHPPLPATEPASRLSSEEGEGDDKEESVEKLDCHYSGHHPQPASFCTFG
SRQIGRGYYVFDSRWNRLRCALNLMVEKHLNAQLWKKIPPVPSTTSPISTRIPHRTNSVP
TSQCGVSYLAAATVSTSPVLLSSTCISPNSKSVPAHGTTLNAQPAASGAMDPVCSMQSRQ
VSSSSSSPSTPSGLSSVPSSPMSRKPQKLKSSKSLRPKESSGNSTNCQNASSSTSGGSGK
KRKNSSPLLVHSSSSSSSSSSSSHSMESFRKNCVAHSGPPYPSTVTSSHSIGLNCVTNKA
NAVNVRHDQSGRGPPTGSPAESIKRMSVMVNSSDSTLSLGPFIHQSNELPVNSHGSFSHS
HTPLDKLIGKKRKCSPSSSSINNSSSKPTKVAKVPAVNNVHMKHTGTIPGAQGLMNSSLL
HQDISSPCLRTGISATSPQSPDLKYVYLDVVGDESRHQGKGVGENQIQLPVQSMPPNAVS
ET

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_3|2889_bp
atgccaatatttggtttctgtccagcccatgatgatttctacttggtggtgtgtaacgac
tgtaatcaggttgtcaaaccgcaggcatttcaatcacattatgcaggcagttccgttaaa
accactgctaccaggacaagaaaacatgctaccgtggcagcaagtcttgtaaaatcagtc
agtgttggtggtaaaactattctgaattgttggtatgactggtcaccttataaagtgtct
tctaagggcttccatagttttggattgtatgaacgttactgtaaggaatctgggggaaaa
aaacagccctgcaaagtaggcttggcaactgaagacgctgccaacgaggatggacatggt
gcctggcatgtggtttctacacagtacacagtagcagctgttgttgatgttgttgttctt
gaagaaagaagacatagctcatccagcaagccgcctttggccgttcctcccacttcagta
ttttccttcttcccttctctgtccaaaagcaaaggaggcagtgcaagtggaagcaaccgt
tcttccagtggaggtgttcttagcgcatcctcatcaagttccaagttgttgaaatcaccc
aaagagaaactgcagctcagggggaacaccaggccaatgcatcccattcagcaaagtaga
gttccccatggtagaatcatgacaccctctgtgaaagtggaaaagattcatccgaaaatg
gatggcacactactgaaatctgcggtggggccaacctgtcctgctactgtgagttcctta
gtcaagcctggccttaactgcccctcaataccaaagccaaccttgccttcacctggacag
attctgaatggcaaagggcttcctgcaccgcccactctggaaaagaaacctgaagacaat
tccaataataggaaatttttaaataagagattatcagaaagagagtttgatcctgacatc
cactgtggggttattgatctcgacaccaagaagccctgcacccggtctttgacatgcaag
acacattccttaacccagcgcagggctgtccagggtagaagaaaacgatttgatgtgtta
ttagccgagcacaaaaacaaaaccagggaaaaggaattgattcgccatccggactctcag
caaccaccgcagcctctcagggacccgcatcccgcccctcctagaacgtcacaggagccg
caccaaaaccctcacggagtgattccttccgaatcaaagccttttgtagctagtaaacct
aaacctcacacccccagtcttccaagtatgctggaggagccatcagaggaggctcctcag
agtgtggtcaggacctggccagagatttcctccccaagtcctttctttggtcatcttctc
attgaagtttttgcatctagaaaaatgcctccaggctgccctgctcagcaaggtgggagt
gcccccattgaccctcctccagtccatgaatctccacaccctcccctgcctgccactgag
ccagcttctcggttatccagtgaggagggcgaaggcgatgacaaagaagagtctgttgaa
aaactggactgtcattattcaggtcatcatcctcagccagcatctttttgcacatttggg
agccggcagataggaagaggctattacgtgtttgactccaggtggaatcgacttcgctgc
gccctcaacctcatggtggagaagcatctgaatgcacagctatggaagaaaatcccacca
gtgcccagtaccacctcacccatctccacacgtattcctcaccggacaaactctgtgccg
acatcacaatgtggagtcagctatctggcagcagccaccgtctctacatccccagtcctg
ctctcatctacctgcatctccccaaatagcaaatcggtaccagctcatggaaccacacta
aatgcacagcctgctgcttcaggggcgatggatcctgtgtgcagtatgcaatccagacaa
gtgtcctcttcatcctcatccccttccacgccctctggcctttcctcggttccttcctcc
cccatgtccaggaaacctcagaaattgaaatccagcaaatctttgaggcccaaggagtct
tctggtaacagcactaactgtcaaaatgccagtagcagtaccagtggcggctcaggaaag
aaacgcaaaaacagttccccactgttggttcactcttcctcctcctcttcctcctcctcc
tcttcttctcattccatggagtcttttaggaaaaactgtgtggctcactctgggcctccc
tacccctcaacggtaacatcttcccatagcatcggcctcaactgtgtgacgaataaagca
aatgcggtgaacgtccggcatgaccagtcagggaggggcccccccaccgggagccctgct
gaatccatcaagaggatgagtgtgatggtgaacagcagtgattctactctttctcttggg
ccattcattcaccagtccaatgaactgcctgtcaactcccacggcagtttttcccactca
cacactcctctagacaaactcataggaaagaaaagaaagtgctcacccagctcgagcagc
atcaacaacagcagcagcaaacccacaaaggttgccaaagtgccagccgtgaacaatgtc
cacatgaaacacacaggcaccatcccaggggcacaaggactgatgaacagttccctcctt
catcaggatatctcctcaccttgcttacgaacaggaatttcagcaacatcaccccagagc
cctgacttaaaatatgtttatttggatgtggttggggacgagagcagacaccaaggaaag
ggagttggagagaatcaaatccagctgcctgtacaatccatgccaccgaatgccgtaagt
gagacgtaa

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_4|363_aa
MPLENLEEEGLPKNPDLRIAQLRFLLSLPEHRGDAAVRDELMAAVRDNNMAPYYEALCKS
LDWQIDVDLLNKMKKANEDELKRLDEELEDAEKNLGESEIRDAMMAKAEYLCRIGDKEGA
LTAFRKTYDKTVALGHRLDIVFYLLRIGLFYMDNDLITRNTEKAKSLIEEGGDWDRRNRL
KVYQGLYCVAIRDFKQAAELFLDTVSTFTSYELMDYKTFVTYTVYVSMIALERPDLREKV
IKGAEILEVLHSLPAVRQYLFSLYECRYSVFFQSLAVVEQEMKKDWLFAPHYRYYVREMR
IHAYSQLLESYRSLTLGYMAEAFGVGVEFIDQPDSKNWQYQETIKKGDLLLNRVQKLSRV
INM

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_4|1092_bp
atgccgctggagaacctggaggaggagggtctgcccaagaaccccgacttgcgtatcgcg
cagctgcgcttcctgctcagcctgcccgagcaccgcggagacgctgccgtgcgcgacgag
ctgatggcggccgtccgcgataacaacatggctccttactatgaagccttgtgcaaatcc
ctcgactggcagatagacgtggacctactcaataaaatgaagaaggcaaatgaagatgag
ttgaagcgtttggatgaggagctggaagatgcagagaagaatctaggagagagcgaaatt
cgcgatgcaatgatggcaaaggccgagtacctctgccggataggtgacaaagagggagct
ctgacagcctttcgcaagacatatgacaaaactgtggccctgggtcaccgattggatatt
gtattctatctccttaggattggcttattttatatggataatgatctcatcacacgaaac
acagaaaaggccaaaagcttaatagaagaaggaggagactgggacaggagaaaccgccta
aaagtgtatcagggtctttattgtgtggctattcgtgatttcaaacaggcagctgaactc
ttccttgacactgtttcaacatttacatcctatgaactcatggattataaaacatttgtg
acttatactgtctatgtcagtatgattgccttagaaagaccagatctcagggaaaaggtc
attaaaggagcagagattcttgaagtgttgcacagtcttccagcagttcggcagtatctg
ttttcactctatgaatgccgttactctgttttcttccaatcattagcggttgtggaacag
gaaatgaaaaaggactggctttttgcccctcattatcgatactatgtaagagaaatgaga
attcatgcatacagtcagctgctggaatcatataggtcattaacccttggctatatggca
gaagcgtttggtgttggtgtggaattcattgatcaacctgatagcaagaactggcagtac
caagaaactatcaagaaaggagatctgctactaaacagagttcaaaaactttccagagta
attaatatgtaa

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_5|180_aa
MLVLGDASPLSDNPPEAGTLGRKVIAPKDIKSLHWLSFNPAHPDLQTLHSGTVARTVGCA
AVLGVRRCVGLLINSVQCLYSDESLDHQAVPQMAMLHCPETDCPYTVTFSPLCLTALRRM
SWNVTEPSQNSASPPKQSKAKGWSIVHLGRNPRIQRALFTNLKEMARGYLPVCLMKIIQK

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_5|543_bp
atgcttgtgcttggcgacgcaagccccctgagtgacaatcctccagaagcaggcacttta
gggaggaaagtcattgctccaaaagatataaaatctctacactggctctcttttaatcct
gcccatcctgatttgcaaacccttcacagtggcactgtggctagaacagttggctgtgct
gctgtgctgggtgttcgcaggtgtgtgggtttgctgatcaacagtgttcagtgtctgtat
tctgatgaatcactcgaccaccaagcagtgcctcagatggccatgttgcattgccctgag
acagactgcccttacacagtgacattttccccgctatgtctcacagctttgaggcgtatg
tcatggaatgtcacagagccaagtcagaactctgcttccccgccaaagcaaagcaaagca
aaggggtggtccattgttcacttgggcagaaaccctcggattcagagggctctttttact
aatctaaaggaaatggcaagagggtatctccctgtctgccttatgaagataatacagaaa
tga

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_6|174_aa
MRQLAVGVEAKRMYGEEPGGGVQHRSNKDRVASALNEFFSPGLQGPQRSSSYYFMDCMNS
TQLVTSWQSDWRKTESKTLEQAAGVEPGTQGQHIASPCFGAEVAGFQGKKGERDKQTQRH
TEGKAIRRRGSYWSYAARAKGCQKPTEAGGVKEGFSPRTLEGTVALLTLSGLQN

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_6|525_bp
atgaggcagctggcagtgggagtggaagctaagagaatgtatggggaggaaccaggaggt
ggagtccagcacaggagcaacaaggaccgtgttgcttctgccctgaatgagttcttttct
cccgggctgcagggaccacagaggtcttcttcctattacttcatggactgtatgaattcc
acccagctggtgacatcatggcagtcggattggaggaagacagagagcaagacgttggaa
caagcagcaggggtggagcctgggacacaaggtcagcacatagcaagcccttgctttgga
gcagaggtggccggtttccagggcaagaaaggagagagagacaaacagacccagagacac
acagaggggaaggccatcagaagacgaggcagctattggagttacgcagccagagccaag
ggatgccagaagccaacagaagctggaggagtcaaggaaggattctcccctagaaccctg
gaagggactgtggccctgctgacgctttctgggctccaaaactga

>gi568815595r:63910671_64123419|GENSCAN_predicted_peptide_7|356_aa
MRKSALKVTDKYGVKDRTELPLGYGQAAGGGQDLLEQRSSVGGDFALQGHLALFGDISGL
SQPEGCLSADGGAKRQEHLSRFSMPDLSKDSGMNVSEKLSNMGTLNSSMQFRSAESVRSL
LSAQQYQEMEGNLHQLSNPIGYRDLQSHGRMHQSFDFDGGMAGSKLPGQEGVRIQPMSER
TRRRATSRDDNRRFRPHRSRRSRRSRSDNALHLASEREAISRLKDRPPLRAREDYDQFMR
QRSFQESMGHGSRRDLYGQCPRTVSDLALQNAFGDRWGPYFAEYDWCSTCSSSSESDNEG
YFLGEPIPQPARLRYVTSDELLHKYSSYGLPKSSTLGGRGQLHSRKRQKSKNCIIS

>gi568815595r:63910671_64123419|GENSCAN_predicted_CDS_7|1071_bp
atgcgtaaaagtgctttaaaagtgacagataaatatggagtaaaggacagaacagagctc
cccttgggctatggtcaagcagcaggtggaggacaggatctgttagagcagcggtcctca
gttgggggcgattttgccctccagggacatttggcactgtttggagacatttcggggctg
tcacaaccagagggatgcctctctgctgatggtggtgccaagcgccaggagcacctatcc
cgattttccatgcctgacctcagcaaagactctggaatgaatgtgtctgagaagctgagc
aacatgggcactcttaactcgtccatgcagttccggagcgcagagtcagttcgcagcctg
ctctctgcccagcagtaccaggagatggagggaaacctccaccagctcagcaaccccatt
ggctacagagacctgcagtcccacggaaggatgcatcagagctttgattttgatggaggg
atggcgggcagcaagctgccagggcaggagggcgtgaggatccagcccatgagtgaacgc
acccggagaagagctacttcacgcgacgacaaccgccgtttccgacctcacaggtccagg
cgttcccgacgctctcgctccgacaacgccctccacctggccagcgaacgcgaggccatc
tcccggttaaaagataggccccctctgagagccagggaggactatgaccaatttatgcgc
cagcggagcttccaggagagcatggggcatgggtcccggagggacctgtacggccagtgc
cctaggactgtgtcggacctggctttgcagaatgcctttggggaccgctggggaccctac
ttcgccgagtatgattggtgttccacctgctcctcctcttcagagtctgacaacgagggc
tatttcctaggagaacccatcccccagccagcgcgcctgcgatacgtcacaagcgatgag
ctgctgcacaaatacagctcctacggcctccccaaatcttccacattaggtggcagagga
cagttgcacagcaggaaaagacagaagagcaaaaactgtatcatttcttaa