GENSCAN 1.0	Date run: 4-Nov-116	Time: 08:57:11

Sequence gi568815595r:45727356_45938052 : 210697 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4698   4796   99  0  0   75  121    43 0.703   5.53
 1.02 Intr +  11223  11316   94  0  1   70  108    21 0.699   2.27
 1.03 Intr +  11426  11518   93  1  0  121   50     5 0.402   0.16
 1.04 Intr +  12232  12289   58  0  1  107  119   -27 0.648   0.76
 1.05 Term +  16178  16314  137  0  2   60   38    85 0.533  -1.12
 1.06 PlyA +  17007  17012    6                                1.05

 2.00 Prom +  18280  18319   40                               -4.96
 2.01 Init +  19642  19722   81  0  0  103   75    87 0.523   8.05
 2.02 Intr +  20353  20524  172  0  1   -4   77   151 0.659   4.32
 2.03 Intr +  20676  20825  150  1  0   83   70    70 0.876   4.83
 2.04 Intr +  24423  24472   50  1  2  110   96   -18 0.245  -0.40
 2.05 Intr +  26467  26603  137  0  2   98  103   -14 0.135   0.47
 2.06 Term +  27032  27074   43  2  1  130   49    41 0.172   1.03
 2.07 PlyA +  29243  29248    6                                1.05

 3.14 PlyA -  29410  29405    6                                1.05
 3.13 Term -  31772  31623  150  2  0  114   50   197 0.996  16.41
 3.12 Intr -  32482  32402   81  1  0   77   69    47 0.619   1.53
 3.11 Intr -  32667  32502  166  2  1   86   70   191 0.948  17.06
 3.10 Intr -  35717  35558  160  0  1   61   78   385 0.958  33.85
 3.09 Intr -  38386  38182  205  1  1   89   76   311 0.999  28.77
 3.08 Intr -  43016  42854  163  1  1   71  107   225 0.955  22.68
 3.07 Intr -  44103  43862  242  0  2   88   74   502 0.675  45.15
 3.06 Intr -  45260  45150  111  2  0  104  102   142 0.998  17.78
 3.05 Intr -  48633  48406  228  0  0  101   81   243 0.933  22.97
 3.04 Intr -  52745  52654   92  0  2  104   99   140 0.421  16.41
 3.03 Intr -  54868  54728  141  2  0   68   96   243 0.990  23.42
 3.02 Intr -  66237  66187   51  1  0  102   81    13 0.215   0.88
 3.01 Init -  69064  68944  121  1  1  106  105   247 0.998  28.55
 3.00 Prom -  69433  69394   40                               -7.06

 4.12 PlyA -  71329  71324    6                                1.05
 4.11 Term -  80021  79841  181  1  1   48   49   101 0.038  -0.72
 4.10 Intr - 100104 100001  104  0  2   26   72   121 0.504   3.27
 4.09 Intr - 101260 101084  177  1  0   63   77   104 0.974   6.92
 4.08 Intr - 105766 105695   72  1  0   93   85    32 0.887   3.00
 4.07 Intr - 106943 106883   61  1  1   88   88    54 0.999   4.14
 4.06 Intr - 108429 108235  195  2  0   79   47   260 0.983  19.53
 4.05 Intr - 110696 110572  125  2  2  125   69    42 0.872   5.38
 4.04 Intr - 114658 114634   25  0  1  107  101    38 0.145   5.03
 4.03 Intr - 127719 127631   89  0  2  100   84     1 0.002  -0.33
 4.02 Intr - 134660 134618   43  1  1   68   75    67 0.025   1.64
 4.01 Init - 146356 146310   47  1  2   96   89    42 0.614   5.25
 4.00 Prom - 162684 162645   40                               -6.56

 5.00 Prom + 163411 163450   40                               -3.36
 5.01 Init + 167579 167599   21  1  0   95   79    22 0.547   1.71
 5.02 Intr + 173455 174394  940  0  1  131   72  1090 0.539 102.45
 5.03 Term + 174777 174976  200  1  2   50   48   150 0.917   4.76
 5.04 PlyA + 176291 176296    6                                1.05

 6.03 PlyA - 176638 176633    6                                1.05
 6.02 Term - 177203 177028  176  0  2   71   47   122 0.758   4.32
 6.01 Init - 178982 178745  238  2  1   64   85   120 0.507   5.63
 6.00 Prom - 181594 181555   40                               -1.66

 7.06 PlyA - 182726 182721    6                                1.05
 7.05 Term - 194485 194410   76  0  1  120   33    75 0.767   2.51
 7.04 Intr - 196410 196301  110  0  2  120   97   242 0.948  27.38
 7.03 Intr - 203926 203716  211  0  1   84  116   289 0.995  30.22
 7.02 Intr - 209188 209093   96  0  0   84   95   111 0.958  10.52
 7.01 Intr - 209359 209245  115  2  1   92   39    52 0.262   0.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_1|160_aa
XYIAFDFHKECKNMRWDRLSILLDQVAEMQDELRTGKRTHLGLIMDGWNSMIRYYKNNFS
DGFRQDSIDLFLGNYSVDELESHSPLSVPRDWKFLALPIIMVVAFSMCIICLLMAGDTWT
ETLAYVLFWGVASIGTFFIILYNGKDFVDAPRLVQKEKID

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_1|483_bp
nnatacattgcctttgacttccataaggaatgtaaaaatatgagatgggatcgactaagt
attttattggatcaggtagcagaaatgcaagatgaattaagaactggaaagagaactcat
ttgggacttataatggatggctggaactcaatgatacgatattataagaacaacttttcc
gatggatttagacaagattccatagacttatttcttggaaactattcagtggatgaatta
gaatctcatagtcctttaagtgttccaagggactggaaattcctggctttgcctattatc
atggttgttgccttttcaatgtgcattatctgtttgcttatggctggtgacacttggaca
gaaacactggcctatgtgctcttctggggagttgcaagcattggaacattttttatcatt
ctttacaatggcaaagattttgtcgatgctcccagactggtccagaaagaaaagatagac
tga

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_2|210_aa
MGPARNLNPYPRLRGGMARAACTSMEQELNTYRNTLAAERRYSLQVSSELFYHSIKLLFI
LLILHLSTYFILPGHRTRTQDPPNVPSISKLLGTTVFRTGSRGSCLHACLVQLQPCREPA
PMPALGAAYPTAAAGPLYLWVLGLSILIYLPVTGYFHLIHRHLKLHMVKPLLIFTLQASS
VPHFPHLRKWHQHPPSRAFAVAVTSPGQAF

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_2|633_bp
atgggcccagcaaggaacctgaacccctaccccagactgcgaggaggcatggccagagcc
gcctgcacgtccatggagcaggagctgaacacttatcggaacaccctggctgcggaaagg
aggtactccctgcaggtctcctctgagctgttctatcactcgataaagctcctcttcatc
ttgctcatcctccacttgtccacatacttcattcttcctggccacaggacaagaactcag
gacccgccaaatgttcctagcatctctaagcttctgggcactaccgtcttccgcactggc
agccgtggaagctgcttgcatgcatgcctggtccagctgcagccttgcagagagcccgca
cccatgccagcacttggagctgcctaccccactgcagcagctggtccgctctacttgtgg
gtgttggggctttctattctcatctatttgccggttactggatatttccacctaattcac
agacatctcaaacttcacatggtcaaaccactcttgatttttactctgcaggccagctct
gttccccactttcctcatctccggaaatggcaccagcatccccccagcagggcctttgca
gttgctgtcacctctcctggacaggccttctag

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_3|636_aa
MEKARPLWANSLQFVFACISYAVGLGNVWRFPYLCQMYGGGQARPGMRNVNSEDYIPGSF
LVPYIIMLIVEGMPLLYLELAVGQRMRQGSIGAWRTISPYLSGVGVASVVVSFFLSMYYN
VINAWAFWYLFHSFQDPLPWSVCPLNGNHTGYDEECEKASSTQYFWYRKTLNISPSLQEN
GGVQWEPALCLLLAWLVVYLCILRGTESTGKVVYFTASLPYCVLIIYLIRGLTLHGATNG
LMYMFTPKIEQLANPKAWINAATQIFFSLGLGFGSLIAFASYNEPSNNCQKHAIIVSLIN
SFTSIFASIVTFSIYGFKATFNYENCLKKVSLLLTNTFDLEDGFLTASNLEQVKGYLASA
YPSKYSEMFPQIKNCSLESELDTAVQGTGLAFIVYTEAIKNMEVSQLWSVLYFFMLLMLG
IGSMLGNTAAILTPLTDSKIISSHLPKEAISGLVCLVNCAIGMVFTMEAGNYWFDIFNDY
AATLSLLLIVLVETIAVCYVYGLRRFESDLKAMTGRAVSWYWKVMWAGVSPLLIVSLFVF
YLSDYILTGTLKYQAWDASQGWRWAPMATEKHGPFIFRNQGTISMVWGQLVTKDYPAYAL
AVIGLLVASSTMCIPLAALGTFVQRRLKRGDADPVA

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_3|1911_bp
atggagaaagcgcggccgctgtgggccaactcgctacagttcgtgttcgcctgcatctcg
tacgccgtgggcctgggcaacgtgtggcgattcccgtacctgtgccagatgtacggcgga
ggccaggcccggcccggcatgaggaatgttaattctgaagattacattccaggtagtttc
ctggtcccctacatcatcatgcttatcgtggagggaatgccgctcttgtacctggaactg
gctgtggggcagcgcatgcggcagggcagcatcggcgcctggaggaccatcagcccgtac
ctcagtggtgtcggggtcgccagcgtggtggtctctttcttcctctccatgtactacaac
gtcatcaacgcctgggccttctggtacctcttccactccttccaggatcccctgccgtgg
tctgtctgcccactgaatggtaaccacacgggctacgatgaggagtgtgagaaggcgtcc
tccacacagtacttctggtacaggaaaaccctcaatatctcgccgtccctccaggagaac
gggggtgtgcagtgggagccggcgctgtgcctcctcctggcctggctggtggtgtacctg
tgcatcctgcgtggcaccgagtccactggcaaggtggtgtatttcacggcgtcactgccc
tattgcgtgctcatcatctacctcatcaggggcctcacgctccacggagccaccaatggc
ctcatgtacatgttcactcccaagatagagcagctggccaaccccaaggcctggatcaat
gcagccacccagatcttcttctcacttggcctgggcttcggcagcctgatcgccttcgcc
agctacaatgagccatccaacaactgccagaagcacgccatcatcgtgtccctcatcaac
agcttcacctccatatttgccagcattgtcaccttctccatctatggcttcaaggccacc
ttcaattatgaaaactgcttgaagaaggtgagtctgctgctgaccaacacttttgacctt
gaagatggctttttgacagccagcaacctggagcaggtgaagggctacctcgcatctgcc
tacccaagcaaatacagcgagatgttcccgcaaatcaaaaactgcagcttggaatcggag
ctagacacggccgtccagggcactggcctggcattcatcgtctacacagaggccattaaa
aacatggaggtgtcccagctgtggtcggtgctctacttcttcatgctgctgatgctgggc
attgggagcatgctggggaacacagcggccatcctcacccctctgacagacagcaagatc
atctccagccacctgcccaaggaggccatctcaggtctggtgtgccttgtcaactgtgcc
attggcatggtgttcacgatggaggctgggaactactggtttgacatattcaacgactac
gcggccacactgtccctgctgctcatcgtgctggtggagacgattgccgtgtgctacgtg
tacgggctgaggagatttgaaagtgaccttaaggccatgaccggccgagctgtgagctgg
tactggaaggtgatgtgggctggcgtaagcccactgctgattgtcagcctctttgtcttc
tacctgagcgactacatcctcacggggaccctgaagtatcaagcctgggacgcctcccag
ggctggcgctgggcccccatggccactgaaaagcatgggccattcattttccgaaaccag
gggactatttccatggtgtggggccagctcgtgaccaaagattacccggcctatgcactg
gctgtcatcgggctgcttgtggcctcctccaccatgtgcatccccctggcggccctgggg
acttttgttcagcgtcgcctcaagaggggagacgcagaccccgtggcctga

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_4|372_aa
MEVFQRKEDFIDELLWKNELHKNDKDNDHQGRQRKSIKLYPTHPLVTFPRSWAKFQEKAL
LPPLPAAMAELGLNEHHQNEVINYMRFARSKRGLRLKTVDSCFQDLKESRLVEDTFTIDE
VSEVLNGLQAVVHSEVESELINTAYTNVLLLRQLFAQAEKWYLKLQTDISELENRELLEQ
VAEFEKAEITSSNKKPILDVTKPKLAPLNEGGTAELLNKDFIKAQDLSNLENTVAALKSE
FQKTLNDKTENQKSLEENLATAKHDLLRVQEQLHMAEKELEKKFQQTAAYRNMKEILTKK
NDQIKDLRKRLAQSVARNRATATQSCSLQLPTQCQFCAAMLPSALASSSSDPGGSLRLVI
FAKYSKLLGIAQ

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_4|1119_bp
atggaggtcttccagagaaaagaggactttatagatgagctgttatggaaaaatgagctc
cataagaatgacaaagataatgaccaccagggaaggcagagaaaatccatcaagttgtac
cctacccaccctttggtgacttttccaagaagctgggctaagttccaggagaaagctctg
ctgccgccgctgcctgccgccatggcagagttgggcctaaatgagcaccatcaaaatgaa
gttattaattatatgcgttttgctcgttcaaagagaggcttgagactcaaaactgtagat
tcctgcttccaagacctcaaggagagcaggctggtggaggacaccttcaccatagatgaa
gtctctgaagtcctcaatggattacaagctgtggttcatagtgaggtggaatctgagctc
atcaacactgcctataccaatgtgttacttctgcgacagctgtttgcacaagctgagaag
tggtatcttaagctacagacagacatctctgaacttgaaaaccgagaattattagaacaa
gttgcagaatttgaaaaagcagagattacatcttcaaacaaaaagcccatcttagatgtc
acaaagccaaaacttgctccacttaatgaaggtggaacagcagaactcctaaacaaggat
tttataaaggcccaagacttaagtaacttagaaaacactgtcgctgccttaaagagtgag
tttcagaagacacttaatgacaagacagaaaaccagaagtcactggaggagaatctggcg
acagccaagcacgatctactcagggttcaggagcagctgcacatggctgaaaaggaatta
gaaaagaaatttcagcaaacagcagcttatcgaaacatgaaagagattcttaccaagaag
aatgaccaaatcaaagatctgaggaaaagactggcacaaagtgtagccagaaacagagcc
acagcaacacagagctgtagcctgcagctgccaacccagtgccagttctgtgcagccatg
ttgccttcagctctggccagcagttcctccgatcctggaggctctctgcgtcttgttata
tttgccaaatattctaaattgcttggaatagctcagtga

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_5|386_aa
MTPTDFTSPIPNMADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPLYWLVFI
VGALGNSLVILVYWYCTRVKTMTDMFLLNLAIADLLFLVTLPFWAIAAADQWKFQTFMCK
VVNSMYKMNFYSCVLLIMCISVDRYIAIAQAMRAHTWREKRLLYSKMVCFTIWVLAAALC
IPEILYSQIKEESGIAICTMVYPSDESTKLKSAVLTLKVILGFFLPFVVMACCYTIIIHT
LIQAKKSSKHKALKVTITVLTVFVLSQFPYNCILLVQTIDAYAMFISNCAVSTNIDICFQ
VTQTIAFFHSCLNPVLYVFVGGLRTGTVEHPGFATRRSINAAASGGALGFSPCTVNFCGF
SSHAASSKRGHRSTGCCYRPQKQKVS

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_5|1161_bp
atgacacccacagacttcacaagccctattcctaacatggctgatgactatggctctgaa
tccacatcttccatggaagactacgttaacttcaacttcactgacttctactgtgagaaa
aacaatgtcaggcagtttgcgagccatttcctcccacccttgtactggctcgtgttcatc
gtgggtgccttgggcaacagtcttgttatccttgtctactggtactgcacaagagtgaag
accatgaccgacatgttccttttgaatttggcaattgctgacctcctctttcttgtcact
cttcccttctgggccattgctgctgctgaccagtggaagttccagaccttcatgtgcaag
gtggtcaacagcatgtacaagatgaacttctacagctgtgtgttgctgatcatgtgcatc
agcgtggacaggtacattgccattgcccaggccatgagagcacatacttggagggagaaa
aggcttttgtacagcaaaatggtttgctttaccatctgggtattggcagctgctctctgc
atcccagaaatcttatacagccaaatcaaggaggaatccggcattgctatctgcaccatg
gtttaccctagcgatgagagcaccaaactgaagtcagctgtcttgaccctgaaggtcatt
ctggggttcttccttcccttcgtggtcatggcttgctgctataccatcatcattcacacc
ctgatacaagccaagaagtcttccaagcacaaagccctaaaagtgaccatcactgtcctg
accgtctttgtcttgtctcagtttccctacaactgcattttgttggtgcagaccattgac
gcctatgccatgttcatctccaactgtgccgtttccaccaacattgacatctgcttccag
gtcacccagaccatcgccttcttccacagttgcctgaaccctgttctctatgtttttgtg
ggaggactaaggaccggcactgtggagcaccctggctttgccactcgccggagcatcaat
gccgctgcctctggaggagcccttggattttctccatgcactgtgaacttctgtggcttc
agttctcatgctgcctcttccaaaaggggacacagaagcactggctgctgctacagaccg
caaaagcagaaagtttcgtga

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_6|137_aa
MVLKILGGIVAVLYTMGPLACSWCPSISAECLGYNETGPMLSAFSEEFCDWRPTTLWISC
IPSVLTHPALGGLLELAAPGLSVTTTMGKEEVIATQLSQELCQLSQDAIRATHKAKALDT
KAGKSIGVQSWSSQFLP

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_6|414_bp
atggtgctcaaaattctgggaggtatcgtggctgtcttatacaccatggggcccctggca
tgcagctggtgcccaagcatatctgctgaatgcttaggttataatgagacaggacccatg
ctctcagccttctcagaggagttctgtgactggcgcccaaccacactatggatctcctgc
atacccagtgttcttacccatccagccttaggtgggctcttggaattggcagcccctgga
ctctctgtaacaactacaatgggcaaagaagaagttatagccactcagctctctcaggag
ctgtgtcagctctctcaggatgcaatcagggcaactcacaaggccaaagccttggacacc
aaagctgggaaaagcattggcgtccagtcctggtctagtcagttccttccttga

>gi568815595r:45727356_45938052|GENSCAN_predicted_peptide_7|202_aa
XLPLPTKSARVSVMASMIKGSPMHLDLAACDSKTADPRMDTTSTSLTPEDTEDMPVGQDS
EICLLKSGELMIKVPLTVDEIASFGEGSRELFVRSSTYSLIPITVAEAGLTISWVFSSDP
KSISFSVVFQEAEDTPLDQCKVLIPTTRCNSHKENIQGQLKVRTPGIYMLIFDNTFSRFV
SKKVFYHLTVDRPVIYDGSDFL

>gi568815595r:45727356_45938052|GENSCAN_predicted_CDS_7|609_bp
nccctccctctacccacaaaatctgccagggtctctgtcatggcctctatgattaaagga
tcccccatgcatctggatttggctgcctgtgactccaagactgcggaccccaggatggat
actacatcaacctcgctaacgcctgaggacactgaagacatgcccgtggggcaggattcg
gaaatctgcctgctgaagtctggagaactgatgatcaaagtacccctcacagtggatgag
atcgccagcttcggggagggtagcagggagctgtttgtgaggtccagcacctacagcctg
atccccatcactgtggccgaggcaggcctcaccatcagctgggtcttctcctctgacccc
aagagcatctccttcagtgtggtcttccaggaggccgaggacacaccgctggatcagtgt
aaggtcctcattcccacgacccgatgcaactcccacaaggagaacatccagggccagctc
aaggttcgcacacccggcatctacatgctcatcttcgacaataccttctcaaggtttgtc
tctaaaaaggtattttatcacttgacggttgatcggcctgtgatctacgatggaagtgat
ttcctgtag