GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:17:06 Sequence gi568815575r:47347433_47567481 : 220049 bp : 44.70% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2916 2926 11 2 2 84 87 17 0.048 1.07 1.02 Intr + 18943 19069 127 1 1 56 70 124 0.324 8.08 1.03 Term + 33353 33850 498 1 0 109 42 1125 0.999 104.52 1.04 PlyA + 33954 33959 6 1.05 2.00 Prom + 49474 49513 40 -4.86 2.01 Init + 50366 50497 132 1 0 56 97 61 0.589 3.95 2.02 Intr + 62844 62970 127 2 1 26 68 175 0.808 9.55 2.03 Intr + 63248 63343 96 0 0 91 111 114 0.991 13.98 2.04 Term + 64937 66162 1226 0 2 46 49 833 0.985 66.99 2.05 PlyA + 70887 70892 6 1.05 3.03 PlyA - 71729 71724 6 1.05 3.02 Term - 91668 91339 330 0 0 -37 48 347 0.948 12.86 3.01 Init - 96119 96117 3 2 0 98 53 0 0.168 -2.50 3.00 Prom - 96256 96217 40 -4.76 4.07 PlyA - 96568 96563 6 -0.45 4.06 Term - 102042 99998 2045 1 2 133 54 1400 0.829 126.47 4.05 Intr - 108686 108489 198 0 0 81 109 72 0.895 7.92 4.04 Intr - 108966 108840 127 0 1 98 94 163 0.884 18.15 4.03 Intr - 123633 123576 58 2 1 62 92 49 0.005 1.59 4.02 Intr - 135871 135692 180 0 0 69 52 132 0.002 6.68 4.01 Init - 149307 149207 101 0 2 60 27 102 0.119 1.03 4.00 Prom - 163997 163958 40 -6.16 5.02 PlyA - 164013 164008 6 1.05 5.01 Sngl - 166141 165170 972 1 0 99 40 897 0.996 80.75 5.00 Prom - 182364 182325 40 -5.66 6.00 Prom + 190757 190796 40 -3.56 6.01 Init + 209051 209080 30 1 0 94 93 38 0.671 3.02 6.02 Intr + 213291 213384 94 2 1 50 53 91 0.742 1.34 6.03 Intr + 213697 213987 291 2 0 3 23 250 0.181 7.11 6.04 Intr + 215531 215631 101 0 2 12 89 122 0.112 4.53 6.05 Intr + 215794 215897 104 0 2 47 84 170 0.289 11.47 6.06 Intr + 217365 217467 103 0 1 112 59 182 0.956 17.88 6.07 Intr + 217553 217707 155 1 2 64 53 135 0.977 6.47 6.08 Intr + 217820 217918 99 2 0 78 91 29 0.685 1.43 6.09 Intr + 219207 219348 142 0 1 74 77 168 0.845 14.66 6.10 Intr + 219452 219479 28 1 1 133 101 1 0.876 3.59 6.11 Intr + 219554 219699 146 0 2 98 56 103 0.997 8.10 6.12 Intr + 219798 220000 203 2 2 61 94 306 0.954 26.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_1|211_aa MIEFPLYGMGAKMAAPKRHDIAQPYSSPDNATHGPSLRRSVTALCSEEQEKQEEEDEQER KEEEKEEEEEQEEEKEKEEQEEEKEKEEQEEEKEKEEEQEEQEKEKEEQEEEEKEKEEEE EEEQEEEEQEEEKEKEEEQEEEEQEEEEKEKEEEQKEKEQEEEEQEEEEKEKEEEQKEKE QEEEKQQEEEQQEEEEQEEEQGAIIILERQF >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_1|636_bp atgattgaattcccactgtacggaatgggtgctaagatggcggctcccaagagacacgac atagcgcagccatattcctcgcctgacaacgccactcatggaccaagtctccgccgatcc gtaacagcgctgtgttcagaggaacaggagaagcaggaggaggaggatgagcaggagaga aaggaggaggagaaagaggaggaagaggagcaggaggaggagaaagagaaggaggagcag gaggaggagaaagagaaggaggagcaggaggaggagaaagagaaggaggaggaacaggag gagcaggagaaagaaaaggaggagcaggaggaggaggagaaagagaaggaggaggaggag gaagaggagcaggaggaggaggagcaggaggaggagaaagagaaggaggaagagcaggag gaggaggagcaggaggaggaggagaaagagaaggaggaagagcagaaggagaaagagcag gaggaggaggagcaggaggaggaggagaaagagaaggaggaagagcagaaggagaaagag caggaggaggagaagcagcaggaggaggagcagcaggaggaagaggagcaggaggaggag cagggagcaattattattttagagagacagttttaa >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_2|526_aa MRSLKWPCGSLSFQFNEITVVSLGGSFPPFGTETPTPVEHNIMKGSVSFEDVAVDFTRQE WHRLDPAQRTMHKDVMLETYSNLASVGLCVAKPEMIFKLERGEELWILEEESSGHGYSGS LSLLCGNGSVGDNALRHDNDLLHHQKIQTLDQNVEYNGCRKAFHEKTGFVRRKRTPRGDK NFECHECGKAYCRKSNLVEHLRIHTGERPYECGECAKTFSARSYLIAHQKTHTGERPFEC NECGKSFGRKSQLILHTRTHTGERPYECTECGKTFSEKATLTIHQRTHTGEKPYECSECG KTFRVKISLTQHHRTHTGEKPYECGECGKNFRAKKSLNQHQRIHTGEKPYECGECGKFFR MKMTLNNHQRTHTGEKPYQCNECGKSFRVHSSLGIHQRIHTGEKPYECNECGNAFYVKAR LIEHQRMHSGEKPYECSECGKIFSMKKSLCQHRRTHTGEKPYECSECGNAFYVKVRLIEH QRIHTGERPFECQECGKAFCRKAHLTEHQRTHIGWSWRCTMKKASH >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_2|1581_bp atgagaagccttaagtggccgtgtggcagtcttagcttccagttcaatgaaatcactgtt gtatctcttggtggaagctttcctccttttggaacagagacccctacaccagtggagcat aatatcatgaaggggtccgtgtcattcgaggatgtggctgtggatttcacccgacaggag tggcacagactggaccctgctcagaggaccatgcacaaggatgtgatgctggagacctac agcaacctggcatctgtgggcctctgcgtggccaaaccagagatgatcttcaagttggag cgaggagaagagctgtggatattagaggaggaatcctcaggccatggttactcaggatct ctctcactgctgtgtggcaatggttctgttggggataatgccctcaggcatgataatgac cttcttcaccatcagaagattcaaacattggatcaaaatgttgaatataatggatgcagg aaagccttccatgagaaaacaggctttgttagacgtaaaagaacacccagaggagataaa aactttgaatgtcatgaatgtgggaaagcttactgtaggaagtcaaaccttgttgaacat ctgagaatacacacaggagagagaccctatgaatgcggtgaatgtgcaaaaaccttcagt gcaagatcatacctcattgctcatcagaaaactcacacaggggagaggccctttgaatgt aatgaatgtgggaaatcttttggcaggaagtcacaactcatcctacatacaagaacacac actggagagagaccctatgaatgtactgaatgtgggaaaaccttttctgagaaggcaacc ctcacgattcatcagagaactcacacaggggagaaaccctatgaatgtagtgaatgtggg aaaacatttcgtgtaaagatatcccttacccaacaccacagaactcatacaggggagaaa ccttatgaatgtggggagtgtgggaaaaacttccgtgcaaagaaatccctaaatcagcat caaagaattcacacaggtgagaaaccctatgagtgtggtgaatgtgggaaattcttccga atgaagatgactctcaataatcatcaaagaactcacacaggtgaaaagccctatcagtgt aatgaatgtgggaaatctttcagggtgcactcatctcttgggatccatcagagaattcac acaggagagaaaccttacgaatgtaatgagtgtggtaatgctttctatgtgaaagcacgc ctaattgaacatcagaggatgcattcaggagagaaaccctacgaatgtagtgaatgtggg aaaatcttcagtatgaagaaatccctttgtcaacaccggagaactcacacaggagagaaa ccttatgaatgtagtgaatgtggaaatgccttctatgtgaaagtacgcctcattgaacat cagcgaattcacacaggagagagaccctttgagtgtcaagaatgtgggaaagctttctgc cggaaagcacacctcacagaacatcagagaactcacataggctggtcctggcgttgtaca atgaagaaagcctctcactga >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_3|110_aa MHLEAVEESDEEEDAKSLSLSGKQSAPGSSSKYPHKLKLATDEDEYDEEDDEDGEHNEET EEKAPVEKSIRGTPAKNTQKSNEMKMTHYHQHQDQKSRILSKTGKNLLKH >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_3|333_bp atgcacttagaagctgtggaggagtcagatgaagaggaggatgccaaatccctaagtcta tctggaaagcaatctgcccctggaagcagcagcaagtatccacataaactaaaacttgct actgatgaagatgaatatgatgaggaagatgatgaagatggtgaacataatgaggaaact gaagaaaaagccccagtggagaaatctatacgaggcactccagccaaaaatacacaaaaa tcaaacgagatgaaaatgactcactaccatcaacaccaagatcagaagtcaagaatcctt tcaaaaacaggaaaaaacctcctaaaacattga >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_4|902_aa MDVESTAGEDSEEVKKMVEKVYIILEKTCIIISISTVPGAESPQPSAPTEVKLSLLATSA SWPKSDTHDSPRGNCTGSVTAETPSVSRSRPGSRWTVDLNVKAKMIKLLHDPEASVSFED VTVDFSKEEWQHLDPAQRRLYWDVTLENYSHLLSVVKCVYFSEKQRESVTERPLKPNQLD PIVYHVLLTGYQIPKSEAAFKLEQGEGPWMLEGEAPHQSCSGEAIGKMQQQGIPGGIFFH CERFDQPIGEDSLCSILEELWQDNDQLEQRQENQNNLLSHVKVLIKERGYEHKNIEKIIH VTTKLVPSIKRLHNCDTILKHTLNSHNHNRNSATKNLGKIFGNGNNFPHSPSSTKNENAK TGANSCEHDHYEKHLSHKQAPTHHQKIHPEEKLYVCTECVMGFTQKSHLFEHQRIHAGEK SRECDKSNKVFPQKPQVDVHPSVYTGEKPYLCTQCGKVFTLKSNLITHQKIHTGQKPYKC SECGKAFFQRSDLFRHLRIHTGEKPYECSECGKGFSQNSDLSIHQKTHTGEKHYECNECG KAFTRKSALRMHQRIHTGEKPYVCADCGKAFIQKSHFNTHQRIHTGEKPYECSDCGKSFT KKSQLHVHQRIHTGEKPYICTECGKVFTHRTNLTTHQKTHTGEKPYMCAECGKAFTDQSN LIKHQKTHTGEKPYKCNGCGKAFIWKSRLKIHQKSHIGERHYECKDCGKAFIQKSTLSVH QRIHTGEKPYVCPECGKAFIQKSHFIAHHRIHTGEKPYECSDCGKCFTKKSQLRVHQKIH TGEKPNICAECGKAFTDRSNLITHQKIHTREKPYECGDCGKTFTWKSRLNIHQKSHTGER HYECSKCGKAFIQKATLSMHQIIHTGKKPYACTECQKAFTDRSNLIKHQKMHSGEKRYKA SD >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_4|2709_bp atggatgttgaaagcactgctggtgaggactcagaagaagtaaagaagatggtagagaaa gtctatatcatcttagagaagacatgtatcatcataagcatatccacggtcccgggcgct gaatccccgcaaccctctgcgcccacagaggttaaactctcgctgctggcgacttccgct tcctggcctaaatctgacacgcacgactccccccgcggcaactgcacaggttcggtgaca gccgagacgccgagtgtttcccgcagccggccggggtccagatggactgtggacctaaat gtgaaagctaaaatgataaaacttttacatgaccctgaggcttcagtgtcatttgaggac gtgactgtggacttcagcaaggaggagtggcagcacttggaccctgcccagagacgcctg tactgggatgtgacactagagaactacagccacctgctctcagtggtgaaatgtgtttac ttttctgaaaagcaaagggaaagtgtcactgaaaggcccctgaaacccaatcagctggac ccaatcgtgtatcatgtcctgttaacagggtaccaaattcccaagtcagaggctgccttc aagttggagcaaggagaggggccatggatgctggagggggaagccccacatcagagctgt tcaggtgaggctattgggaaaatgcagcaacagggaattcctggaggaattttcttccac tgtgagagatttgatcaacccataggagaagattcattatgttctattttagaagaactg tggcaagataatgaccagctagagcaacgtcaggaaaaccagaataaccttttaagtcat gtgaaagtattgattaaggagaggggctatgaacataaaaacattgaaaaaataattcat gtgactaccaagcttgttccttcaattaaaagactccataactgtgacacaattttgaag catactttaaactcacataatcataatagaaacagtgcaacaaagaaccttggcaagatt tttggaaatggtaacaatttcccccatagcccttcctctactaagaatgagaatgctaaa acaggagcaaattcctgtgaacatgaccactatgaaaaacatctcagccacaaacaagct cccacccaccatcagaaaattcatcctgaggagaagctttatgtgtgtactgaatgtgta atgggcttcactcagaagtcacatctgtttgagcatcagagaattcatgctggagaaaag tcccgtgaatgtgacaaaagcaacaaagtcttcccccagaaaccccaggttgatgtacat ccaagtgtttatacaggagaaaaaccctatctgtgtactcaatgtgggaaagtctttacc ctcaaatcaaacctcattacacatcaaaaaattcataccgggcagaaaccctacaaatgc agtgaatgtggaaaagcctttttccagagatcagacctctttagacatctgagaattcat acaggagaaaaaccttatgaatgcagtgaatgtggaaaaggcttctcccagaactcagac ctcagtatacatcagaaaactcataccggagagaaacactatgaatgcaatgaatgtggg aaggctttcacaagaaaatcagcactcaggatgcatcagagaatccacacgggagagaaa ccttatgtatgcgctgactgtgggaaggccttcatccagaaatcacatttcaacacacat cagagaattcatactggagaaaagccgtatgaatgcagtgactgtgggaaatccttcact aagaagtcacaactccatgtgcatcaaagaattcacaccggagagaaaccctatatatgt acagaatgtggaaaggtcttcactcacaggacaaacctcaccacacatcagaaaactcat actggggaaaaaccctatatgtgtgctgaatgtggaaaggcttttactgaccagtcaaat ctcattaaacaccagaaaactcacactggagagaaaccctataagtgcaatggctgtgga aaagccttcatatggaagtcgcgcctcaaaatacatcagaaatctcatattggagagaga cactatgaatgcaaggactgcgggaaagccttcatccagaaatcaacactaagcgtgcat cagagaatccatacaggagagaaaccgtacgtttgtcctgaatgcgggaaggcctttatc cagaaatcgcacttcattgcgcatcatagaatccatactggagagaagccttatgaatgc agcgactgtgggaaatgcttcactaagaagtcacaactccgtgtgcatcagaaaatccac acaggtgagaagcccaatatatgtgctgaatgtggaaaggccttcactgaccgatcaaat ctcataacacatcagaaaatccacactagggagaaaccctatgaatgtggtgactgcggg aaaaccttcacctggaagtcacgcctcaatatacatcagaagtctcatactggagaaaga cactatgaatgtagtaaatgtgggaaagctttcatccagaaagccacactaagtatgcat cagataattcatacaggaaagaaaccttatgcttgtacagaatgtcagaaggcctttact gacagatcgaatctcattaaacaccagaaaatgcatagtggagaaaaacgctataaagcc agtgactga >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_5|323_aa MARAAAGADKKPSRCGSGREGEGLSEGHKSMTGLYELVWRVLHALLCLHRTLTSWLRVRF GTWNWIWRRCCRAASAAVLAPLGFTLRKPPAVGRNRRHHRHPRGGSCLAAAHHRMRWRAD GRSLEKLPVHMGLVITEVEQEPSFSDIASLVVWCMAVGISYISVYDHQGIFKRNNSRLMD EILKQQQELLGLDCSKYSPEFANSNDKDDQVLNCHLAVKVLSPEDGKADIVRAAQDFCQL VAQKQKRPTDLDVDTLGSLLSSNGCPDPDLVLKFGPVDSTLGFLPWHIRLTEIVSLPSHL NISYEDFFSALRQYAACEQRLGK >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_5|972_bp atggcccgcgcggccgcaggggcggataaaaagccgtcgcgctgcgggagtgggcgggag ggagaggggttgtccgagggccacaagagtatgacggggctgtacgagctggtgtggcgg gtgctgcacgcgctgctctgtctgcaccgcacgctcacctcctggctccgcgttcggttc ggcacctggaactggatctggcggcgctgctgccgagccgcctctgccgcggtcctagcg ccgctcggcttcacgctccgcaagcccccggcagtcggcaggaaccgccgtcaccaccgg cacccgcgcggggggtcgtgcctggcagccgcacaccaccggatgcgctggcgcgcggac ggtcgttccttggagaagctgcctgtgcatatgggcctggtgatcaccgaggtggagcag gaacccagcttctcggacatcgcgagcctcgtggtgtggtgtatggccgtgggcatctcc tacattagcgtctacgaccaccaaggtattttcaaaagaaataattccagattgatggat gaaattttaaaacaacagcaagaacttctgggcctagattgttcaaaatactcaccagaa tttgcaaatagtaatgacaaagacgatcaagttttaaattgccatttggcagtgaaggtg ctgtctccggaagatggaaaagcagatattgtaagagctgctcaggacttttgccagtta gtggcccagaagcaaaagagacccacagatttggatgtagatacgttaggcagtttactt agttcaaatggttgtcctgatcctgatttagtattgaagttcggtcctgtggacagcaca ttaggctttcttccctggcacatcagattgactgagattgtctctttgccttcccaccta aacatcagttatgaggactttttctctgcccttcgtcaatatgcagcctgtgaacagcgt ctgggaaagtag >gi568815575r:47347433_47567481|GENSCAN_predicted_peptide_6|499_aa MALSRPQQGPRPAVLKGYELQSPRAGEHRRSLVLIPTHFKDGWKAESRRANNESPRGDGG GLCEETRREAQDGDGGGCSGVTGEGGPGRARFLERLPGTGREPGPKAQLQDGCAWAPAFP ARNRRSGLTRGETIVDSGGSMEPPRGPPANGAEPSRAVGTVKVYLPNKQRTVVTVRDGMS VYDSLDKALKVRGLNQDCCVVYRLIKGRKTVTAWDTAIAPLDGEELIVEVLEDVPLTMHN FVRKTFFSLAFCDFCLKFLFHGFRCQTCGYKFHQHCSSKVPTVCVDMSTNRQQFYHSVQD LSGGSRQHEAPSNRPLNELLTPQGPSPRTQHCDPEHFPFPAPANAPLQRIRSTSTPNVHM VSTTAPMDSNLIQLTGQSFSTDAAGSRGGSDGTPRGSPSPASVSSGRKSPHSKSPAEQRE RKSLADDKKKVKNLGYRDSGYYWEVPPSEVQLLKRIGTGSFGTVFRGRWHGDVAVKVLKV SQPTAEQAQAFKNEMQVLS >gi568815575r:47347433_47567481|GENSCAN_predicted_CDS_6|1497_bp atggcgctctccaggccgcagcaggggccgagacccgctgtcctcaaaggctatgaacta cagtctcctagagctggtgaacatcgccgatcattggttctgattccaacccatttcaaa gatgggtggaaggctgagtcccgcagagccaataacgagagtccgagaggcgacggaggc ggactctgtgaggaaacaagaagagaggcccaagatggagacggcggcggctgtagcggc gtgacaggtgagggcgggcccgggagggctcggtttctggagcggctgccgggcacgggc agggagcccggaccgaaagctcagctccaggatggctgcgcctgggccccggcgttccct gcccggaaccggaggagtggtttgacccggggcgagaccatcgtcgacagcgggggctcc atggagccaccacggggcccccctgccaatggggccgagccatcccgggcagtgggcacc gtcaaagtatacctgcccaacaagcaacgcacggtggtgactgtccgggatggcatgagt gtctacgactctctagacaaggccctgaaggtgcggggtctaaatcaggactgctgtgtg gtctaccgactcatcaagggacgaaagacggtcactgcctgggacacagccattgctccc ctggatggcgaggagctcattgtcgaggtccttgaagatgtcccgctgaccatgcacaat tttgtacggaagaccttcttcagcctggcgttctgtgacttctgccttaagtttctgttc catggcttccgttgccaaacctgtggctacaagttccaccagcattgttcctccaaggtc cccacagtctgtgttgacatgagtaccaaccgccaacagttctaccacagtgtccaggat ttgtccggaggctccagacagcatgaggctccctcgaaccgccccctgaatgagttgcta accccccagggtcccagcccccgcacccagcactgtgacccggagcacttccccttccct gccccagccaatgcccccctacagcgcatccgctccacgtccactcccaacgtccatatg gtcagcaccacggcccccatggactccaacctcatccagctcactggccagagtttcagc actgatgctgccggtagtagaggaggtagtgatggaaccccccgggggagccccagccca gccagcgtgtcctcggggaggaagtccccacattccaagtcaccagcagagcagcgcgag cggaagtccttggccgatgacaagaagaaagtgaagaacctggggtaccgggactcaggc tattactgggaggtaccacccagtgaggtgcagctgctgaagaggatcgggacgggctcg tttggcaccgtgtttcgagggcggtggcatggcgatgtggccgtgaaggtgctcaaggtg tcccagcccacagctgagcaggcccaggctttcaagaatgagatgcaggtgctcagn