GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:32:01 Sequence gi568815575f:47270669_47513591 : 242923 bp : 44.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1470 1465 6 -0.45 1.02 Term - 3147 2698 450 0 0 37 48 239 0.862 9.99 1.01 Init - 3788 3672 117 2 0 61 75 114 0.633 5.60 1.00 Prom - 7594 7555 40 -5.06 2.00 Prom + 9956 9995 40 -5.86 2.01 Init + 19432 19486 55 0 1 78 106 21 0.575 4.35 2.02 Term + 30222 30400 179 1 2 -47 40 249 0.324 4.45 2.03 PlyA + 35146 35151 6 1.05 3.04 PlyA - 36054 36049 6 1.05 3.03 Term - 42425 42409 17 0 2 134 42 27 0.713 1.10 3.02 Intr - 42659 42566 94 2 1 85 80 24 0.727 0.84 3.01 Init - 50861 50709 153 2 0 109 55 153 0.981 14.08 3.00 Prom - 62376 62337 40 -2.36 4.00 Prom + 75071 75110 40 -5.36 4.01 Init + 79680 79690 11 2 2 84 87 17 0.057 1.07 4.02 Intr + 95707 95833 127 1 1 56 70 124 0.295 8.08 4.03 Term + 110117 110614 498 1 0 109 42 1125 0.999 104.52 4.04 PlyA + 110718 110723 6 1.05 5.00 Prom + 126238 126277 40 -4.86 5.01 Init + 127130 127261 132 1 0 56 97 61 0.589 3.95 5.02 Intr + 139608 139734 127 2 1 26 68 175 0.808 9.55 5.03 Intr + 140012 140107 96 0 0 91 111 114 0.991 13.98 5.04 Term + 141701 142926 1226 0 2 46 49 833 0.985 66.99 5.05 PlyA + 147651 147656 6 1.05 6.03 PlyA - 148493 148488 6 1.05 6.02 Term - 168432 168103 330 0 0 -37 48 347 0.948 12.86 6.01 Init - 172883 172881 3 2 0 98 53 0 0.168 -2.50 6.00 Prom - 173020 172981 40 -4.76 7.07 PlyA - 173332 173327 6 -0.45 7.06 Term - 178806 176762 2045 1 2 133 54 1400 0.829 126.47 7.05 Intr - 185450 185253 198 0 0 81 109 72 0.895 7.92 7.04 Intr - 185730 185604 127 0 1 98 94 163 0.884 18.15 7.03 Intr - 200397 200340 58 2 1 62 92 49 0.005 1.59 7.02 Intr - 212635 212456 180 0 0 69 52 132 0.002 6.68 7.01 Init - 226071 225971 101 0 2 60 27 102 0.119 1.03 7.00 Prom - 240761 240722 40 -6.16 8.02 PlyA - 240777 240772 6 1.05 8.01 Sngl - 242905 241934 972 1 0 99 40 897 0.996 80.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_1|188_aa MAGRAGGRGLEQLLHLIAAASTMSRVLVPCHVKGTVALQHTSAHTPAKWVTCLWDYCLMP NPHSEEGAQEYVSLFKQQILCDMARISELHLILQQPSPLWLSFTVEELQIYQQGPKSPSM IFPKWLSHPVPCEQPALLHEGLPDPSRVSSEVQQMWALTEMIRASHTSARIGHFDVDGCY DLNLLSYT >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_1|567_bp atggcaggaagggctggaggtcggggcctggagcagctattgcacttaatcgcggctgct agcaccatgtcccgcgttttggtgccttgccatgtgaaaggcaccgtagccctgcagcac acctcagcacacacaccggccaagtgggtgacctgcctgtgggactactgtctgatgccc aacccacacagtgaggagggagcccaggagtatgtgtcgctgttcaagcaacagatactg tgtgacatggccagaatatcggagctacacctgattctgcagcagccatcaccactgtgg ctgtctttcacagtggaggagctgcagatctatcagcagggaccaaagagcccctccatg atcttccccaagtggctctcccacccagtgccctgtgagcaacctgcactcctccatgag ggtctcccagaccccagcagggtatcctctgaggtgcagcagatgtgggcactgacagag atgatccgggccagtcacacctccgcgaggataggccactttgatgtagatggctgttat gacctgaacttactctcctacacttga >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_2|77_aa MAKFAVYNQKYPTNPRKEKGNSFEEEEGEEEEEEEEEERGEGEEEEEEEEEKGEGEEEET IWLILGQSHIASTYLHD >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_2|234_bp atggcaaaatttgctgtctataatcagaaatatcccactaatccaaggaaagaaaagggg aactccttcgaagaagaagaaggagaagaagaagaagaagaagaagaagaagaaagagga gaaggagaggaggaggaggaggaggaggaggaaaaaggggaaggggaagaagaagaaaca atatggttaattttaggccagagtcacatagctagcacatatctccatgactaa >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_3|87_aa MAAPEEGYSVGAAILFEVKAKDAEGRVAPLSGGLCSLKSPPTPLVYNRKSPLNLKKCDKP TLLCCCLTTAWPLDKLASNVKPRHCYM >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_3|264_bp atggcggcgcccgaggaaggctacagtgtgggcgccgccattttgtttgaggttaaggca aaggatgctgaaggacgagtagcaccgctgtcgggagggctatgttcgttgaaatcgccc cctactcccctggtctacaaccgcaaatccccactgaatctaaagaaatgtgacaagcca actttactgtgctgctgcttaacaactgcttggccgcttgacaaattggcatccaacgtg aagccaaggcactgctacatgtaa >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_4|211_aa MIEFPLYGMGAKMAAPKRHDIAQPYSSPDNATHGPSLRRSVTALCSEEQEKQEEEDEQER KEEEKEEEEEQEEEKEKEEQEEEKEKEEQEEEKEKEEEQEEQEKEKEEQEEEEKEKEEEE EEEQEEEEQEEEKEKEEEQEEEEQEEEEKEKEEEQKEKEQEEEEQEEEEKEKEEEQKEKE QEEEKQQEEEQQEEEEQEEEQGAIIILERQF >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_4|636_bp atgattgaattcccactgtacggaatgggtgctaagatggcggctcccaagagacacgac atagcgcagccatattcctcgcctgacaacgccactcatggaccaagtctccgccgatcc gtaacagcgctgtgttcagaggaacaggagaagcaggaggaggaggatgagcaggagaga aaggaggaggagaaagaggaggaagaggagcaggaggaggagaaagagaaggaggagcag gaggaggagaaagagaaggaggagcaggaggaggagaaagagaaggaggaggaacaggag gagcaggagaaagaaaaggaggagcaggaggaggaggagaaagagaaggaggaggaggag gaagaggagcaggaggaggaggagcaggaggaggagaaagagaaggaggaagagcaggag gaggaggagcaggaggaggaggagaaagagaaggaggaagagcagaaggagaaagagcag gaggaggaggagcaggaggaggaggagaaagagaaggaggaagagcagaaggagaaagag caggaggaggagaagcagcaggaggaggagcagcaggaggaagaggagcaggaggaggag cagggagcaattattattttagagagacagttttaa >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_5|526_aa MRSLKWPCGSLSFQFNEITVVSLGGSFPPFGTETPTPVEHNIMKGSVSFEDVAVDFTRQE WHRLDPAQRTMHKDVMLETYSNLASVGLCVAKPEMIFKLERGEELWILEEESSGHGYSGS LSLLCGNGSVGDNALRHDNDLLHHQKIQTLDQNVEYNGCRKAFHEKTGFVRRKRTPRGDK NFECHECGKAYCRKSNLVEHLRIHTGERPYECGECAKTFSARSYLIAHQKTHTGERPFEC NECGKSFGRKSQLILHTRTHTGERPYECTECGKTFSEKATLTIHQRTHTGEKPYECSECG KTFRVKISLTQHHRTHTGEKPYECGECGKNFRAKKSLNQHQRIHTGEKPYECGECGKFFR MKMTLNNHQRTHTGEKPYQCNECGKSFRVHSSLGIHQRIHTGEKPYECNECGNAFYVKAR LIEHQRMHSGEKPYECSECGKIFSMKKSLCQHRRTHTGEKPYECSECGNAFYVKVRLIEH QRIHTGERPFECQECGKAFCRKAHLTEHQRTHIGWSWRCTMKKASH >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_5|1581_bp atgagaagccttaagtggccgtgtggcagtcttagcttccagttcaatgaaatcactgtt gtatctcttggtggaagctttcctccttttggaacagagacccctacaccagtggagcat aatatcatgaaggggtccgtgtcattcgaggatgtggctgtggatttcacccgacaggag tggcacagactggaccctgctcagaggaccatgcacaaggatgtgatgctggagacctac agcaacctggcatctgtgggcctctgcgtggccaaaccagagatgatcttcaagttggag cgaggagaagagctgtggatattagaggaggaatcctcaggccatggttactcaggatct ctctcactgctgtgtggcaatggttctgttggggataatgccctcaggcatgataatgac cttcttcaccatcagaagattcaaacattggatcaaaatgttgaatataatggatgcagg aaagccttccatgagaaaacaggctttgttagacgtaaaagaacacccagaggagataaa aactttgaatgtcatgaatgtgggaaagcttactgtaggaagtcaaaccttgttgaacat ctgagaatacacacaggagagagaccctatgaatgcggtgaatgtgcaaaaaccttcagt gcaagatcatacctcattgctcatcagaaaactcacacaggggagaggccctttgaatgt aatgaatgtgggaaatcttttggcaggaagtcacaactcatcctacatacaagaacacac actggagagagaccctatgaatgtactgaatgtgggaaaaccttttctgagaaggcaacc ctcacgattcatcagagaactcacacaggggagaaaccctatgaatgtagtgaatgtggg aaaacatttcgtgtaaagatatcccttacccaacaccacagaactcatacaggggagaaa ccttatgaatgtggggagtgtgggaaaaacttccgtgcaaagaaatccctaaatcagcat caaagaattcacacaggtgagaaaccctatgagtgtggtgaatgtgggaaattcttccga atgaagatgactctcaataatcatcaaagaactcacacaggtgaaaagccctatcagtgt aatgaatgtgggaaatctttcagggtgcactcatctcttgggatccatcagagaattcac acaggagagaaaccttacgaatgtaatgagtgtggtaatgctttctatgtgaaagcacgc ctaattgaacatcagaggatgcattcaggagagaaaccctacgaatgtagtgaatgtggg aaaatcttcagtatgaagaaatccctttgtcaacaccggagaactcacacaggagagaaa ccttatgaatgtagtgaatgtggaaatgccttctatgtgaaagtacgcctcattgaacat cagcgaattcacacaggagagagaccctttgagtgtcaagaatgtgggaaagctttctgc cggaaagcacacctcacagaacatcagagaactcacataggctggtcctggcgttgtaca atgaagaaagcctctcactga >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_6|110_aa MHLEAVEESDEEEDAKSLSLSGKQSAPGSSSKYPHKLKLATDEDEYDEEDDEDGEHNEET EEKAPVEKSIRGTPAKNTQKSNEMKMTHYHQHQDQKSRILSKTGKNLLKH >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_6|333_bp atgcacttagaagctgtggaggagtcagatgaagaggaggatgccaaatccctaagtcta tctggaaagcaatctgcccctggaagcagcagcaagtatccacataaactaaaacttgct actgatgaagatgaatatgatgaggaagatgatgaagatggtgaacataatgaggaaact gaagaaaaagccccagtggagaaatctatacgaggcactccagccaaaaatacacaaaaa tcaaacgagatgaaaatgactcactaccatcaacaccaagatcagaagtcaagaatcctt tcaaaaacaggaaaaaacctcctaaaacattga >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_7|902_aa MDVESTAGEDSEEVKKMVEKVYIILEKTCIIISISTVPGAESPQPSAPTEVKLSLLATSA SWPKSDTHDSPRGNCTGSVTAETPSVSRSRPGSRWTVDLNVKAKMIKLLHDPEASVSFED VTVDFSKEEWQHLDPAQRRLYWDVTLENYSHLLSVVKCVYFSEKQRESVTERPLKPNQLD PIVYHVLLTGYQIPKSEAAFKLEQGEGPWMLEGEAPHQSCSGEAIGKMQQQGIPGGIFFH CERFDQPIGEDSLCSILEELWQDNDQLEQRQENQNNLLSHVKVLIKERGYEHKNIEKIIH VTTKLVPSIKRLHNCDTILKHTLNSHNHNRNSATKNLGKIFGNGNNFPHSPSSTKNENAK TGANSCEHDHYEKHLSHKQAPTHHQKIHPEEKLYVCTECVMGFTQKSHLFEHQRIHAGEK SRECDKSNKVFPQKPQVDVHPSVYTGEKPYLCTQCGKVFTLKSNLITHQKIHTGQKPYKC SECGKAFFQRSDLFRHLRIHTGEKPYECSECGKGFSQNSDLSIHQKTHTGEKHYECNECG KAFTRKSALRMHQRIHTGEKPYVCADCGKAFIQKSHFNTHQRIHTGEKPYECSDCGKSFT KKSQLHVHQRIHTGEKPYICTECGKVFTHRTNLTTHQKTHTGEKPYMCAECGKAFTDQSN LIKHQKTHTGEKPYKCNGCGKAFIWKSRLKIHQKSHIGERHYECKDCGKAFIQKSTLSVH QRIHTGEKPYVCPECGKAFIQKSHFIAHHRIHTGEKPYECSDCGKCFTKKSQLRVHQKIH TGEKPNICAECGKAFTDRSNLITHQKIHTREKPYECGDCGKTFTWKSRLNIHQKSHTGER HYECSKCGKAFIQKATLSMHQIIHTGKKPYACTECQKAFTDRSNLIKHQKMHSGEKRYKA SD >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_7|2709_bp atggatgttgaaagcactgctggtgaggactcagaagaagtaaagaagatggtagagaaa gtctatatcatcttagagaagacatgtatcatcataagcatatccacggtcccgggcgct gaatccccgcaaccctctgcgcccacagaggttaaactctcgctgctggcgacttccgct tcctggcctaaatctgacacgcacgactccccccgcggcaactgcacaggttcggtgaca gccgagacgccgagtgtttcccgcagccggccggggtccagatggactgtggacctaaat gtgaaagctaaaatgataaaacttttacatgaccctgaggcttcagtgtcatttgaggac gtgactgtggacttcagcaaggaggagtggcagcacttggaccctgcccagagacgcctg tactgggatgtgacactagagaactacagccacctgctctcagtggtgaaatgtgtttac ttttctgaaaagcaaagggaaagtgtcactgaaaggcccctgaaacccaatcagctggac ccaatcgtgtatcatgtcctgttaacagggtaccaaattcccaagtcagaggctgccttc aagttggagcaaggagaggggccatggatgctggagggggaagccccacatcagagctgt tcaggtgaggctattgggaaaatgcagcaacagggaattcctggaggaattttcttccac tgtgagagatttgatcaacccataggagaagattcattatgttctattttagaagaactg tggcaagataatgaccagctagagcaacgtcaggaaaaccagaataaccttttaagtcat gtgaaagtattgattaaggagaggggctatgaacataaaaacattgaaaaaataattcat gtgactaccaagcttgttccttcaattaaaagactccataactgtgacacaattttgaag catactttaaactcacataatcataatagaaacagtgcaacaaagaaccttggcaagatt tttggaaatggtaacaatttcccccatagcccttcctctactaagaatgagaatgctaaa acaggagcaaattcctgtgaacatgaccactatgaaaaacatctcagccacaaacaagct cccacccaccatcagaaaattcatcctgaggagaagctttatgtgtgtactgaatgtgta atgggcttcactcagaagtcacatctgtttgagcatcagagaattcatgctggagaaaag tcccgtgaatgtgacaaaagcaacaaagtcttcccccagaaaccccaggttgatgtacat ccaagtgtttatacaggagaaaaaccctatctgtgtactcaatgtgggaaagtctttacc ctcaaatcaaacctcattacacatcaaaaaattcataccgggcagaaaccctacaaatgc agtgaatgtggaaaagcctttttccagagatcagacctctttagacatctgagaattcat acaggagaaaaaccttatgaatgcagtgaatgtggaaaaggcttctcccagaactcagac ctcagtatacatcagaaaactcataccggagagaaacactatgaatgcaatgaatgtggg aaggctttcacaagaaaatcagcactcaggatgcatcagagaatccacacgggagagaaa ccttatgtatgcgctgactgtgggaaggccttcatccagaaatcacatttcaacacacat cagagaattcatactggagaaaagccgtatgaatgcagtgactgtgggaaatccttcact aagaagtcacaactccatgtgcatcaaagaattcacaccggagagaaaccctatatatgt acagaatgtggaaaggtcttcactcacaggacaaacctcaccacacatcagaaaactcat actggggaaaaaccctatatgtgtgctgaatgtggaaaggcttttactgaccagtcaaat ctcattaaacaccagaaaactcacactggagagaaaccctataagtgcaatggctgtgga aaagccttcatatggaagtcgcgcctcaaaatacatcagaaatctcatattggagagaga cactatgaatgcaaggactgcgggaaagccttcatccagaaatcaacactaagcgtgcat cagagaatccatacaggagagaaaccgtacgtttgtcctgaatgcgggaaggcctttatc cagaaatcgcacttcattgcgcatcatagaatccatactggagagaagccttatgaatgc agcgactgtgggaaatgcttcactaagaagtcacaactccgtgtgcatcagaaaatccac acaggtgagaagcccaatatatgtgctgaatgtggaaaggccttcactgaccgatcaaat ctcataacacatcagaaaatccacactagggagaaaccctatgaatgtggtgactgcggg aaaaccttcacctggaagtcacgcctcaatatacatcagaagtctcatactggagaaaga cactatgaatgtagtaaatgtgggaaagctttcatccagaaagccacactaagtatgcat cagataattcatacaggaaagaaaccttatgcttgtacagaatgtcagaaggcctttact gacagatcgaatctcattaaacaccagaaaatgcatagtggagaaaaacgctataaagcc agtgactga >gi568815575f:47270669_47513591|GENSCAN_predicted_peptide_8|323_aa MARAAAGADKKPSRCGSGREGEGLSEGHKSMTGLYELVWRVLHALLCLHRTLTSWLRVRF GTWNWIWRRCCRAASAAVLAPLGFTLRKPPAVGRNRRHHRHPRGGSCLAAAHHRMRWRAD GRSLEKLPVHMGLVITEVEQEPSFSDIASLVVWCMAVGISYISVYDHQGIFKRNNSRLMD EILKQQQELLGLDCSKYSPEFANSNDKDDQVLNCHLAVKVLSPEDGKADIVRAAQDFCQL VAQKQKRPTDLDVDTLGSLLSSNGCPDPDLVLKFGPVDSTLGFLPWHIRLTEIVSLPSHL NISYEDFFSALRQYAACEQRLGK >gi568815575f:47270669_47513591|GENSCAN_predicted_CDS_8|972_bp atggcccgcgcggccgcaggggcggataaaaagccgtcgcgctgcgggagtgggcgggag ggagaggggttgtccgagggccacaagagtatgacggggctgtacgagctggtgtggcgg gtgctgcacgcgctgctctgtctgcaccgcacgctcacctcctggctccgcgttcggttc ggcacctggaactggatctggcggcgctgctgccgagccgcctctgccgcggtcctagcg ccgctcggcttcacgctccgcaagcccccggcagtcggcaggaaccgccgtcaccaccgg cacccgcgcggggggtcgtgcctggcagccgcacaccaccggatgcgctggcgcgcggac ggtcgttccttggagaagctgcctgtgcatatgggcctggtgatcaccgaggtggagcag gaacccagcttctcggacatcgcgagcctcgtggtgtggtgtatggccgtgggcatctcc tacattagcgtctacgaccaccaaggtattttcaaaagaaataattccagattgatggat gaaattttaaaacaacagcaagaacttctgggcctagattgttcaaaatactcaccagaa tttgcaaatagtaatgacaaagacgatcaagttttaaattgccatttggcagtgaaggtg ctgtctccggaagatggaaaagcagatattgtaagagctgctcaggacttttgccagtta gtggcccagaagcaaaagagacccacagatttggatgtagatacgttaggcagtttactt agttcaaatggttgtcctgatcctgatttagtattgaagttcggtcctgtggacagcaca ttaggctttcttccctggcacatcagattgactgagattgtctctttgccttcccaccta aacatcagttatgaggactttttctctgcccttcgtcaatatgcagcctgtgaacagcgt ctgggaaagtag