GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:46:41 Sequence gi568815597f:28887438_29215794 : 328357 bp : 43.76% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6347 6400 54 1 0 87 92 44 0.757 6.05 1.02 Term + 20068 20139 72 0 0 39 48 84 0.062 -2.59 1.03 PlyA + 22038 22043 6 1.05 2.03 PlyA - 24737 24732 6 1.05 2.02 Term - 27552 27199 354 0 0 102 42 252 0.964 16.49 2.01 Init - 50791 50549 243 1 0 80 12 148 0.046 4.13 2.00 Prom - 71747 71708 40 -3.76 3.02 PlyA - 72748 72743 6 1.05 3.01 Sngl - 95173 94886 288 1 0 94 42 392 0.643 30.39 3.00 Prom - 96552 96513 40 -4.16 4.00 Prom + 97165 97204 40 -8.76 4.01 Init + 100001 100468 468 1 0 68 110 362 0.987 31.99 4.02 Intr + 105893 106105 213 1 0 -4 105 158 0.526 7.21 4.03 Intr + 109778 109882 105 1 0 79 100 16 0.770 2.31 4.04 Intr + 124428 124470 43 2 1 106 103 34 0.622 4.51 4.05 Intr + 130858 131005 148 2 1 57 68 170 0.198 11.19 4.06 Intr + 142963 143050 88 1 1 43 90 46 0.319 0.27 4.07 Intr + 145656 145808 153 2 0 107 77 53 0.986 6.37 4.08 Intr + 148389 148486 98 2 2 82 67 74 0.999 3.51 4.09 Intr + 151817 151989 173 2 2 131 68 139 0.983 15.79 4.10 Intr + 165667 165875 209 2 2 80 95 175 0.943 16.20 4.11 Intr + 177545 177721 177 1 0 76 64 152 0.798 11.72 4.12 Intr + 183336 183441 106 2 1 67 77 -3 0.279 -3.71 4.13 Intr + 187083 187168 86 1 2 59 86 77 0.129 4.14 4.14 Intr + 208718 209167 450 1 0 72 67 326 0.949 22.40 4.15 Intr + 210370 210498 129 0 0 84 91 160 0.997 16.79 4.16 Intr + 218577 218708 132 2 0 100 91 -22 0.520 0.14 4.17 Intr + 221899 222000 102 0 0 99 87 63 0.992 7.67 4.18 Intr + 224931 225011 81 2 0 103 75 70 0.991 7.03 4.19 Term + 228262 228360 99 0 0 117 49 219 0.988 19.03 4.20 PlyA + 229061 229066 6 -0.45 5.03 PlyA - 229695 229690 6 1.05 5.02 Term - 234411 233468 944 1 2 118 49 865 0.852 77.86 5.01 Init - 236662 236419 244 1 1 68 99 78 0.719 3.57 5.00 Prom - 249634 249595 40 -4.26 6.09 PlyA - 250001 249996 6 1.05 6.08 Term - 261789 260973 817 2 1 78 36 501 0.808 36.51 6.07 Intr - 262755 262666 90 2 0 95 91 1 0.523 0.21 6.06 Intr - 267473 267259 215 2 2 110 79 123 0.944 11.11 6.05 Intr - 272049 271937 113 1 2 55 42 60 0.829 -1.80 6.04 Intr - 273080 272938 143 1 2 101 66 163 0.939 15.40 6.03 Intr - 294356 294209 148 0 1 52 76 307 0.067 25.19 6.02 Intr - 304173 304056 118 0 1 117 94 15 0.585 5.04 6.01 Init - 304783 304775 9 1 0 93 77 0 0.547 -0.21 6.00 Prom - 305059 305020 40 -10.15 7.09 PlyA - 305389 305384 6 1.05 7.08 Term - 306742 306585 158 2 2 109 43 169 0.993 12.60 7.07 Intr - 308576 308504 73 2 1 110 109 24 0.966 5.68 7.06 Intr - 308821 308761 61 0 1 84 91 50 0.970 3.64 7.05 Intr - 310722 310525 198 2 0 73 72 76 0.872 2.97 7.04 Intr - 313152 313079 74 0 2 123 94 71 0.985 9.40 7.03 Intr - 314608 314506 103 0 1 71 77 75 0.995 4.88 7.02 Intr - 315796 315694 103 2 1 78 96 158 0.999 14.83 7.01 Intr - 319468 319325 144 2 0 84 91 141 0.988 14.25 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 50791 50501 291 1 0 80 38 139 0.902 3.75 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_1|41_aa MAPWPLLDWLRDPDGCFWTCQETRARGKKIEIVKVDLVDGC >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_1|126_bp atggctccatggcctctcctggactggctgcgtgaccccgacggctgcttctggacatgc caagaaacaagggccagagggaagaagattgaaattgtaaaagtggacctggtggatggt tgttaa >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_2|198_aa MDKTDAKIHMEIQGIARTAKTMLKNKTTVRGLTLTNFKAYHKPTVIKTMWYQHKDKHVDQ RNGTESPEINSYIYIQLVATRRLWPAPRSSGRVPAGDSACTRGSSAGEPAPRPGACRGLQ TKRLQRRRGRQDRPHGDSSTPTAPAPAPPPPPRLTGGSGAGLLRPPLLQPDEQWQEEAPA SEARALCGSRARRLLLAD >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_2|597_bp atggacaaaactgatgctaaaattcatatggaaatacaagggatagccagaacagccaaa acaatgttgaaaaataagaccacagttagaggactcacacttaccaatttcaaagcttac cacaaacctacagtaatcaagacaatgtggtaccagcataaggataaacacgtagatcaa cggaatggaactgagagtccagaaataaactcatacatctatatccagttggttgcaaca aggcggctctggcccgcccctcgctcctcgggccgcgtcccggctggcgactcggcgtgc acacgaggctcctccgcgggagagcccgcgccccggcccggggcctgtcgggggttgcag acaaagaggctgcagcgccgccgcggccgccaggaccgtccccacggggacagctccacg cccaccgccccggctcccgcgccgccgccgccgcctcgcctcaccggtggctccggggcc gggctcctgcgcccgccactgctgcagcccgacgaacaatggcaggaggaggcgccggcg tccgaggctcgggccctctgcggctcgcgggcccgccggctgctgctggctgactga >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_3|95_aa MGKFMKPGKVVLVLAGRYSGCKAVIVKNIDDSTSDCPYSHALVSGIDHYPCKVTAAMGKK KITKGSKIKSFAKVCNYNHLMPKARREAKVKFEKR >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_3|288_bp atgggcaagttcatgaaacctgggaaggtggtgcttgtcctggctggacgctactctgga tgcaaagccgtcatcgtgaagaacattgatgatagcacctcagattgcccctacagccat gctctggtgtctggaattgaccactacccctgcaaagtgacagctgccatgggcaagaag aagatcaccaaggggtcaaagatcaagtcttttgcgaaagtttgtaactacaatcaccta atgcccaaagcccgacgagaggccaaggtcaagtttgaaaagagataa >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_4|1019_aa MTTEKSLVTEAENSQHQQKEEGEEAINSGQQEPQQEESCQTAAEGDNWCEQKLKASNGDT PTHEDLTKNKERTSESRGLSRLFSSFLKRPKSQVSEEEGKEVESDKEKGEGGQKEIEFGT SLDEEIILKAPIAAPEPELKTDPSLDLHSLSSAETQPAQEELREDPDFEIKEGEGLEECS KIEVKEESPQSKAETELKASQKPIRKHRNMHCKVSLLDDTVYECVVEKHAKGQDLLKRVC EHLNLLEEDYFGLAIWDNATSKTWLDSAKEIKKQVRALLGSYTIQSELGDYDPELHGVDY VSDFKLAPNQTKELEEKVMELHKSYRSMTPAQADLEFLENAKKLSMYGVDLHKAKDLEGV DIILGVCSSGLLVYKDKLRINRFPWPKVLKISYKRSSFFIKIRPGEQEQYESTIGFKLPS YRAAKKLWKVCVEHHTFFRLTSTDTIPKSKFLALGSKFRYSGRTQAQTRQASALIDRPAP HFERTASKRASRSLDGAAAVDSADRSPRPTSAPAITQGQVAEGGVLDASAKKTVVPKAQK ETVKAEVKKEDEPPEQAEPEPTEAWKDLDKSQEEIKKHHASISELKKNFMESVPEPRPSE WDKRLSTHSPFRTLNINGQIPTGEGSMYCGVQRSSNGLPKASFSALKFNKFKQIRIKHMA VSPFTVAEYDTKRNTLVQIYESIPEMVKKSINGIRTEEVAVVTKGPSTNPDSEWEGPKHS VVPSKSQMTTSSESLQSFAFGSLSISSKETEEKEEGAAGYLDIKEMPRGPTGGCIGVEEQ ASALKFSVTPASCQLQPGVKKAESSEEHVTPGEPPGKQNGSFLDFHVGNQFPTLIRSFQP PLVKTQTVTISDNANAVKSEIPTKDVPIVHTETKTITYEAAQCWDFRHEPPCLATNLVVG VSFILSKHLRVKLLGHRTNDQYEETLTDDNSGDLDPGVLLTAQTITSETPSSTTTTQITK TVKGGISETRIEKRIVITGDADIDHDQVLVQAIKEAKEQHPDMSVTKVVVHQETEIADE >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_4|3060_bp atgacaacagagaagagtttagtgactgaggccgaaaattcacagcaccaacagaaggaa gagggtgaggaagccataaactcaggccaacaagaacctcagcaggaggaatcttgtcaa acagcagctgaaggagataattggtgtgaacagaagctgaaagcttctaatggagacact cctacacatgaagacttgaccaagaacaaggagcggacatcagaaagcagaggactttca cgactattctcctcgtttctcaaaaggcccaaatctcaggtgtccgaggaagaaggcaaa gaagtagagtcagataaagaaaaaggtgaaggaggtcagaaagagatagaatttggaacc agtcttgatgaagagatcattttaaaggccccaattgcagctcctgaaccggaactcaaa acagacccatctttggatcttcattcattaagcagtgcagaaacacagcctgctcaggaa gaactcagagaagatccagattttgaaattaaggaaggagaaggacttgaagagtgctcc aaaatagaagtaaaagaagaaagccctcaatcaaaagcagaaacagaattaaaagcttcc caaaaaccaatcagaaaacacaggaacatgcactgcaaggtttctttgttggatgacaca gtttatgaatgtgttgtggagaaacatgctaagggacaagatttgcttaaacgagtatgt gagcatctcaatcttttggaagaagactattttggtctagccatttgggataacgcaacc tctaagacatggctggattccgccaaagaaataaaaaagcaggttcgtgcattattaggt tcttacaccatccagtctgaactgggagactacgacccagaactccatggcgtggattat gttagtgattttaaactggccccgaatcagaccaaggaacttgaagagaaggtcatggaa ctgcataagtcatacaggtccatgactccagctcaggctgacttggagtttcttgagaat gccaaaaagttgtctatgtatggagttgatcttcataaagcaaaggacttggaaggagta gatatcatcctaggtgtctgctctagtggccttctggtttacaaagataagctgagaatt aaccgcttcccttggcccaaagtgctgaagatttcttataaacgtagtagctttttcatc aagattcggcctggagagcaagagcagtatgaaagtaccatcggattcaaacttcccagt taccgagcagctaagaaattatggaaagtctgtgtagaacatcacacgtttttcagattg acatctacagacaccattcccaaaagcaaatttcttgcgctaggatccaaatttcgatac agtggccggactcaagctcagaccaggcaagctagtgctctaattgacaggcctgcccca cacttcgagcgtacagcaagtaaacgggcgtcccggagcctcgatggagcagcagctgtc gattcggcagaccgaagtcctcggcccacttctgcacctgccattactcagggtcaggtt gcagaaggtggcgtcctagatgcctctgctaaaaaaacagtggtccctaaagcacagaag gaaacagtgaaggctgaagtgaaaaaggaagacgagccacctgagcaagctgagccagag cccacagaagcatggaaggatttagacaagagtcaagaggagatcaaaaaacatcatgcc agcatcagtgagctgaaaaagaacttcatggagtctgtaccagaaccacggcctagtgaa tgggataaacgcttatccactcactcacccttccgaactcttaacatcaatgggcaaatc cccacaggagaaggaagtatgtattgtggggtacaaaggagtagcaatggactgcccaaa gccagcttttctgcattgaagtttaacaaatttaaacaaatcagaattaagcatatggca gtatctccttttactgtagcagaatatgacacaaaaagaaacacattggtgcagatctat gaaagcatacctgaaatggttaagaagagtataaatggcattcgcacagaggaggtggct gtcgtgacaaaggggccatctactaaccctgactctgaatgggagggtcccaagcattcg gtagttcctagtaaaagccagatgaccacctcgtcggagtctctgcaaagctttgccttt ggctccctctccataagcagcaaggagacagaagagaaggaggagggggcagctggctat cttgatattaaggagatgccaagaggcccaactgggggatgtataggagtggaggaacag gccagtgccttaaagttctcagtaacaccagcttcctgtcagctgcaacctggtgtaaaa aaggcagagagtagtgaagaacatgttacaccaggagagccacctggaaaacaaaatgga tcatttcttgactttcatgtgggtaaccagttccccaccctcattcgaagtttccagcct cccctggtgaagacacaaactgtcaccatctcagataatgccaatgctgtgaaaagtgaa atcccaaccaaagacgtccctattgtccacactgagaccaagaccatcacttatgaggct gcccagtgctgggatttcaggcatgagccaccgtgcctggccacaaatcttgttgtgggc gtttcatttatcttgagtaaacatctaagagtcaaattgctgggtcataggacaaatgac cagtatgaggaaacattgactgacgacaacagtggagacttggacccaggagtcttgctg acagctcaaactatcacatctgagaccccaagcagcaccaccacaactcaaattaccaag actgtaaaaggtgggatttcagagacacgtattgaaaagagaattgtgatcacaggagat gctgatattgaccatgatcaggtccttgtacaagccatcaaggaggcaaaggagcagcac ccagacatgtcagtgaccaaggtggtcgtccaccaggagaccgagattgctgatgagtga >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_5|395_aa MAELGPRVPGVTRRAQGAYLRSFDLSCRLSAAVGPGPARVGRSRGLEAGPGRGWGRAVET RAALEPPRADWDPDPRGAAAPERPDDGEMTAGSPEECGEVRRSPEGRVSRLGRRLGRRRR PRSPPEPLRVRARLRLRSPSGAFAALGALVVLVGMGIAVAGYWPHRAGAPGSRAANASSP QMSELRREGRGGGRAHGPHERLRLLGPVIMGVGLFVFICANTLLYENRDLETRRLRQGVL RAQALRPPDGPGWDCALLPSPGPRSPRAVGCAEPEIWDPSPRRGTSPVPSVRSLRSEPAN PRLGLPALLNSYPLKGPGLPPPWGPRTQTGHVIITVQPSGSCIEHSKSLDLGLGELLLGA PAARDCAHRSWPRLDRLSLGGYAKLGGGGDLGARV >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_5|1188_bp atggcagaactcgggccacgggtgcccggagtcaccaggcgcgcacagggcgcctacctg cgatcctttgacctgagctgccgcctttcagcggcggtggggccgggcccggcgcgggtc ggacggtcccgggggctggaggcggggccggggcggggctggggccgggccgtggagacc cgggcggctctggagcctccgcgcgcggactgggacccggacccgcgcggcgctgcggcg ccagagcgcccagacgacggcgagatgacggccgggagccccgaagaatgcggggaggtg cggaggagccccgagggccgcgtctctcgcttgggccgccgcctgggccgccgccggcgc ccgcgctccccgcccgagcctctgcgggtgcgggcgcggctgcggctgcgctcgccgtcg ggggcgttcgcggcgctgggggcgctcgtggtactggtgggtatgggcattgcagtggcc ggctactggccgcaccgggccggggccccagggtcccgggccgccaatgccagctcgccc cagatgagcgagctgcgacgcgagggtcgcggcgggggccgggctcacggcccgcacgag cggctgcggctcctcgggccggtgatcatgggcgtcggcctgttcgtgttcatctgcgcc aacacactgctgtatgagaaccgagacttggagacgcgacggctccgccagggggtgctg cgggcccaggcgctccggccccccgacggcccgggctgggactgcgccctccttcccagc cccggccctaggagtccccgagccgtaggctgcgcagagccagaaatctgggacccgtcc ccgcgtcggggtacttcacccgtcccgtcagtgcggagtctgcgttcagagcccgctaat cctcgcttggggttacctgccttgctcaacagctacccgctgaagggccccgggctgccc ccaccctggggtccacggacgcagactggccatgtgatcatcaccgtgcagccgtctggc tcctgcattgaacattccaagtctctggatctgggccttggggagctcctccttggggcc ccagcagctcgggactgtgctcaccgaagctggccacggctggaccgcctcagtcttggg ggctatgccaaattgggaggaggaggggacttgggggcccgggtctga >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_6|550_aa MGEVSGTSDCTDDQCRQVKKALEGGKAARGHRSKIKIRFFRPGGLGPGPAITAVAGMPRV YIGRLSYQARERDVERFFKGYGKILEVDLKNGYGFVEFDDLRDADDAVYELNGKDLCGER VIVEHARGPRRDGSYGSGRSGYGYRRSGRDKYGPPTRTEYRLIVENLSSRCSWQDLKDYM RQAGEVTYADAHKGRKNEGVIEFVSYSDMKRALEKLDGTEVNGRKIRLVEDKPGSRRRRS YSRSRSHSRSRSRSRHSRKSRSRSGSSKSSHSKSRSRSRSGSRSRSKSRSRSQSRSRSKK EKSRSPSKEKSRSRSHSAGKSRSKSKDQAEEKIQNNDNVGKPKSRSPSRHKSKSKSRSRS QERRVEEEKRGSVSRGRSQEKSLRQSRSRSRSKGGSRSRSRSRSKSKDKRKGRKRSREES RSRSRSRSKSERSRKRGSKRDSKAGSSKKKKKEDTDRSQSRSPSRSVSKEREHAKSESSQ REGRGESENAGTNQETRSRSRSNSKSKPNLPSESRSRSKSASKTRSRSKSRSRSASRSPS RSRSRSHSRS >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_6|1653_bp atgggtgaggttagtgggacaagtgattgtacagatgatcagtgcaggcaagtgaaaaaa gccttagaaggagggaaggctgccagaggccacaggagtaaaattaagattaggttcttt aggccaggggggctggggccggggccagccatcactgccgttgccgggatgccgcgggtg tacatcggccgcctgagctaccaggcccgggagcgcgatgtggagcgcttctttaagggc tacgggaagatcctggaggtggatctgaagaacggatatggttttgtggagtttgatgat ctgcgtgatgcagatgatgctgtttatgaactgaatggcaaagacctttgtggtgagcga gtaattgttgagcatgcccgcggcccacggcgagatggcagttacggttctggacgcagt ggatatggttatagaagaagtggccgagataaatatggccctcctactcgcacagagtac agacttattgtggagaatttgtcaagtcggtgcagctggcaagacctaaaggattatatg cgtcaggcaggagaagtgacttatgcagatgctcacaagggacgcaaaaatgaaggggtg attgaatttgtatcttattctgatatgaaaagagctttggaaaagttggatggaactgaa gtcaatgggagaaaaatcagattagttgaagacaagccaggttccagacgacgccggtcc tactccagaagccggagtcattcaaggtctcgctctcgaagcagacattcccgtaagagc agaagccgaagtggcagcagcaaaagcagtcattctaagagtagatctcggtccaggtcg ggctcccgctcccggagcaagagccggagccggagccagagtcggagccggagcaagaaa gagaaaagcaggagccccagcaaggaaaagagccgcagccgcagccatagcgctggcaag agccgcagcaagagcaaagaccaagctgaagagaagatccaaaacaatgacaatgtcggg aaacccaagagccggagtcctagcaggcataaaagtaagagcaaaagtcggagcaggagt caggagaggagagtggaggaggagaagcgagggagtgtgagcaggggcaggagccaggag aagagcctccgccagagtcggagccggagcaggagcaaagggggcagcaggagccggagc aggagccgcagcaagagcaaggacaagaggaagggcaggaagagaagcagagaggagagc cgcagtcgcagtcgcagccgcagcaagagtgagaggagcagaaagcgaggcagcaagcga gacagcaaggcgggcagcagcaagaagaagaagaaggaagacactgaccgctcccagtcc agatctccatcccgctccgtgtcaaaggagcgggaacatgccaagtctgaatccagccag agggaaggtcgaggagagagtgagaatgctggcaccaatcaggagacccggtccaggtcg agatccaattccaaatcgaaaccaaaccttccatcagaatcacgctccagatcaaagtca gcttcaaaaacccgatctcggtccaagtctagatccaggtctgcttccagatcgccctcc cgatctagatctaggtcccactcaaggtcctaa >gi568815597f:28887438_29215794|GENSCAN_predicted_peptide_7|304_aa XTWRTEAVFSEEALIQVPSDIPLQSAATLGVNPCTAYRMLMDFEQLQPGDSVIQNASNSG VGQAVIQIAAALGLRTINVVRDRPDIQKLSDRLKSLGAEHVITEEELRRPEMKNFFKDMP QPRLALNCVGGKSSTELLRQLASVAAFDSHGHVNPIVKHAGKGSRSHAPYENLMPENLRR NSFIPKPSPPLQSMEKLSSMKPVLVPTRRGGTMVTYGGMAKQPVVASVSLLIFKDLKLRG FWLSQWKKDHSPDQFKELILTLCDLIRRGQLTAPACSQVPLQDYQSALEASMKPFISSKQ ILTM >gi568815597f:28887438_29215794|GENSCAN_predicted_CDS_7|915_bp ngaacctggcggaccgaggctgtgttcagcgaggaagcactgatccaagttccgagtgac atccctcttcagagcgctgccaccctgggtgtcaatccctgcacagcctacaggatgttg atggacttcgagcaactgcagccaggggattctgtcatccagaatgcatccaacagcgga gtggggcaagcagtcatccagatcgccgcagccctgggcctaagaaccatcaatgtggtc cgagacagacctgatatccagaagctgagtgacagactgaagagtctgggggctgagcat gtcatcacagaagaggagctaagaaggcccgaaatgaaaaacttctttaaggacatgccc cagccacggcttgctctcaactgtgttggtgggaaaagctccacagagctgctgcggcag ttagcatcagtggcagcattcgattctcatgggcatgtgaaccctattgtgaagcacgca gggaagggatctaggtcccatgctccttatgagaatctaatgcctgagaatctgaggcgg aacagtttcatcccaaaaccatctcccccactccagtccatggaaaaattgtcttccatg aaaccggtcctggtgccaacaaggcgtggaggaaccatggtaacctatggggggatggcc aagcagcccgtcgtagcctctgtgagcctgctcatttttaaggatctcaaacttcgaggc ttttggttgtcccagtggaagaaggatcacagtccagaccagttcaaggagctgatcctc acactgtgcgatctcatccgccgaggccagctcacagcccctgcctgctcccaggtcccg ctgcaggactaccagtctgccttggaagcctccatgaagcccttcatatcttcaaagcag attctcaccatgtga