GENSCAN 1.0 Date run: 4-Nov-116 Time: 07:57:59 Sequence gi568815594r:42019959_42252677 : 232719 bp : 40.68% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2880 2972 93 0 0 86 69 122 0.903 8.26 1.02 Intr + 3344 3426 83 2 2 38 106 83 0.807 3.46 1.03 Intr + 4131 4251 121 1 1 61 64 64 0.528 -0.07 1.04 Term + 9262 9685 424 1 1 81 51 355 0.618 24.88 1.05 PlyA + 10941 10946 6 1.05 2.00 Prom + 17704 17743 40 -4.85 2.01 Init + 22808 22969 162 1 0 70 97 127 0.810 11.58 2.02 Intr + 23222 23370 149 1 2 69 116 106 0.942 9.61 2.03 Intr + 29419 29521 103 1 1 119 94 -9 0.173 2.06 2.04 Intr + 40233 40288 56 2 2 128 66 34 0.065 2.06 2.05 Intr + 47758 47861 104 1 2 86 75 45 0.063 1.90 2.06 Intr + 50568 50733 166 1 1 73 82 100 0.794 6.00 2.07 Intr + 53719 53826 108 1 0 39 92 81 0.769 2.08 2.08 Intr + 55699 55828 130 1 1 66 108 82 0.999 7.78 2.09 Intr + 58254 58367 114 2 0 110 69 97 0.995 9.72 2.10 Term + 65212 65262 51 0 0 101 37 52 0.406 -2.15 2.11 PlyA + 65575 65580 6 1.05 3.11 PlyA - 65595 65590 6 1.05 3.10 Term - 86010 85942 69 0 0 59 44 132 0.436 2.86 3.09 Intr - 93520 93503 18 1 0 94 103 25 0.036 0.49 3.08 Intr - 97777 97668 110 2 2 96 40 55 0.061 0.58 3.07 Intr - 100336 100096 241 1 1 81 69 126 0.135 6.20 3.06 Intr - 102978 102861 118 2 1 29 91 76 0.166 1.55 3.05 Intr - 112055 111935 121 0 1 89 27 60 0.000 -1.37 3.04 Intr - 124036 123470 567 2 0 114 68 425 0.995 34.92 3.03 Intr - 126325 126089 237 2 0 86 52 87 0.016 1.56 3.02 Intr - 132418 131699 720 2 0 119 81 517 0.034 44.57 3.01 Init - 133022 132644 379 2 1 83 -45 297 0.540 13.01 3.00 Prom - 135132 135093 40 -2.95 4.00 Prom + 136890 136929 40 -7.65 4.01 Sngl + 138488 138709 222 1 0 75 48 293 0.896 18.90 4.02 PlyA + 139423 139428 6 1.05 5.04 PlyA - 140994 140989 6 1.05 5.03 Term - 142780 142648 133 0 1 20 45 117 0.038 -2.82 5.02 Intr - 152602 152427 176 1 2 85 86 160 0.329 13.32 5.01 Init - 179569 179327 243 1 0 48 113 162 0.375 12.48 5.00 Prom - 179822 179783 40 -9.05 6.03 PlyA - 180203 180198 6 1.05 6.02 Term - 181341 181082 260 1 2 112 47 133 0.818 6.33 6.01 Init - 183824 183671 154 2 1 75 68 117 0.480 7.99 6.00 Prom - 184434 184395 40 -6.65 7.00 Prom + 186067 186106 40 -8.55 7.01 Init + 186135 186179 45 2 0 64 95 61 0.310 5.14 7.02 Term + 198925 199116 192 0 0 86 38 119 0.304 3.14 7.03 PlyA + 200843 200848 6 1.05 8.00 Prom + 210233 210272 40 -5.65 8.01 Init + 214220 214250 31 1 1 88 89 15 0.588 1.55 8.02 Intr + 218736 218940 205 1 1 70 55 206 0.769 12.84 8.03 Term + 225223 225475 253 1 1 33 33 210 0.464 4.13 8.04 PlyA + 226225 226230 6 1.05 9.03 PlyA - 226744 226739 6 1.05 9.02 Term - 227295 227267 29 1 2 94 48 41 0.538 -2.04 9.01 Intr - 229264 229088 177 2 0 68 87 153 0.364 12.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 112235 112287 53 1 2 53 86 73 0.834 0.39 S.002 Term - 132418 131490 929 2 2 119 42 570 0.907 46.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_1|240_aa XDLEQLRKIRRRSPHEDTESFTVYLRSDVEAKSLEVWGSPEALAREKKLRKEAEIEYREK GSINSATLINSADIYPIPAMNKKLCLVFEMYMQVRLLGARKANEEGSWKKFFWNSEKKEF LGRTGGSWFKIILFYIIFYGCLTGIFIGMIQVVLLTISEFKPTYQNHISGPPPGLTQISQ IQQTEIAFHPNDPKHCEAYVLNVVRFLEKYKDSAQKDDMIFENCDNVPSEPKERGDFNQE >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_1|723_bp nntgatctagaacaacttcgaaaaatcagacgacgaagtccccatgaagatactgagtct tttactgtatacttgagatcagatgtggaagcaaaatctttggaagtttggggaagccct gaagctcttgccagagagaaaaaattgcgtaaggaagcagaaatagaatacagagaaaaa gggtctattaactcagctacattaattaattcagcagatatttatccaatacccgctatg aacaagaagctgtgcttggtgtttgaaatgtatatgcaagtaagactcttaggagccagg aaagccaatgaggagggcagttggaagaaattcttctggaactcagagaagaaggagttt ttaggcaggactggtggcagttggtttaagatcattctattctacataatattttatggc tgcctgactggcatcttcatcggaatgatccaagtggtgctgctcaccatcagtgaattt aaacccacatatcagaaccacatatcaggacccccaccaggattaacacagatttctcag atccagcagactgaaattgcctttcatcctaatgatcccaagcactgtgaggcatatgtg ctgaacgtagttaggttcctggaaaagtacaaagattcagcccagaaggatgacatgatt tttgaaaattgtgacaatgtgcccagtgaacccaaagaacgaggagactttaatcaagaa tga >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_2|380_aa MPPDGEIPPSRDQQTPHTGELQLTSGRCSSGMKLPEEGTGSNVCCSAAPAVDTQDHNSSP AREQNWTENEFDEVGFRKWVITNSSKLKEHVPTQRKKAKNLEKSNGLNCFFKFLAWIYTG SASMFSEAIHSLSDTCNQGLLALGISKSVQTPDPSHPFSLGTPTSSHITMPYMVSELGCL NSPNLSECGGVCNPLYDSLGSLGVGTLLGMVSAFLIYTNTEALLGRSIQPEQVQRLTELL ENDPSVRFSSLPNGPLQHGSLLNQSVQIDKAIESECEQDKSHRAIHDVKATDLGLGKVRF KAEVDFDGRVVTRSYLEKQDFDQMLQEIQEVKTPEELETFMLKHGENIIDTLGAEVDRLE KELKTLFIIYLEGSSYADLP >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_2|1143_bp atgcctcctgatggggagatacctcccagcagggatcaacagacacctcatacaggagag ctccagctgacatctggcaggtgctcctctgggatgaagcttccagaggaaggaacaggc agcaatgtttgctgttctgcagcccctgctgttgatacccaggatcacaactcctcgcca gcaagggaacaaaactggacggagaatgagtttgacgaagtaggcttcagaaagtgggta ataacaaactcctccaagctaaaggagcatgttccaacccaacgcaagaaagctaagaac cttgaaaaaagcaatggattaaactgcttctttaaatttcttgcctggatttataccggt tcagcaagtatgttctcagaagctatacactcattatctgatacttgtaatcagggttta ctagcattgggcatcagtaagtctgttcaaacaccagatccttctcatccgttttctttg ggcaccccaacttcctcccacatcacaatgccgtacatggtcagtgaattggggtgtcta aatagtcccaatctgagtgagtgtgggggtgtgtgcaatccactgtatgacagcctaggt tctttgggtgtgggcaccttattaggcatggtctcagcattcctcatctacactaacaca gaagcactcttagggcggtccatccagccagaacaagtacaacggctcactgaactcctg gagaatgacccatcagtaagatttagctccttgccaaatgggcctcttcaacatggcagc ttgcttaatcaaagcgtgcaaattgataaggcaatagagagcgagtgtgaacaagacaaa agtcacagggcaattcatgatgttaaagccacagatctgggattaggtaaagtaagattt aaggcagaagtagattttgatgggcgagttgttacaagatcatatttggaaaaacaagat tttgaccaaatgttacaagaaattcaagaagtgaaaactcctgaagaactagagaccttt atgcttaaacatggagaaaatattattgatactttaggagctgaagtagatagacttgag aaggaactgaaaacactcttcattatctacctggaaggctcatcatatgcagacctacct tag >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_3|859_aa MVSCTFSGPLRETNENVKKFYALRAFMFRMSSEAAMLGESRTPKPRKHRATTRAKIFKRF FSEGSESNSRLVEELAVIHTYSDDPAPTTSPSSVQPREFGVMQGAPRARFGSRTPPAAAE ASSPHLGIGEAACQSGARAAAPRAGARRCQPQRQAAAAAATAQTHTLPHARTRADPAGRR RRHPRSPAPGGEGTCSEGPAPRRRMEEEMQPAEEGPSVPKIYKQRSPYSVLKTFPSKRPA LAKRYERPTLVELPHVRAPPPPPPPFAPHAAVSISSSEPPPQQFQAQSSYPPGPGRAAAA ASSSSPSCTPATSQGHLRTPAQPPPASPAASSSSSFAAVVRYGPGAAAAAGTGGTGSDSA SLELSAGGTSHTHMWRSQSTLPGSDTMVSVFGLMAQRRWQHRSLKQFEWGILGSWGTWPC GQDWLEKEGQVAVLLPRSEGNTAPKKSRMILDAFAQQCSRVLSLLNCGGKLLDSNHSQSM ISCVKQEGSSYNERQEHCHIGKGVHSQTSDNVDIEMQYMQRKQQTSAFLRVFTDSLQNYL LSGSFPTPNPSSASEYGHLADVDPLSTSPVHTLGGWTSPATSESHGHPSSSTLPEEEEEE DEEGYCPRCQELEQEVISLQQENEELRRKLESIPENISLDSTASLCKSRHLSREPPVKSD FPNPLQQALAGGASRPFSGAQQSIAYRVNSELEDGIRSPVPLSCEALEMDLTSLGSKQLL NNYPVYITSKQWDEAVNSSKKDGRRLLRYLIRFVFTTDELKYSCGLGKRKRSVQSGETGP ERRPLDPVKVTCLREFIRMHCTSNPDWWMPSEEQINKVFSDAVGHARQGRAKTDGDQNHL MTQFSEPVLIIEQRTTDEE >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_3|2580_bp atggtgtcatgcacgttctcggggcccctacgggaaacaaatgaaaacgtgaaaaagttc tacgccttgcgagcttttatgttccgcatgagctcagaggccgcgatgctcggggaaagc aggaccccaaagccccgtaaacaccgcgcgaccacccgggccaagatcttcaagaggttc ttttcagaaggatcggagagcaattcccgattggtagaagaacttgctgtaatacacacg tactctgacgaccccgccccaacgactagcccctcctctgtgcaaccccgagagtttggg gtcatgcagggggcgccacgagctcgtttcggaagccggaccccgcccgcagccgcagaa gcctcgagtccacatctgggcattggcgaggcagcctgtcaatcaggagctcgggcggca gccccccgcgcgggggctcggcgatgccagcctcagcgacaggcggcggcggcggcggcc acggcacagacacacaccctcccacacgcgcgcaccagggcagacccggcgggcaggcgg cggaggcaccctcggagcccggcgcccggcggggaggggacgtgctccgagggaccggcc ccgaggcgccggatggaggaagagatgcagccggcagaggaggggcccagcgtccccaaa atctacaagcagcgcagcccctacagcgtcctcaagacgttccccagcaagagaccggcg ctggccaagcgctacgagcgacccaccctggtggagctgccgcacgtgcgggcgcccccg ccgcccccgccgcccttcgcgccgcacgccgccgtctccatcagcagcagcgagccgccg ccgcagcagttccaggcgcagagctcctacccccccgggcccggccgggccgccgccgcc gcttcgtcgtcgtcgccgtcctgcacgcccgccacatcccagggccacttgaggactccg gcgcagccgccgcccgcgtcccccgccgcctcctcgtcgtcttcgttcgccgctgtcgtc aggtatggcccaggcgcggcggcggccgccggcaccggcggcacgggtagcgacagcgcc agcctggagctcagcgcaggagggacctcacatacacatatgtggaggtcccagtccaca cttccaggatctgacaccatggtctctgtctttggattgatggctcagagaagatggcag catagatctttaaagcagtttgagtggggaattcttggatcttggggtacttggccatgt ggacaggattggctggagaaggagggtcaggtggcggtcctgctgccaaggtctgagggt aatactgctcctaagaagagtcgaatgatcttggatgcctttgcccagcagtgcagtcga gttcttagcctcttaaattgtggaggaaaactcctggactccaaccattctcagtccatg atttcttgcgtaaagcaggaaggctcaagttacaacgaaagacaggagcactgtcacatt gggaaaggggtccacagtcagacctcagacaatgtagacatagagatgcagtatatgcaa aggaaacaacaaacttctgcctttttgagggttttcactgactctctacaaaattacctg ctctcgggaagctttccaactccaaacccctcgtcagccagtgaatatggccatctggcc gacgtggatcctctgtcaacctctcctgtgcatacattaggtggctggacttccccagca acgtccgaatcccatggccacccatcttcatctacactgccagaagaggaggaggaggag gacgaggaaggctattgtcctcgatgccaagagctggagcaggaggttatttcactgcaa caagaaaatgaagagctcagaaggaaattagagagcatcccagaaaatatttcacttgat tccacagcttccctgtgtaaatctaggcatctatccagagagcccccagtcaagagtgat tttccaaatcctttgcagcaggccttggctgggggtgcttcaagaccattttcaggggca cagcaaagcatcgcttacagggtgaactctgaacttgaggatggcatccgcagccccgtc cctttgagttgtgaggccttggaaatggatttgacctccttgggaagcaagcagctgttg aacaactatcctgtctacataacgagcaaacagtgggatgaggctgtaaattcttcaaag aaagatgggagacggctccttcgatacctcatcagatttgttttcacaaccgatgagctt aagtactcatgcggccttgggaaaaggaaaaggtcagtgcagtcaggagagacaggtccc gaaagacgccctctggatccagttaaagtaacatgcctccgagaattcattaggatgcat tgtacctccaaccccgattggtggatgccctcggaagagcagataaacaaagtgttcagc gacgctgtcggtcacgcccgacaggggcgggcgaaaacagatggagatcagaatcaccta atgacacagttctcagaacctgtcctcatcattgagcaacgcacgactgatgaagagtag >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_4|73_aa MQEQWQAFSPIGSGNGRLAGSGAQQTPCQIQRDGSQWQVCDGGKQQWWTASESSARAVTN MDQKRVRLQDLIE >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_4|222_bp atgcaggaacaatggcaagcctttagtccgatcgggagtggcaatgggcgccttgctgga tcaggagcacagcagacaccctgccagatccagagggatggaagtcagtggcaggtctgc gacggcggcaaacagcagtggtggacggcaagcgaaagctcagctcgagctgtaacaaac atggaccagaagagagtgcggttgcaagatttaatagagtga >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_5|183_aa MTQGFPRGTEKKRLEIEQKIRMVALRGEEPFGSPSQKTEEGIPESLPMEFQEAKWFDKDT HILAKTLLAYLDNGIQGQWLKGMLAHTENLIRDIKELKEQPHGSEGKRVPGRENSKLRDP ETRLLLEGSRNSQTGVDGVGLGSKSETPSKKKKEVPQTGWLKQQKFEQRQGRRENVMEDE DGD >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_5|552_bp atgacccagggcttccctagaggtacagaaaagaagaggttggagattgagcaaaaaata agaatggtggcactaagaggagaggaaccattcggttcaccttcacagaaaactgaagaa ggaattcctgaaagcttaccaatggagttccaggaagccaagtggtttgataaagacaca cacatacttgccaagaccttgttagcctacttggataatggaatacagggacagtggttg aagggcatgcttgctcacactgagaatttgatcagagacataaaggagctgaaggagcaa ccccatggatctgaggggaagagagttccaggcagagagaacagcaagctcagagaccct gaaacaaggctgctccttgagggctccagaaacagccagactggtgtggatggagtgggc ctcggcagcaagagtgaaactccatctaaaaaaaaaaaagaagtaccacaaactgggtgg ctgaaacaacagaaatttgaacagagacagggacgtagggagaatgtcatggaagatgaa gatggagattga >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_6|137_aa MEAMAKSRSSVCLGILVCSELLRALPGAAGYLSPSVGPGTTKDPMPASFGPGRHANIIIF NVCLGVQKTEKHGLKDLLTEKEKHHENLNWNWHGGQEFSCSEYLHGGGLGTSNGFQSKPK CQHYPTTEAMKVLTESG >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_6|414_bp atggaagcaatggctaagagcagatcttccgtgtgccttggcatcctggtgtgctctgaa ttgttaagagctttgcctggagctgcaggatacctcagcccttcagtaggacctgggaca accaaagaccccatgccggccagctttggcccagggcgccatgcaaatatcatcattttc aatgtctgccttggtgttcaaaagactgagaaacacgggttaaaagacttgttaacggag aaggagaaacatcatgaaaacctaaattggaactggcatggaggccaggaattcagttgt tctgagtatctccacggtggtggacttggaacctcaaatgggttccagtcaaaacccaag tgtcaacactacccaacgacagaggccatgaaggttctaacagagtctggctga >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_7|78_aa MAGYNILRAVSFNDQEQGQSRDYTGMNEDAICMVGPSEHLQARWRLFSSCPPTFLLRPPP LLMATQISYFTPISVVIS >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_7|237_bp atggcgggatacaatatcctgagagcagtttccttcaacgaccaggagcagggacaaagc agagattacacggggatgaacgaagatgcaatctgcatggttggcccctcagaacatctt caggctcggtggcgtcttttcagttcctgtccccccaccttcctcctgaggccacccccg ctgctcatggccacacaaatctcttatttcacccccatttctgttgtgatctcctag >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_8|162_aa METPEIEAWNDRQQEETKPEEKNGAGSSVKTLPGCKEAALIMQRMQLEELFDLTVVTEIC AASSDLEQEVILEFWKDSSSGHSKLKTFWKWFTILDAIKNTCESWEEVKISTLREVWKKL IPVLKDDFEGFKTLVEEVTADVVEIARELELEVEPEDETELL >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_8|489_bp atggagacgcccgagattgaagcctggaatgacagacagcaagaagaaacgaaaccagag gagaagaatggagctggatcgtctgtgaagaccctgccaggctgcaaagaagctgcactc ataatgcagagaatgcagctggaggagctctttgatttaactgttgttactgaaatctgt gcagcatcaagtgacttagagcaggaagttatcttagaattctggaaagattcaagctct ggacatagtaaattgaaaaccttctggaaatggtttaccattctcgatgccattaagaac acttgtgaatcatgggaggaagtcaaaatatcaacattaagagaagtttggaagaagtta attccagtcctcaaggatgactttgaggggttcaagactttagtggaggaagtaactgca gatgtggtggaaatagcaagagaactagaattagaagtggagcccgaagatgagactgaa ttgctgtaa >gi568815594r:42019959_42252677|GENSCAN_predicted_peptide_9|68_aa XNNTTSPLAVMQSHVTSPAQWTVNKRDIRPFHAKAVNAKDLTAPGNLAALVRVAEPQDER QPPAKCGY >gi568815594r:42019959_42252677|GENSCAN_predicted_CDS_9|207_bp nagaataacaccacaagccctcttgcagttatgcaaagtcacgtgactagccctgcacaa tggactgtgaataagagagatatacgtcccttccatgcaaaggctgtgaatgccaaggat ctcacagctccgggcaatctggcagccctggtgagggtagcagaaccacaagatgaaagg cagccaccagccaaatgtggctactag